Search engine
From 开放百科 - 灰狐 (Open Encyclopedia - Huihu)
Revision as of 10:32, 19 November 2006 (Sunday)
Search engines
- List of search engines
- Google - http://www.google.com
- Yahoo - http://search.yahoo.com
- Autonomy - http://www.autonomy.com.cn
- WiseNut - http://www.wisenut.com/
- MSN Search - http://search.msn.com
- A9 - http://www.a9.com
- Baidu - http://www.baidu.com
- Koders - Source Code Search Engine http://www.koders.com/
- Ask Jeeves - http://www.ask.com/
- Teoma - http://www.teoma.com/
- Gigablast - http://www.gigablast.com/
- Creative Commons Search - http://search.creativecommons.org/
- Scrub The Web - http://www.scrubtheweb.com/
- FactBites.com - http://www.factbites.com
- Dumbfind - http://www.dumbfind.com/
- Entireweb - http://www.entireweb.com/
- Objects Search - http://www.objectssearch.com/
- Pipeline - http://www.pipeline-search.com/
- Mojeek - http://www.mojeek.com/
- Ulysseek - http://www.ulysseek.com/
- SearchHippo - http://www.searchhippo.com/
- Wotbox - http://www.wotbox.com/
- Myriad Search (metasearch engine) - http://www.myriadsearch.com/
- Majestic-12: Distributed Search Engine - a collaborative search engine project
Open source projects
- mnoGoSearch - http://mnogosearch.org/
- Lucene Search Engine (no crawler) - http://lucene.apache.org
- CLucene is a C++ port of Lucene - http://clucene.sourceforge.net
- Nutch (open source web-scalable search engine) - http://lucene.apache.org/nutch/
- ASPSeek - http://www.aspseek.org/
- DataparkSearch - http://www.dataparksearch.org/
- JXTA Search - http://search.jxta.org/
- Managing Gigabytes - http://www.cs.mu.oz.au/mg/
- Namazu(a Full-Text Search Engine) - http://www.namazu.org/index.html.en
- OpenWebSpider - http://www.openwebspider.org/
- OpenFTS - http://openfts.sourceforge.net/
- Swish-e - http://www.swish-e.org/
- SWISH++ - http://swishplusplus.sourceforge.net/
- Zebra - http://indexdata.dk/zebra/
- Webglimpse - http://webglimpse.net/
- Xapian - http://www.xapian.org/
- XQEngine(XML Query Engine) - http://xqengine.sourceforge.net/
- Tesseract OCR - http://sourceforge.net/projects/tesseract-ocr
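- 天网千帆 (Tianwang Qianfan) FTP file search engine - http://project.mytianwang.cn/
Chinese-language resources
- Search Engine Research (搜索引擎研究) - http://www.wespoke.com/
Related articles
Related links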
- Search Engine Watch - http://searchenginewatch.com/
- Search Tools - http://www.searchtools.com/
- The Web Robots Pages - http://www.robotstxt.org/wc/robots.html (useful rule definitions, including the definition of the Robots protocol)
- Guidelines for Robot Writers - http://www.robotstxt.org/wc/guidelines.html
- SearchTools.com: All About Search Indexing Robots and Spiders - http://www.searchtools.com/robots/
- Chinese Search Engine Technology Revealed: Web Spiders - http://www.magicpower.com.cn/Articles/showarticle.asp?article_id=183
- Chinese Search Engine Technology Revealed: Chinese Word Segmentation - http://www.magicpower.com.cn/Articles/showarticle.asp?article_id=184
- Chinese Search Engine Technology Revealed: Ranking Techniques - http://www.magicpower.com.cn/Articles/showarticle.asp?article_id=185
- Chinese Search Engine Technology Revealed: System Architecture - http://www.magicpower.com.cn/Articles/showarticle.asp?article_id=186