Innovative Basistechnologien für eine skalierbare, intelligente Internet-Suchmaschine
dc.contributor.author | Lindemann, Christoph | de |
dc.date.accessioned | 2004-12-03T16:07:06Z | |
dc.date.available | 2004-12-03T16:07:06Z | |
dc.date.created | 2002 | de |
dc.date.issued | 2002-10-17 | de |
dc.description.abstract | As a consequence of the tremendous size, explosive growth, and rapidly changing nature of the Web, a major challenge for search engine design and implementation lies in providing means for scalability at large. In this paper, we introduce WebSearchBench: a parallel software architecture for Internet search engines running on commodity-of-the-shelf components (a Linux cluster comprising of Intel Pentium IV Xeon dual-processor PCs connected by a Gigabit Ethernet). The presented performance study shows that WebSearchBench running on 8 nodes of a cluster can crawl and index 40 million Web pages per day. Another 4 nodes of the cluster can manage an index of 200 million pages and answer more than 25 million search queries per day. The repository for storing 200 million pages requires 7 additional nodes. Furthermore, our study indicates that WebSearchBench running on 32 nodes should be able to manage an index of 2 billion pages and answer about 120 million search queries per day. | en |
dc.format.extent | 669795 bytes | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | http://hdl.handle.net/2003/2253 | |
dc.identifier.uri | http://dx.doi.org/10.17877/DE290R-14753 | |
dc.language.iso | de | de |
dc.publisher | Universität Dortmund | de |
dc.relation.ispartof | 06. InetBib-Tagung vom 18. bis 20. September 2002 in Göttingen | de |
dc.subject.ddc | 020 | de |
dc.title | Innovative Basistechnologien für eine skalierbare, intelligente Internet-Suchmaschine | de |
dc.type | Text | de |
dc.type.publicationtype | conferenceObject | |
dcterms.accessRights | open access |
Files
Original bundle
1 - 1 of 1