DSpace Repository

Performance and Cost Tradeoffs in Web Search

Show simple item record

dc.contributor.author Craswell Nick
dc.contributor.author Crimmins Francis
dc.contributor.author Hawking David
dc.contributor.author Moffat Alistair
dc.date.accessioned 2018-01-22T17:25:47Z
dc.date.available 2018-01-22T17:25:47Z
dc.date.issued 2004
dc.identifier.uri http://hdl.handle.net/123456789/7015
dc.description.abstract Web search engines crawl the web to fetch the data that they index. In this paper we reexamine that need, and evaluate the network costs associated with data acquisition , and alternative ways in which a search service might be supported. As a concrete example, we make use of the Research Finder search service provided at http://rf.panopticsearch.com, and information derived from its crawl and query logs. Based upon an analysis of the Research Finder system we introduce a hybrid arrangement, in which queries are evaluated partially by reference to a centrally maintained index representing a subset of the collection, and partially by referring them on to the local search services maintained by the balance of the collection. We also examine various ways in which crawling costs can be reduced.
dc.format application/pdf
dc.title Performance and Cost Tradeoffs in Web Search
dc.type generic


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account