DSpace Repository

Query-Based Sampling of Text Databases

Show simple item record

dc.contributor.author Connell Jamie Callan/Margaret
dc.date.accessioned 2018-01-22T17:23:17Z
dc.date.available 2018-01-22T17:23:17Z
dc.identifier.uri http://hdl.handle.net/123456789/6839
dc.description.abstract The proliferation of searchable text databases on corporate networks and the Internet causes a database selection problem for many people. Algorithms such as gGlOSS and CORI can automatically select which text databases to search for a given information need, but only if given a set of resource descriptions that accurately represent the contents of each database. The existing techniques for acquiring resource descriptions have significant limitations when used in wide-area networks controlled by many parties. This paper presents query-based sampling, a new technique for acquiring accurate resource descriptions. Query-based sampling does not require the cooperation of resource providers, nor does it require that resource providers use a particular search engine or representation technique. An extensive set of experimental results demonstrates that accurate resource descriptions are created, that computation and communication costs are reasonable, and that the resource descriptions do in fact enable accurate automatic database selection.
dc.format application/pdf
dc.title Query-Based Sampling of Text Databases
dc.type journal-article
dc.source.volume 19
dc.source.issue 2
dc.source.journal ACM Transactions on Information Systems


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account