DSpace Repository

Query Enrichment for Web-Query Classification Query Enrichment for Web-Query Classification

Show simple item record

dc.contributor.author Shen Dou
dc.contributor.author Pan Rong
dc.contributor.author Kong Hong
dc.contributor.author Research Microsoft
dc.contributor.author Asia
dc.contributor.author Junfeng Jeffrey
dc.contributor.author Wu Kangheng
dc.contributor.author Yin Jie
dc.contributor.author Yang Qiang
dc.contributor.author Kong Hong
dc.contributor.author Shen D
dc.contributor.author Pan R
dc.contributor.author Pan J J
dc.contributor.author Wu K
dc.contributor.author Yin J
dc.contributor.author Yang Q
dc.date.accessioned 2018-01-22T17:23:41Z
dc.date.available 2018-01-22T17:23:41Z
dc.date.issued 2006
dc.identifier.uri http://hdl.handle.net/123456789/6867
dc.description.abstract Web-search queries are typically short and ambiguous. To classify these queries into certain target categories is a difficult but important problem. In this article, we present a new technique called query enrichment, which takes a short query and maps it to intermediate objects. Based on the collected intermediate objects, the query is then mapped to target categories. To build the necessary mapping functions, we use an ensemble of search engines to produce an enrichment of the queries. Our technique was applied to the ACM Knowledge Discovery and Data Mining competition (ACM KDDCUP) in 2005, where we won the championship on all three evaluation metrics (precision, F1 measure, which combines precision and recall, and creativity, which is judged by the organizers) among a total of 33 teams worldwide. In this article, we show that, despite the difficulty of an abundance of ambiguous queries and lack of training data, our query-enrichment technique can solve the problem satisfactorily through a two-phase classification framework. We present a detailed description of our algorithm and experimental evaluation. Our best result for F1 and precision is 42.4% and 44.4%, respectively, which is 9.6% and 24.3% higher than those from the runner-ups, respectively.
dc.format application/pdf
dc.title Query Enrichment for Web-Query Classification Query Enrichment for Web-Query Classification
dc.type journal-article
dc.source.volume 24
dc.source.issue 3
dc.source.journal ACM Transactions on Information Systems


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account