Journal article

Using query logs to establish vocabularies in distributed information retrieval

Milad Shokouhi, Justin Zobel, Saied Tahaghoghi, Falk Scholer

INFORMATION PROCESSING & MANAGEMENT | ELSEVIER SCI LTD | Published : 2007

Abstract

Users of search engines express their needs as queries, typically consisting of a small number of terms. The resulting search engine query logs are valuable resources that can be used to predict how people interact with the search system. In this paper, we introduce two novel applications of query logs, in the context of distributed information retrieval. First, we use query log terms to guide sampling from uncooperative distributed collections. We show that while our sampling strategy is at least as efficient as current methods, it consistently performs better. Second, we propose and evaluate a pruning strategy that uses query log information to eliminate terms. Our experiments show that ou..

View full abstract

University of Melbourne Researchers