Journal article

The effect of pooling and evaluation depth on IR metrics

X Lu, A Moffat, JS Culpepper

Information Retrieval Journal | SPRINGER | Published : 2016

Abstract

Batch IR evaluations are usually performed in a framework that consists of a document collection, a set of queries, a set of relevance judgments, and one or more effectiveness metrics. A large number of evaluation metrics have been proposed, with two primary families having emerged: recall-based metrics, and utility-based metrics. In both families, the pragmatics of forming judgments mean that it is usual to evaluate the metric to some chosen depth such as k= 20 or k= 100 , without necessarily fully considering the ramifications associated with that choice. Our aim is this paper is to explore the relative risks arising with fixed-depth evaluation in the two families, and document the complex..

View full abstract

University of Melbourne Researchers

Grants

Awarded by Australian Research Council


Awarded by Australian Research Council DECRA Research Fellowship


Funding Acknowledgements

This work was supported by the Australian Research Council's Discovery Projects Scheme (DP140101587). Shane Culpepper is the recipient of an Australian Research Council DECRA Research Fellowship (DE140100275).