Estimating measurement uncertainty for information retrieval effectiveness metrics

A Moffat; F Scholer; Z Yang

Journal article

Estimating measurement uncertainty for information retrieval effectiveness metrics

A Moffat, F Scholer, Z Yang

Journal of Data and Information Quality | ASSOC COMPUTING MACHINERY | Published : 2018

DOI: 10.1145/3239572

Abstract

One typicalway of building test collections for offline measurement of information retrieval systems is to pool the ranked outputs of different systems down to some chosen depth d and then form relevance judgments for those documents only. Non-pooled documents.ones that did not appear in the top-d sets of any of the contributing systems.are then deemed to be non-relevant for the purposes of evaluating the relative behavior of the systems. In this article, we use RBP-derived residuals to re-examine the reliability of that process. By fitting the RBP parameter φ to maximize similarity between AP- and NDCG-induced system rankings, on the one hand, and RBP-induced rankings, on the other, an esti..

View full abstract

University of Melbourne Researchers

Alistair Moffat Author

Alicia Yang Author

Grants

Awarded by Google

Funding Acknowledgements

The authors thank the referees for their support and their helpful feedback and suggestions. This work was supported by the Australian Research Council (DP180102687) and by a Google Faculty Research Grant.