Journal article
Estimating measurement uncertainty for information retrieval effectiveness metrics
A Moffat, F Scholer, Z Yang
Journal of Data and Information Quality | ASSOC COMPUTING MACHINERY | Published : 2018
DOI: 10.1145/3239572
Abstract
One typicalway of building test collections for offline measurement of information retrieval systems is to pool the ranked outputs of different systems down to some chosen depth d and then form relevance judgments for those documents only. Non-pooled documents.ones that did not appear in the top-d sets of any of the contributing systems.are then deemed to be non-relevant for the purposes of evaluating the relative behavior of the systems. In this article, we use RBP-derived residuals to re-examine the reliability of that process. By fitting the RBP parameter φ to maximize similarity between AP- and NDCG-induced system rankings, on the one hand, and RBP-induced rankings, on the other, an esti..
View full abstractGrants
Awarded by Google
Funding Acknowledgements
The authors thank the referees for their support and their helpful feedback and suggestions. This work was supported by the Australian Research Council (DP180102687) and by a Google Faculty Research Grant.