Journal article

Multi-field query expansion is effective for biomedical dataset retrieval

Mohamed Reda Bouadjenek, Karin Verspoor

DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | OXFORD UNIV PRESS | Published : 2017

Abstract

In the context of the bioCADDIE challenge addressing information retrieval of biomedical datasets, we propose a method for retrieval of biomedical data sets with heterogenous schemas through query reformulation. In particular, the method proposed transforms the initial query into a multi-field query that is then enriched with terms that are likely to occur in the relevant datasets. We compare and evaluate two query expansion strategies, one based on the Rocchio method and another based on a biomedical lexicon. We then perform a comprehensive comparative evaluation of our method on the bioCADDIE dataset collection for biomedical retrieval. We demonstrate the effectiveness of our multi-field q..

View full abstract

Grants

Awarded by Australian Research Council


Awarded by NIH


Funding Acknowledgements

The project received funding from the Australian Research Council through a Discovery Project grant, DP150101550. The bioCADDIE Dataset Retrieval Challenge was supported by NIH grant U24AI117966.