Multi-field query expansion is effective for biomedical dataset retrieval
Mohamed Reda Bouadjenek, Karin Verspoor
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION | OXFORD UNIV PRESS | Published : 2017
In the context of the bioCADDIE challenge addressing information retrieval of biomedical datasets, we propose a method for retrieval of biomedical data sets with heterogenous schemas through query reformulation. In particular, the method proposed transforms the initial query into a multi-field query that is then enriched with terms that are likely to occur in the relevant datasets. We compare and evaluate two query expansion strategies, one based on the Rocchio method and another based on a biomedical lexicon. We then perform a comprehensive comparative evaluation of our method on the bioCADDIE dataset collection for biomedical retrieval. We demonstrate the effectiveness of our multi-field q..View full abstract
Related Projects (1)
Awarded by Australian Research Council
Awarded by NIH
The project received funding from the Australian Research Council through a Discovery Project grant, DP150101550. The bioCADDIE Dataset Retrieval Challenge was supported by NIH grant U24AI117966.