Conference Proceedings

Parallel corpora for the biomedical domain

A Névéol, AJ Yepes, M Neves, K Verspoor

LREC 2018 - 11th International Conference on Language Resources and Evaluation | Published: 2019

Abstract

A vast amount of biomedical information is available in the form of scientific literature and government-authored patient information documents. While English is the most widely used language in many of these sources, there is a need to provide access to health information in languages other than English. Parallel corpora can be leveraged to implement cross-lingual information retrieval or machine translation tools. Herein, we review the extent of parallel corpus coverage in the biomedical domain. Specifically, we perform a scoping review of existing resources and we describe the recent dev..