Journal article

Disambiguation in the biomedical domain: The role of ambiguity type

Mark Stevenson, Yikun Guo

Journal of Biomedical Informatics | ACADEMIC PRESS INC ELSEVIER SCIENCE | Published : 2010

Abstract

Word Sense Disambiguation (WSD), the automatic identification of the meanings of ambiguous terms in a document, is an important stage in text processing. We describe a WSD system that has been developed specifically for the types of ambiguities found in biomedical documents. This system uses a range of knowledge sources. It employs both linguistic features, such as local collocations, and features derived from domain-specific knowledge sources, the Unified Medical Language System (UMLS) and Medical Subject Headings (MeSH). This system is applied to three types of ambiguities found in Medline abstracts: ambiguous terms, abbreviations with multiple expansions and names that are ambiguous betwe..

View full abstract

University of Melbourne Researchers

Grants

Awarded by UK Engineering and Physical Sciences Research Council


Awarded by Engineering and Physical Sciences Research Council


Funding Acknowledgements

The authors are grateful to Abdulaziz Al Amri for his assistance in creating the corpus of ambiguous abbreviations and David Martinez for providing the initial version of the WSD system. The work described in this paper was funded by the UK Engineering and Physical Sciences Research Council (grants EP/E004350/1 and EP/D069548/1).