Disambiguation in the biomedical domain: The role of ambiguity type
Mark Stevenson, Yikun Guo
Journal of Biomedical Informatics | ACADEMIC PRESS INC ELSEVIER SCIENCE | Published: 2010
Word Sense Disambiguation (WSD), the automatic identification of the meanings of ambiguous terms in a document, is an important stage in text processing. We describe a WSD system that has been developed specifically for the types of ambiguities found in biomedical documents. This system uses a range of knowledge sources. It employs both linguistic features, such as local collocations, and features derived from domain-specific knowledge sources, the Unified Medical Language System (UMLS) and Medical Subject Headings (MeSH). This system is applied to three types of ambiguities found in Medline abstracts: ambiguous terms, abbreviations with multiple expansions and names that are ambiguous betwe...
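As an illustration of the "local collocations" mentioned in the abstract, the following is a minimal sketch of how words at fixed offsets around an ambiguous term might be turned into features. It is not the authors' implementation: the function name `collocation_features`, the window size, the `<PAD>` placeholder, the feature key format and the example sentence are all assumptions made for this sketch.

```python
from typing import Dict, List


def collocation_features(tokens: List[str], target_index: int, window: int = 2) -> Dict[str, str]:
    """Return the words at fixed offsets around the ambiguous token.

    Hypothetical helper for illustration only; the paper's actual
    feature set is not reproduced here.
    """
    features: Dict[str, str] = {}
    for offset in range(-window, window + 1):
        if offset == 0:
            continue  # skip the ambiguous term itself
        position = target_index + offset
        if 0 <= position < len(tokens):
            word = tokens[position].lower()
        else:
            word = "<PAD>"  # assumed placeholder when the window runs past the text
        features[f"w[{offset:+d}]"] = word
    return features


# Invented example sentence; "cold" stands in for an ambiguous term.
tokens = "The patient developed a cold after surgery".split()
features = collocation_features(tokens, tokens.index("cold"))
print(features)
```

Features of this kind would typically be combined with others, such as the UMLS and MeSH derived features the abstract refers to, before being passed to a classifier.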
Awarded by UK Engineering and Physical Sciences Research Council
The authors are grateful to Abdulaziz Al Amri for his assistance in creating the corpus of ambiguous abbreviations and David Martinez for providing the initial version of the WSD system. The work described in this paper was funded by the UK Engineering and Physical Sciences Research Council (grants EP/E004350/1 and EP/D069548/1).