Determining the difficulty of Word Sense Disambiguation
Bridget T McInnes, Mark Stevenson
Journal of Biomedical Informatics | ACADEMIC PRESS INC ELSEVIER SCIENCE | Published : 2014
Automatic processing of biomedical documents is made difficult by the fact that many of the terms they contain are ambiguous. Word Sense Disambiguation (WSD) systems attempt to resolve these ambiguities and identify the correct meaning. However, the published literature on WSD systems for biomedical documents report considerable differences in performance for different terms. The development of WSD systems is often expensive with respect to acquiring the necessary training data. It would therefore be useful to be able to predict in advance which terms WSD systems are likely to perform well or badly on. This paper explores various methods for estimating the performance of WSD systems on a wid..View full abstract
Awarded by UK Engineering and Physical Sciences Research Council
Awarded by Engineering and Physical Sciences Research Council
Stevenson is grateful to the UK Engineering and Physical Sciences Research Council for supporting this work (Grant EP/J008427/1).