Conference Proceedings
The construction and evaluation of word space models
Y Peirsman, S De Deyne, K Heylen, D Geeraerts
Proceedings of the 6th International Conference on Language Resources and Evaluation Lrec 2008 | EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA | Published : 2008
Abstract
Semantic similarity is a key issue in many computational tasks. This paper goes into the development and evaluation of two common ways of automatically calculating the semantic similarity between two words. On the one hand, such methods may depend on a manually constructed thesaurus like (Euro)WordNet. Their performance is often evaluated on the basis of a very restricted set of human similarity ratings. On the other hand, corpus-based methods rely on the distribution of two words in a corpus to determine their similarity. Their performance is generally quantified through a comparison with the judgements of the first type of approach. This paper introduces a new Gold Standard of more than 5,..
View full abstract