Journal article
The role of corpus size and syntax in deriving lexico-semantic representations for a wide range of concepts
S De Deyne, S Verheyen, G Storms
Quarterly Journal of Experimental Psychology | SAGE PUBLICATIONS LTD | Published : 2015
Abstract
One of the most significant recent advances in the study of semantic processing is the advent of models based on text and other corpora. In this study, we address what impact both the quantitative and qualitative properties of corpora have on mental representations derived from them. More precisely, we evaluate models with different linguistic and mental constraints on their ability to predict semantic relatedness between items from a vast range of domains and categories. We find that a model based on syntactic dependency relations captures significantly less of the variability for all kinds of words, regardless of the semantic relation between them or their abstractness. The largest differe..
View full abstractGrants
Awarded by Fonds Wetenschappelijk Onderzoek
Funding Acknowledgements
Our gratitude goes to Kris Heylen, Dirk Speelman, and Dirk Geeraerts for making the LeNC corpus available, to Yves Peirsman for collaboration during the early stages of this project, and to Marc Brysbaert for his suggestions on lexicon size. This work was supported by Research Grant G.0436.13 from the Research Foundation - Flanders (FWO) to the first author and by the interdisciplinary research project IDO/07/002 awarded to Dirk Speelman, Dirk Geeraerts, and Gert Storms. Steven Verheyen is a postdoctoral fellow at the Research Foundation - Flanders.