Conference Proceedings
Learning the countability of English nouns from corpus data
T Baldwin, F Bond
Proceedings of the Annual Meeting of the Association for Computational Linguistics | ASSOC COMPUTATIONAL LINGUISTICS | Published: 2003
Open access
Abstract
This paper describes a method for learning the countability preferences of English nouns from raw text corpora. The method maps the corpus-attested lexico-syntactic properties of each noun onto a feature vector, and uses a suite of memory-based classifiers to predict membership in 4 countability classes. We were able to assign countability to English nouns with a precision of 94.6%.
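The pipeline outlined in the abstract (corpus counts mapped to a feature vector, then a suite of memory-based classifiers) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' implementation: the feature names, the class labels, the toy counts, the use of plain k-nearest-neighbour with Euclidean distance as the memory-based learner, and the choice of one binary classifier per class.

```python
from collections import Counter
from math import sqrt

# Assumed labels for the 4 countability classes; the abstract does not name them.
CLASSES = ("countable", "uncountable", "bipartite", "plural_only")

# Hypothetical lexico-syntactic features: how often a noun is attested
# in each construction in a corpus.
FEATURES = ("a_N", "much_N", "many_Ns", "pair_of_Ns", "bare_plural")


def to_vector(counts):
    """Map raw construction counts onto a relative-frequency feature vector."""
    total = sum(counts.get(f, 0) for f in FEATURES)
    if total == 0:
        return [0.0] * len(FEATURES)
    return [counts.get(f, 0) / total for f in FEATURES]


def distance(u, v):
    """Euclidean distance between two feature vectors."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def knn_predict(train, query, k=1):
    """Memory-based prediction: majority vote among the k stored examples
    closest to the query. `train` is a list of (vector, bool) pairs."""
    nearest = sorted(train, key=lambda ex: distance(ex[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes[True] > votes[False]


def classify(training_nouns, counts, k=1):
    """Run one binary classifier per class and return the set of classes
    the noun is predicted to belong to (a noun may belong to several)."""
    query = to_vector(counts)
    predicted = set()
    for cls in CLASSES:
        train = [(to_vector(c), cls in labels) for c, labels in training_nouns]
        if knn_predict(train, query, k):
            predicted.add(cls)
    return predicted


# Invented toy training data, for illustration only (not from the paper).
TOY_TRAINING = [
    ({"a_N": 8, "many_Ns": 5, "bare_plural": 7}, {"countable"}),
    ({"much_N": 9, "a_N": 1}, {"uncountable"}),
    ({"pair_of_Ns": 9, "bare_plural": 6}, {"bipartite"}),
    ({"bare_plural": 10, "many_Ns": 2}, {"plural_only"}),
]

if __name__ == "__main__":
    unseen = {"a_N": 4, "many_Ns": 3, "bare_plural": 3}
    print(classify(TOY_TRAINING, unseen))
```

Using independent binary classifiers, rather than a single four-way classifier, lets a noun receive more than one class, which fits the abstract's framing of countability as a preference rather than a single fixed label.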