Conference Proceedings

Learning the countability of English nouns from corpus data

T Baldwin, F Bond

Proceedings of the Annual Meeting of the Association for Computational Linguistics | Association for Computational Linguistics | Published: 2003

Open access

Abstract

This paper describes a method for learning the countability preferences of English nouns from raw text corpora. The method maps the corpus-attested lexico-syntactic properties of each noun onto a feature vector, and uses a suite of memory-based classifiers to predict membership in 4 countability classes. We were able to assign countability to English nouns with a precision of 94.6%.
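As a rough illustration of the pipeline the abstract describes (and not the authors' implementation), the sketch below represents each noun as a feature vector and trains one memory-based (k-nearest-neighbour) binary classifier per countability class. Everything specific here is an assumption made for the example: the three features, the toy values, the two class labels shown, the Euclidean distance, k = 3, and the function names.

```python
import math
from collections import Counter

# Hypothetical features per noun (illustrative values, not corpus data):
# [rate with "a/an", rate with "much", rate of plural forms]
TRAIN = {
    "dog":       ([0.40, 0.00, 0.35], {"countable"}),
    "car":       ([0.35, 0.00, 0.30], {"countable"}),
    "book":      ([0.38, 0.01, 0.40], {"countable"}),
    "rice":      ([0.01, 0.20, 0.00], {"uncountable"}),
    "water":     ([0.02, 0.25, 0.03], {"uncountable"}),
    "furniture": ([0.00, 0.18, 0.00], {"uncountable"}),
    "cake":      ([0.20, 0.10, 0.20], {"countable", "uncountable"}),
}

# Only two classes are shown; the labels are placeholders for the example.
CLASSES = ["countable", "uncountable"]


def knn_predict(examples, query, k=3):
    """Majority vote over the k stored examples closest to the query."""
    ranked = sorted(examples, key=lambda ex: math.dist(ex[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def predict_classes(query, k=3):
    """Run one binary memory-based classifier per class; a noun may get several."""
    predicted = set()
    for cls in CLASSES:
        # Relabel the stored examples as member / non-member of this class.
        examples = [(vec, cls in labels) for vec, labels in TRAIN.values()]
        if knn_predict(examples, query, k):
            predicted.add(cls)
    return predicted


if __name__ == "__main__":
    print(predict_classes([0.36, 0.00, 0.33]))
    print(predict_classes([0.01, 0.22, 0.01]))
```

Using a separate binary classifier for each class lets a noun belong to more than one class at once, which a single multi-class vote would not allow.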
