Conference Proceedings

Deep Lexical Acquisition of Type Properties in Low-resource Languages: A Case Study in Wambaya

J Nicholson, R Nordlinger, TJ Baldwin

Proceedings of the 26th Pacific Asia Conference on Language | Universitas Indonesia | Published : 2012

Abstract

We present a case study on applying common methods for the prediction of lexical properties to a low-resource language, namely Wambaya. Leveraging a small corpus leads to a typical high-precision, low-recall system; using the Web as a corpus has no utility for this language, but a machine learning approach seems to utilise the available resources most effectively. This motivates a semi-supervised approach to lexicon extension. © 2012 The PACLIC.
