Conference Proceedings

Automatic detection of multilingual dictionaries on the web

G Grigonyte, T Baldwin

Association for Computing Machinery (ACM) | Published : 2014

Abstract

This paper presents an approach to query construction to detect multilingual dictionaries for predetermined language combinations on the web, based on the identification of terms which are likely to occur in bilingual dictionaries but not in general web documents. We use eight target languages for our case study, and train our method on pre-identified multilingual dictionaries and theWikipedia dump for each of our languages. © 2014 Association for Computational Linguistics.

Grants

Funding Acknowledgements

We wish to thank the anonymous reviewers for their valuable comments, and the Panlex developers for assistance with the dictionaries and experimental design. This research was supported by funding from the Group of Eight and the Australian Research Council.