Journal article

Chinese OOV Translation and Post-translation Query Expansion in Chinese-English Cross-lingual Information Retrieval

Y Zhang, P Vines, J Zobel

ACM Transactions on Asian Language Information Processing | Published : 2005

Abstract

Cross-lingual information retrieval allows users to query mixed-language collections or to probe for documents written in an unfamiliar language. A major difficulty for cross-lingual information retrieval is the detection and translation of out-of-vocabulary (OOV) terms; for OOV terms in Chinese, another difficulty is segmentation. At NTCIR-4, we explored methods for translation and disambiguation for OOV terms when using a Chinese query on an English collection. We have developed a new segmentation-free technique for automatic translation of Chinese OOV terms using the web. We have also investigated the effects of distance factor and window size when using a hidden Markov model to provide d..

View full abstract

University of Melbourne Researchers