Conference Proceedings

Japanese semcor: A sense-tagged corpus of Japanese

F Bond, T Baldwin, R Fothergill, K Uchimoto

Gwc 2012 6th International Global Wordnet Conference Proceedings | Published : 2012

Abstract

In this paper we describe the creation of the Japanese SemCor (JSEMCOR) sensetagged corpus of Japanese. The corpus is a translation of the English SEMCOR, with senses projected across from English. The final corpus consists of 14,169 sentences with 150,555 content words of which 58,265 are sense tagged. The corpus is one of the corpora used to provide sense frequency data for the Japanese Wordnet. © Christiane Fellbaum, Piek Vossen, 2012.

University of Melbourne Researchers

Citation metrics