Conference Proceedings

Matching Cultural Heritage items to Wikipedia

Eneko Agirre, Ander Barrena, Oier Lopez de Lacalle, Aitor Soroa, Samuel Fernando, Mark Stevenson, N Calzolari (ed.), K Choukri (ed.), T Declerck (ed.), MU Dogan (ed.), B Maegaard (ed.), J Mariani (ed.), J Odijk (ed.), S Piperidis (ed.)

LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA | Published : 2012

Abstract

Digitised Cultural Heritage (CH) items usually have short descriptions and lack rich contextual information. Wikipedia articles, by contrast, include in-depth descriptions and links to related articles, which motivates the enrichment of CH items with information from Wikipedia. In this paper we explore the feasibility of finding matching articles in Wikipedia for a given Cultural Heritage item. We manually annotated a random sample of items from Europeana, and performed a qualitative and quantitative study of the issues and problems that arise, showing that each kind of CH item is different and needs a nuanced definition of what "matching article" means. In addition, we test a well-known ..

Grants

Awarded by European Community's Seventh Framework Programme


Awarded by KNOW2 project


Funding Acknowledgements

The research leading to these results was carried out as part of the PATHS project (http://paths-project.eu), funded by the European Community's Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 270082, and the KNOW2 project (TIN2009-14715-C04-01). We thank the anonymous reviewers for their comments.