Conference Proceedings

Implementing MapReduce over language and literature data over the UK National Grid Service

MS Sarwar, M Alexander, J Anderson, J Green, RO Sinnott

2011 7th International Conference on Emerging Technologies, ICET 2011 | Published : 2011

Abstract

Humanities researchers are producing large volumes and heterogeneous varieties of language and literature data collections in digital format. These collections include dictionaries, thesauri, corpora, images, audio and video resources. The increased availability of these datasets brought about by advances and adaptations of the Internet and increased digitisation of humanities data resources, poses new challenges for humanities researchers. Many of these challenges are related to data access and usage and include security, integrity, interoperability, information retrieval, sharing, licensing and copyright. The JISC-funded project Enhancing Repositories for Language and Literature Research (..

View full abstract