Conference Proceedings

Geolocation prediction in social media data by finding location indicative words

B Han, P Cook, T Baldwin

Proceedings of the 24th International Conference on Computational Linguistics | Published: 2012


Geolocation prediction is vital to geospatial applications like localised search and local event detection. Predominately, social media geolocation models are based on full text data, including common words with no geospatial dimension (e.g. today) and noisy strings (tmrw), potentially hampering prediction and leading to slower/more memory-intensive models. In this paper, we focus on finding location indicative words (LIWs) via feature selection, and establishing whether the reduced feature set boosts geolocation accuracy. Our results show that an information gain ratio-based approach surpasses other methods at LIW selection, outperforming state-of-the-art geolocation prediction methods by 10..
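
As a rough illustration of the feature selection idea named in the abstract, the sketch below scores words by information gain ratio (information gain divided by the intrinsic entropy of the word's presence/absence split) over a toy set of city-labelled messages. The function names, the tiny corpus, and the binary word-presence formulation are assumptions made for this example; the paper's exact formulation and data may differ.

```python
import math
from collections import Counter


def entropy(counts):
    """Shannon entropy in bits of a distribution given as raw counts."""
    counts = [c for c in counts if c > 0]
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total) for c in counts)


def information_gain_ratio(docs, word):
    """Score `word` against city labels.

    docs: list of (set_of_words, city_label) pairs (hypothetical toy format).
    Returns 0.0 when the word splits nothing (present in all or no docs).
    """
    n = len(docs)
    all_cities = Counter(city for _, city in docs)
    with_word = Counter(city for words, city in docs if word in words)
    without_word = all_cities - with_word
    n_with = sum(with_word.values())
    n_without = n - n_with
    if n_with == 0 or n_without == 0:
        return 0.0  # intrinsic value is zero, so the ratio is undefined
    # Information gain: reduction in city entropy after splitting on the word.
    gain = (
        entropy(all_cities.values())
        - (n_with / n) * entropy(with_word.values())
        - (n_without / n) * entropy(without_word.values())
    )
    # Intrinsic value: entropy of the presence/absence split itself.
    intrinsic = entropy([n_with, n_without])
    return gain / intrinsic


# Made-up messages, reduced to word sets and tagged with a city.
docs = [
    ({"yinz", "today"}, "pittsburgh"),
    ({"yinz", "tmrw"}, "pittsburgh"),
    ({"today", "tram"}, "melbourne"),
    ({"tram", "tmrw"}, "melbourne"),
]

vocab = sorted(set().union(*(words for words, _ in docs)))
ranked = sorted(vocab, key=lambda w: information_gain_ratio(docs, w), reverse=True)
print(ranked)
```

In this toy corpus, words that occur in only one city rank above words such as "today" and "tmrw" that are spread evenly across cities, which mirrors the abstract's distinction between location indicative words and words with no geospatial dimension.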

