Conference Proceedings
Automatic labelling of topic models
JH Lau, K Grieser, D Newman, T Baldwin
Acl Hlt 2011 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics Human Language Technologies | Published : 2011
Abstract
We propose a method for automatically labelling topics learned via LDA topic models. We generate our label candidate set from the top-ranking topic terms, titles ofWikipedia articles containing the top-ranking topic terms, and sub-phrases extracted from the Wikipedia article titles. We rank the label candidates using a combination of association measures and lexical features, optionally fed into a supervised ranking model. Our method is shown to perform strongly over four independent sets of topics, significantly better than a benchmark method. © 2011 Association for Computational Linguistics.