Conference Proceedings

A joint model for multimodal document quality assessment

A Shen, B Salehi, T Baldwin, J Qi

2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL) | IEEE | Published : 2019

Abstract

© 2019 IEEE. The quality of a document is affected by various factors, including grammaticality, readability, stylistics, and expertise depth, making the task of document quality assessment a complex one. In this paper, we explore this task in the context of assessing the quality of Wikipedia articles. Observing that the visual rendering of a document can capture implicit quality indicators that are not present in the document text-such as images, font choices, and visual layout-we propose a joint model that combines the text content with a visual rendering of the document for document quality assessment. Experimental results over a Wikipedia dataset reveal that textual and visual features a..

View full abstract