A Joint Model for Multimodal Document Quality Assessment

Aili Shen; Bahar Salehi; Jianzhong Qi; Timothy Baldwin

arxiv: 1901.01010 · v2 · pith:H66QY4T7new · submitted 2019-01-04 · 💻 cs.CL · cs.AI· cs.DL

A Joint Model for Multimodal Document Quality Assessment

Aili Shen , Bahar Salehi , Timothy Baldwin , Jianzhong Qi This is my paper

classification 💻 cs.CL cs.AIcs.DL

keywords documentqualityvisualassessmentjointmodelrenderingresults

0 comments

read the original abstract

The quality of a document is affected by various factors, including grammaticality, readability, stylistics, and expertise depth, making the task of document quality assessment a complex one. In this paper, we explore this task in the context of assessing the quality of Wikipedia articles and academic papers. Observing that the visual rendering of a document can capture implicit quality indicators that are not present in the document text --- such as images, font choices, and visual layout --- we propose a joint model that combines the text content with a visual rendering of the document for document quality assessment. Experimental results over two datasets reveal that textual and visual features are complementary, achieving state-of-the-art results.

This paper has not been read by Pith yet.

A Joint Model for Multimodal Document Quality Assessment

discussion (0)