Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content

Lundgard, Alan; Satyanarayan, Arvind

Author(s)

Lundgard, Alan; Satyanarayan, Arvind

DownloadAccepted version (1.857Mb)

Open Access Policy

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/

Metadata

Show full item record

Abstract

Natural language descriptions sometimes accompany visualizations to better communicate and contextualize their insights, and to improve their accessibility for readers with disabilities. However, it is difficult to evaluate the usefulness of these descriptions, and how effectively they improve access to meaningful information, because we have little understanding of the semantic content they convey, and how different readers receive this content. In response, we introduce a conceptual model for the semantic content conveyed by natural language descriptions of visualizations. Developed through a grounded theory analysis of 2,147 sentences, our model spans four levels of semantic content: enumerating visualization construction properties (e.g., marks and encodings); reporting statistical concepts and relations (e.g., extrema and correlations); identifying perceptual and cognitive phenomena (e.g., complex trends and patterns); and elucidating domain-specific insights (e.g., social and political context). To demonstrate how our model can be applied to evaluate the effectiveness of visualization descriptions, we conduct a mixed-methods evaluation with 30 blind and 90 sighted readers, and find that these reader groups differ significantly on which semantic content they rank as most useful. Together, our model and findings suggest that access to meaningful information is strongly reader-specific, and that research in automatic visualization captioning should orient toward descriptions that more richly communicate overall trends and statistics, sensitive to reader preferences. Our work further opens a space of research on natural language as a data interface coequal with visualization.

Date issued

2022

URI

https://hdl.handle.net/1721.1/143862

Department

Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory

Journal

IEEE Transactions on Visualization and Computer Graphics

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Citation

Lundgard, Alan and Satyanarayan, Arvind. 2022. "Accessible Visualization via Natural Language Descriptions: A Four-Level Model of Semantic Content." IEEE Transactions on Visualization and Computer Graphics, 28 (1).

Version: Author's final manuscript

Collections

MIT Open Access Articles