Text-to-Viz: Automatic Generation of Infographics from Proportion-Related Natural Language Statements

Bei Chen; Dongmei Zhang; Haidong Zhang; He Huang; Jian-Guan Lou; Lei Fang; Weiwei Cui; Xiaoyu Zhang; Yun Wang

arxiv: 1907.09091 · v1 · pith:7WJ25SCBnew · submitted 2019-07-22 · 💻 cs.HC

Text-to-Viz: Automatic Generation of Infographics from Proportion-Related Natural Language Statements

Weiwei Cui , Xiaoyu Zhang , Yun Wang , He Huang , Bei Chen , Lei Fang , Haidong Zhang , Jian-Guan Lou

show 1 more author

Dongmei Zhang

This is my paper

Pith reviewed 2026-05-24 18:25 UTC · model grok-4.3

classification 💻 cs.HC

keywords infographicsnatural language inputautomatic generationproportion statisticsdata visualizationproof-of-concept systemdesign space studycasual users

0 comments

The pith

A proof-of-concept system automatically converts natural language statements about simple proportions into sets of pre-designed infographics.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper investigates an alternative to manual infographic tools by generating visuals directly from text. It begins with a preliminary study that maps the design space for proportion-related infographics. From that study the authors construct a system that accepts statements about simple proportion statistics and outputs multiple styled infographic variants. The goal is to let casual users produce engaging visuals without learning authoring software or possessing design skills. If the approach works, it removes a major barrier between raw proportion data and usable, memorable presentations.

Core claim

After mapping the design space through a preliminary study, the authors built a proof-of-concept system that automatically converts statements about simple proportion-related statistics to a set of infographics with pre-designed styles.

What carries the argument

The proof-of-concept system that maps proportion statements to pre-designed infographic styles on the basis of the preliminary design-space study.

If this is right

Casual users without design training can obtain multiple infographic options from a single proportion statement.
The system focuses exclusively on simple proportion-related statistics rather than arbitrary data.
Pre-designed styles replace the need for users to choose layouts or visual elements manually.
The output set of infographics is intended to be immediately usable for communication.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same pipeline might later handle statements that combine proportions with other quantitative relations if the design space is expanded.
Voice input could replace typed statements, allowing on-the-fly generation during conversations or presentations.
The pre-designed style library could be crowdsourced or learned from existing infographics rather than hand-crafted.

Load-bearing premise

The preliminary study sufficiently captures the design space so that pre-designed styles can produce acceptable infographics for the targeted class of statements.

What would settle it

A test in which participants consistently rate the system outputs as visually unappealing or factually misleading for the input statements would show the approach does not work.

Figures

Figures reproduced from arXiv: 1907.09091 by Bei Chen, Dongmei Zhang, Haidong Zhang, He Huang, Jian-Guan Lou, Lei Fang, Weiwei Cui, Xiaoyu Zhang, Yun Wang.

**Figure 2.** Figure 2: Example of breaking a search result (Infographic of Infograph [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 5.** Figure 5: Exemplars of rank-related infographics [56, 70]: (a) highlighted [PITH_FULL_IMAGE:figures/full_fig_p004_5.png] view at source ↗

**Figure 4.** Figure 4: Exemplars of change-related infographics [57, 65]: (a) con [PITH_FULL_IMAGE:figures/full_fig_p004_4.png] view at source ↗

**Figure 6.** Figure 6: Exemplars of infographics with multiple facts [50]: (a) side-by [PITH_FULL_IMAGE:figures/full_fig_p005_6.png] view at source ↗

**Figure 7.** Figure 7: An example of entities and labels in a statement. Following [PITH_FULL_IMAGE:figures/full_fig_p006_7.png] view at source ↗

**Figure 8.** Figure 8: (a) A layout blueprint example and (b) its realization. [PITH_FULL_IMAGE:figures/full_fig_p007_8.png] view at source ↗

read the original abstract

Combining data content with visual embellishments, infographics can effectively deliver messages in an engaging and memorable manner. Various authoring tools have been proposed to facilitate the creation of infographics. However, creating a professional infographic with these authoring tools is still not an easy task, requiring much time and design expertise. Therefore, these tools are generally not attractive to casual users, who are either unwilling to take time to learn the tools or lacking in proper design expertise to create a professional infographic. In this paper, we explore an alternative approach: to automatically generate infographics from natural language statements. We first conducted a preliminary study to explore the design space of infographics. Based on the preliminary study, we built a proof-of-concept system that automatically converts statements about simple proportion-related statistics to a set of infographics with pre-designed styles. Finally, we demonstrated the usability and usefulness of the system through sample results, exhibits, and expert reviews.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

A scoped proof-of-concept that turns simple proportion statements into pre-styled infographics after a design study, with samples and expert feedback but minimal metrics.

read the letter

The main thing to know is that the authors built a system converting natural language statements about proportions into infographics using pre-designed styles. They started with a preliminary study to map the design space, then implemented the proof-of-concept and showed it through examples and expert reviews. This targets casual users who avoid complex authoring tools, and the narrow focus on proportions keeps the task doable. The approach is practical for that slice of the problem and shows some thought in grounding the styles in the initial study. The evaluation stays qualitative. Expert reviews provide basic confirmation but leave out quantitative measures of parsing accuracy, style selection success, or how well it handles phrasing variations. Implementation details on the NLP side are not visible in the abstract, which makes it hard to assess edge cases or reproducibility. Citation context is also missing, so overlap with earlier natural language to visualization work is unclear. This paper fits researchers working on automated visualization tools or natural language interfaces in HCI. A reader interested in scoped automation for data communication could pick up ideas from the design study and the decision to limit input types. It shows honest engagement with a real user barrier without overclaiming generality. The work deserves peer review because the core claim is modest and tied to a concrete study, even if more user testing and technical specifics would help.

Referee Report

2 major / 2 minor

Summary. The paper claims to have conducted a preliminary study exploring the design space of infographics, built a proof-of-concept system (Text-to-Viz) that automatically converts natural language statements about simple proportion-related statistics into infographics using pre-designed styles, and demonstrated the system's usability and usefulness via sample results, exhibits, and expert reviews.

Significance. If the described pipeline holds, the work could lower barriers for casual users to produce engaging proportion-based infographics without requiring design expertise or time-intensive authoring, contributing to automated visualization tools in HCI.

major comments (2)

[Abstract] Abstract and overall manuscript: the central claim rests on system construction and qualitative expert feedback, yet the text supplies no implementation details, metrics, failure cases, or quantitative evaluation, preventing verification of whether the pre-designed styles reliably cover the targeted input class.
[Preliminary Study] Preliminary study section: the claim that this study sufficiently maps the design space to enable acceptable pre-designed styles for simple proportion statements is load-bearing, but no methodology, participant details, or derivation process for the styles is provided to assess coverage or completeness.

minor comments (2)

Add explicit discussion of related work on NL-to-vis systems and infographic authoring tools to better situate the contribution.
Clarify the exact scope of 'simple proportion-related statistics' with examples of supported and unsupported statement types.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed feedback. The comments highlight areas where the manuscript can be strengthened with additional details on the system and study. We will revise accordingly while maintaining the proof-of-concept nature of the work.

read point-by-point responses

Referee: [Abstract] Abstract and overall manuscript: the central claim rests on system construction and qualitative expert feedback, yet the text supplies no implementation details, metrics, failure cases, or quantitative evaluation, preventing verification of whether the pre-designed styles reliably cover the targeted input class.

Authors: We agree that the current version provides limited implementation specifics. In the revision, we will add a dedicated system implementation section describing the NLP pipeline for parsing proportion statements, the rule-based style selection mechanism, the set of pre-designed styles, and concrete examples of both successful outputs and failure cases (e.g., ambiguous statements or unsupported proportion types). We will also include a limitations subsection discussing coverage of the input class. As the contribution is framed as a proof-of-concept rather than a production system, the evaluation remains qualitative via expert reviews; we will clarify this positioning and note that quantitative metrics (e.g., coverage rate on a held-out statement set) could be added if the reviewers consider them essential. revision: yes
Referee: [Preliminary Study] Preliminary study section: the claim that this study sufficiently maps the design space to enable acceptable pre-designed styles for simple proportion statements is load-bearing, but no methodology, participant details, or derivation process for the styles is provided to assess coverage or completeness.

Authors: We acknowledge that the preliminary study section is currently high-level. In the revision, we will expand it to report the study methodology (e.g., how infographic examples were collected and analyzed), participant information (number, background, recruitment), the process used to derive the design space dimensions, and the explicit mapping from study findings to the final pre-designed styles. This will allow readers to evaluate the completeness and rationale for the chosen styles. revision: yes

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper presents a proof-of-concept system for generating infographics from proportion-related statements. It relies on a preliminary study to explore the design space, followed by system construction with pre-designed styles, and evaluation via sample results and expert reviews. No equations, derivations, fitted parameters, predictions, or load-bearing self-citations appear in the argument. The central claim reduces to system construction and qualitative demonstration rather than any self-referential loop or imported uniqueness result. The derivation chain is self-contained against external benchmarks of system-building papers.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No mathematical content, free parameters, axioms, or invented entities are present in the abstract; the work is a system-building effort in HCI.

pith-pipeline@v0.9.0 · 5714 in / 940 out tokens · 17886 ms · 2026-05-24T18:25:44.930347+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

74 extracted references · 74 canonical work pages · 4 internal anchors

[1]

Adobe color cc

Adobe. Adobe color cc. https://color.adobe.com

work page
[2]

Amini, N

F. Amini, N. H. Riche, B. Lee, A. Monroy-Hernandez, and P. Irani. Authoring data-driven videos with dataclips. IEEE transactions on visualization and computer graphics, 23(1):501– 510, 2017

work page 2017
[3]

Artacho-Ramirez, J

M. Artacho-Ramirez, J. Diego-Mas, and J. Alcaide-Marzal. In- ﬂuence of the mode of graphical representation on the percep- tion of product aesthetic and emotional features: An exploratory study. International Journal of Industrial Ergonomics , 38(11- 12):942–952, 2008

work page 2008
[4]

Asahara and Y

M. Asahara and Y . Matsumoto. Japanese named entity extraction with redundant morphological analysis. In Proceedings of the 2003 Conference of the North American Chapter of the Associ- ation for Computational Linguistics on Human Language Tech- nology, pages 8–15. Association for Computational Linguistics, 2003

work page 2003
[5]

B. Bach, Z. Wang, M. Farinella, D. Murray-Rust, and N. Henry Riche. Design patterns for data comics. InProceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 38. ACM, 2018

work page 2018
[6]

G. J. Badros, A. Borning, and P. J. Stuckey. The cassowary linear arithmetic constraint solving algorithm. ACM Transactions on Computer-Human Interaction (TOCHI), 8(4):267–306, 2001

work page 2001
[7]

Bateman, R

S. Bateman, R. L. Mandryk, C. Gutwin, A. Genest, D. McDine, and C. Brooks. Useful junk?: the effects of visual embellishment on comprehension and memorability of charts. In Proceedings of the SIGCHI Conference on Human Factors in Computing Sys- tems, pages 2573–2582. ACM, 2010

work page 2010
[8]

Berant, A

J. Berant, A. Chou, R. Frostig, and P. Liang. Semantic pars- ing on freebase from question-answer pairs. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1533–1544, 2013

work page 2013
[9]

D. M. Bikel, S. Miller, R. Schwartz, and R. Weischedel. Nymble: a high-performance learning name-ﬁnder. In Proceedings of the ﬁfth conference on Applied natural language processing , pages 194–201. Association for Computational Linguistics, 1997

work page 1997
[10]

Borgo, A

R. Borgo, A. Abdul-Rahman, F. Mohamed, P. W. Grant, I. Reppa, L. Floridi, and M. Chen. An empirical study on using visual embellishments in visualization.IEEE Transactions on Vi- sualization and Computer Graphics, 18(12):2759–2768, 2012

work page 2012
[11]

M. A. Borkin, Z. Bylinskii, N. W. Kim, C. M. Bainbridge, C. S. Yeh, D. Borkin, H. Pﬁster, and A. Oliva. Beyond memorabil- ity: Visualization recognition and recall. IEEE transactions on visualization and computer graphics, 22(1):519–528, 2016

work page 2016
[12]

M. A. Borkin, A. A. V o, Z. Bylinskii, P. Isola, S. Sunkavalli, A. Oliva, and H. Pﬁster. What makes a visualization memorable? IEEE Transactions on Visualization and Computer Graphics , 19(12):2306–2315, 2013

work page 2013
[13]

P. F. Brown, P. V . Desouza, R. L. Mercer, V . J. D. Pietra, and J. C. Lai. Class-based n-gram models of natural language. Computa- tional linguistics, 18(4):467–479, 1992

work page 1992
[14]

Bryan, K.-L

C. Bryan, K.-L. Ma, and J. Woodring. Temporal summary im- ages: An approach to narrative visualization via interactive an- notation generation and placement. IEEE transactions on visu- alization and computer graphics, 23(1):511–520, 2017

work page 2017
[15]

Understanding Infographics through Textual and Visual Tag Prediction

Z. Bylinskii, S. Alsheikh, S. Madan, A. Recasens, K. Zhong, H. Pﬁster, F. Durand, and A. Oliva. Understanding infograph- ics through textual and visual tag prediction. arXiv preprint arXiv:1709.09215, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[16]

Byrne, D

L. Byrne, D. Angus, and J. Wiles. Acquired codes of meaning in data visualization and infographics: beyond perceptual primi- tives. IEEE transactions on visualization and computer graphics, 22(1):509–518, 2016

work page 2016
[17]

I. Cash. Infographic of infographics. http://www.ivan.c ash/infographic-of-infographics

work page
[18]

Coolors. coolors. https://coolors.co/browser

work page
[19]

Z. Cui, S. K. Badam, M. A. Yalc ¸in, and N. Elmqvist. Datasite: Proactive visual data exploration with computation of insight- based recommendations. Information Visualization, 18(2):251– 267, 2019

work page 2019
[20]

De Marneffe, B

M.-C. De Marneffe, B. MacCartney, C. D. Manning, et al. Gen- erating typed dependency parses from phrase structure parses. In Proceedings of Language Resources and Evaluation Conference, pages 449–454. Genoa Italy, 2006

work page 2006
[21]

Demiralp, P

C ¸ . Demiralp, P. J. Haas, S. Parthasarathy, and T. Pedapati. Fore- sight: Recommending visual insights. Proceedings of the VLDB Endowment, 10(12):1937–1940, 2017

work page 1937
[22]

T. Gao, M. Dontcheva, E. Adar, Z. Liu, and K. G. Karahalios. Datatone: Managing ambiguity in natural language interfaces for data visualization. In Proceedings of the 28th Annual ACM Sym- posium on User Interface Software & Technology , pages 489–

work page
[23]

Haroz, R

S. Haroz, R. Kosara, and S. L. Franconeri. Isotype visualiza- tion: Working memory, performance, and engagement with pic- tographs. In Proceedings of the 33rd annual ACM conference on human factors in computing systems , pages 1191–1200. ACM, 2015

work page 2015
[24]

Harrison, K

L. Harrison, K. Reinecke, and R. Chang. Infographic aesthet- ics: Designing for the ﬁrst impression. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, pages 1187–1190. ACM, 2015

work page 2015
[25]

B. Hu, Z. Lu, H. Li, and Q. Chen. Convolutional neural net- work architectures for matching natural language sentences. In Advances in neural information processing systems, pages 2042– 2050, 2014

work page 2042
[26]

K. Z. Hu, M. A. Bakker, S. Li, T. Kraska, and C. A. Hidalgo. Vizml: A machine learning approach to visualization recommen- dation. arXiv preprint arXiv:1808.04819, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[27]

Hullman, E

J. Hullman, E. Adar, and P. Shah. The impact of social informa- tion on visual judgments. In Proceedings of the SIGCHI Con- ference on Human Factors in Computing Systems , pages 1461–

work page
[28]

Hullman, N

J. Hullman, N. Diakopoulos, and E. Adar. Contextiﬁer: auto- matic generation of annotated stock visualizations. In Proceed- ings of the SIGCHI Conference on Human Factors in Computing Systems, pages 2707–2716. ACM, 2013

work page 2013
[29]

M. Ju, M. Miwa, and S. Ananiadou. A neural layered model for nested named entity recognition. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 1446–1459, 2018

work page 2018
[30]

N. W. Kim, E. Schweickart, Z. Liu, M. Dontcheva, W. Li, J. Popovic, and H. Pﬁster. Data-driven guides: Supporting ex- pressive design for information graphics. IEEE transactions on visualization and computer graphics, 23(1):491–500, 2017

work page 2017
[31]

D. E. Knuth and M. F. Plass. Breaking paragraphs into lines. Software: Practice and Experience, 11(11):1119–1184, 1981

work page 1981
[32]

H.-K. Kong, Z. Liu, and K. Karahalios. Internal and external vi- sual cue preferences for visualizations in presentations. In Com- puter Graphics Forum, volume 36, pages 515–525. Wiley Online Library, 2017

work page 2017
[33]

R. Kosara. Presentation-oriented visualization techniques. IEEE computer graphics and applications, 36(1):80–85, 2016

work page 2016
[34]

M. Lamm, A. T. Chaganty, C. D. Manning, D. Jurafsky, and P. Liang. Textual analogy parsing: What’s shared and what’s compared among analogous facts. arXiv preprint arXiv:1809.02700, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[35]

S. Lin, J. Fortuna, C. Kulkarni, M. Stone, and J. Heer. Selecting semantically-resonant colors for data visualization. In Computer Graphics Forum, volume 32, pages 401–410. Wiley Online Li- brary, 2013

work page 2013
[36]

Z. Liu, J. Thompson, A. Wilson, M. Dontcheva, J. Delorey, S. Grigg, B. Kerr, and J. Stasko. Data illustrator: Augmenting vector design tools with lazy data binding for expressive visual- ization authoring. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 123. ACM, 2018

work page 2018
[37]

Y . Luo, X. Qin, N. Tang, and G. Li. Deepeye: Towards automatic data visualization. In 2018 IEEE 34th International Conference on Data Engineering (ICDE), pages 101–112. IEEE, 2018

work page 2018
[38]

Mackinlay, P

J. Mackinlay, P. Hanrahan, and C. Stolte. Show Me: Automatic presentation for visual analysis. IEEE transactions on visualiza- tion and computer graphics, 13(6):1137–1144, 2007

work page 2007
[39]

Madan, Z

S. Madan, Z. Bylinskii, M. Tancik, A. Recasens, K. Zhong, S. Alsheikh, H. Pﬁster, A. Oliva, and F. Durand. Synthetically trained icon proposals for parsing and summarizing infograph- ics. arXiv preprint arXiv:1807.10441, 2018

work page arXiv 2018
[40]

M `arquez and H

L. M `arquez and H. Rodr ´ıguez. Part-of-speech tagging using decision trees. In European Conference on Machine Learning , pages 25–36. Springer, 1998

work page 1998
[41]

McCallum and W

A. McCallum and W. Li. Early results for named entity recogni- tion with conditional random ﬁelds, feature induction and web- enhanced lexicons. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003, pages 188–

work page 2003
[42]

Association for Computational Linguistics, 2003

work page 2003
[43]

G. G. M ´endez, M. A. Nacenta, and S. Vandenheste. iV oLVER: Interactive visual language for visualization extraction and re- construction. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, pages 4073–4085. ACM, 2016

work page 2016
[44]

Efficient Estimation of Word Representations in Vector Space

T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efﬁcient esti- mation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[45]

Mitchell

B. Mitchell. Behind the internet curtain. https://www.di gitalrealty.com/blog/behind-the-internet-c urtain, 2014

work page 2014
[46]

A. V . Moere and H. Purchase. On the role of design in infor- mation visualization. Information Visualization, 10(4):356–371, 2011

work page 2011
[47]

A. V . Moere, M. Tomitsch, C. Wimmer, B. Christoph, and T. Grechenig. Evaluating the effect of style in information vi- sualization. IEEE transactions on visualization and computer graphics, 18(12):2739–2748, 2012

work page 2012
[48]

Moritz, C

D. Moritz, C. Wang, G. L. Nelson, H. Lin, A. M. Smith, B. Howe, and J. Heer. Formalizing visualization design knowl- edge as constraints: Actionable and extensible models in draco. IEEE transactions on visualization and computer graphics , 25(1):438–448, 2019

work page 2019
[49]

Nadeau and S

D. Nadeau and S. Sekine. A survey of named entity recogni- tion and classiﬁcation. Lingvisticae Investigationes, 30(1):3–26, 2007

work page 2007
[50]

H. C. Purchase, K. Isaacs, T. Bueti, B. Hastings, A. Kassam, A. Kim, and S. van Hoesen. A classiﬁcation of infographics. In International Conference on Theory and Application of Dia- grams, pages 210–218. Springer, 2018

work page 2018
[51]

Mobile payments world view

Raconteur. Mobile payments world view. https://michae lrosensays.wordpress.com/tag/giving-usa-20 15-infographic, 2015

work page 2015
[52]

L. A. Ramshaw and M. P. Marcus. Text chunking using transformation-based learning. In Natural language processing using very large corpora, pages 157–176. Springer, 1999

work page 1999
[53]

L. F. Rau. Extracting company names from text. In Artiﬁcial In- telligence Applications, 1991. Proceedings., Seventh IEEE Con- ference on, volume 1, pages 29–32. IEEE, 1991

work page 1991
[54]

D. Ren, M. Brehmer, B. Lee, T. H ¨ollerer, and E. K. Choe. Char- taccent: Annotation for data-driven storytelling. In 2017 IEEE Paciﬁc Visualization Symposium (PaciﬁcVis) , pages 230–239. IEEE, 2017

work page 2017
[55]

D. Ren, T. H ¨ollerer, and X. Yuan. iVisDesigner: Expressive interactive design of information visualizations. IEEE trans- actions on visualization and computer graphics , 20(12):2092– 2101, 2014

work page 2092
[56]

D. Ren, B. Lee, and M. Brehmer. Charticulator: Interactive con- struction of bespoke chart layouts. IEEE transactions on visual- ization and computer graphics, 25(1):789–799, 2019

work page 2019
[57]

F. Richter. Ibm tops u.s. patent ranking for 21st consecutive year. https://www.statista.com/chart/1796/us-pat ent-ranking-2013, 2014

work page 2013
[58]

M. J. Rosen. Strong american philanthropy at a record high! https://michaelrosensays.wordpress.com/tag /giving-usa-2015-infographic , 2015

work page 2015
[59]

Satyanarayan and J

A. Satyanarayan and J. Heer. Lyra: An interactive visualization design environment. In Computer Graphics Forum, volume 33, pages 351–360. Wiley Online Library, 2014

work page 2014
[60]

Segel and J

E. Segel and J. Heer. Narrative visualization: Telling stories with data. IEEE transactions on visualization and computer graphics, 16(6):1139–1148, 2010

work page 2010
[61]

Setlur, S

V . Setlur, S. E. Battersby, M. Tory, R. Gossweiler, and A. X. Chang. Eviza: A natural language interface for visual analysis. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology, pages 365–377. ACM, 2016

work page 2016
[62]

Setlur and M

V . Setlur and M. C. Stone. A linguistic approach to categorical color assignment for data visualization. IEEE transactions on visualization and computer graphics, 22(1):698–707, 2016

work page 2016
[63]

W. V . Siricharoen. Infographics: the new communication tools in digital age. In The international conference on e-technologies and business on the web (ebw2013), pages 169–174. The Society of Digital Information and Wireless Communication, 2013

work page 2013
[64]

Skau and R

D. Skau and R. Kosara. Readability and precision in pictorial bar charts. In Proceedings of the Eurographics/IEEE VGTC Confer- ence on Visualization: Short Papers, pages 91–95. Eurographics Association, 2017

work page 2017
[65]

Srinivasan, S

A. Srinivasan, S. M. Drucker, A. Endert, and J. Stasko. Aug- menting visualizations with interactive data facts to facilitate in- terpretation and communication. IEEE transactions on visual- ization and computer graphics, 25(1):672–681, 2018

work page 2018
[66]

J. Stegman. 3 way to grow your support revenue. https: //www.tsia.com/blog/infographic-3-ways-to- grow-your-support-revenue , 2015

work page 2015
[67]

Y . Sun, J. Leigh, A. Johnson, and S. Lee. Articulate: A semi- automated model for translating natural language queries into meaningful visualizations. In International Symposium on Smart Graphics, pages 184–195. Springer, 2010

work page 2010
[68]

E. R. Tufte. The visual display of quantitative information , vol- ume 2. Graphics press Cheshire, CT, 2001

work page 2001
[69]

F. B. Viegas, M. Wattenberg, F. Van Ham, J. Kriss, and M. McK- eon. Manyeyes: a site for visualization at internet scale. IEEE transactions on visualization and computer graphics , 13(6), 2007

work page 2007
[70]

Y . Wang, H. Zhang, H. Huang, X. Chen, Q. Yin, Z. Hou, D. Zhang, Q. Luo, and H. Qu. Infonice: Easy creation of infor- mation graphics. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 335. ACM, 2018

work page 2018
[71]

Willingham

T. Willingham. Alligator pear: Imports and exports. https: //www.freightwaves.com/news/infographics/a lligator-pear-import-export , 2019

work page 2019
[72]

Winograd

T. Winograd. Understanding natural language. Cognitive psy- chology, 3(1):1–191, 1972

work page 1972
[73]

Wongsuphasawat, D

K. Wongsuphasawat, D. Moritz, A. Anand, J. Mackinlay, B. Howe, and J. Heer. V oyager: Exploratory analysis via faceted browsing of visualization recommendations. IEEE transactions on visualization and computer graphics, 22(1):649–658, 2016

work page 2016
[74]

H. Xia, N. Henry Riche, F. Chevalier, B. De Araujo, and D. Wig- dor. Dataink: Direct and creative data-oriented drawing. In Pro- ceedings of the 2018 CHI Conference on Human Factors in Com- puting Systems, page 223. ACM, 2018

work page 2018

[1] [1]

Adobe color cc

Adobe. Adobe color cc. https://color.adobe.com

work page

[2] [2]

Amini, N

F. Amini, N. H. Riche, B. Lee, A. Monroy-Hernandez, and P. Irani. Authoring data-driven videos with dataclips. IEEE transactions on visualization and computer graphics, 23(1):501– 510, 2017

work page 2017

[3] [3]

Artacho-Ramirez, J

M. Artacho-Ramirez, J. Diego-Mas, and J. Alcaide-Marzal. In- ﬂuence of the mode of graphical representation on the percep- tion of product aesthetic and emotional features: An exploratory study. International Journal of Industrial Ergonomics , 38(11- 12):942–952, 2008

work page 2008

[4] [4]

Asahara and Y

M. Asahara and Y . Matsumoto. Japanese named entity extraction with redundant morphological analysis. In Proceedings of the 2003 Conference of the North American Chapter of the Associ- ation for Computational Linguistics on Human Language Tech- nology, pages 8–15. Association for Computational Linguistics, 2003

work page 2003

[5] [5]

B. Bach, Z. Wang, M. Farinella, D. Murray-Rust, and N. Henry Riche. Design patterns for data comics. InProceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 38. ACM, 2018

work page 2018

[6] [6]

G. J. Badros, A. Borning, and P. J. Stuckey. The cassowary linear arithmetic constraint solving algorithm. ACM Transactions on Computer-Human Interaction (TOCHI), 8(4):267–306, 2001

work page 2001

[7] [7]

Bateman, R

S. Bateman, R. L. Mandryk, C. Gutwin, A. Genest, D. McDine, and C. Brooks. Useful junk?: the effects of visual embellishment on comprehension and memorability of charts. In Proceedings of the SIGCHI Conference on Human Factors in Computing Sys- tems, pages 2573–2582. ACM, 2010

work page 2010

[8] [8]

Berant, A

J. Berant, A. Chou, R. Frostig, and P. Liang. Semantic pars- ing on freebase from question-answer pairs. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 1533–1544, 2013

work page 2013

[9] [9]

D. M. Bikel, S. Miller, R. Schwartz, and R. Weischedel. Nymble: a high-performance learning name-ﬁnder. In Proceedings of the ﬁfth conference on Applied natural language processing , pages 194–201. Association for Computational Linguistics, 1997

work page 1997

[10] [10]

Borgo, A

R. Borgo, A. Abdul-Rahman, F. Mohamed, P. W. Grant, I. Reppa, L. Floridi, and M. Chen. An empirical study on using visual embellishments in visualization.IEEE Transactions on Vi- sualization and Computer Graphics, 18(12):2759–2768, 2012

work page 2012

[11] [11]

M. A. Borkin, Z. Bylinskii, N. W. Kim, C. M. Bainbridge, C. S. Yeh, D. Borkin, H. Pﬁster, and A. Oliva. Beyond memorabil- ity: Visualization recognition and recall. IEEE transactions on visualization and computer graphics, 22(1):519–528, 2016

work page 2016

[12] [12]

M. A. Borkin, A. A. V o, Z. Bylinskii, P. Isola, S. Sunkavalli, A. Oliva, and H. Pﬁster. What makes a visualization memorable? IEEE Transactions on Visualization and Computer Graphics , 19(12):2306–2315, 2013

work page 2013

[13] [13]

P. F. Brown, P. V . Desouza, R. L. Mercer, V . J. D. Pietra, and J. C. Lai. Class-based n-gram models of natural language. Computa- tional linguistics, 18(4):467–479, 1992

work page 1992

[14] [14]

Bryan, K.-L

C. Bryan, K.-L. Ma, and J. Woodring. Temporal summary im- ages: An approach to narrative visualization via interactive an- notation generation and placement. IEEE transactions on visu- alization and computer graphics, 23(1):511–520, 2017

work page 2017

[15] [15]

Understanding Infographics through Textual and Visual Tag Prediction

Z. Bylinskii, S. Alsheikh, S. Madan, A. Recasens, K. Zhong, H. Pﬁster, F. Durand, and A. Oliva. Understanding infograph- ics through textual and visual tag prediction. arXiv preprint arXiv:1709.09215, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[16] [16]

Byrne, D

L. Byrne, D. Angus, and J. Wiles. Acquired codes of meaning in data visualization and infographics: beyond perceptual primi- tives. IEEE transactions on visualization and computer graphics, 22(1):509–518, 2016

work page 2016

[17] [17]

I. Cash. Infographic of infographics. http://www.ivan.c ash/infographic-of-infographics

work page

[18] [18]

Coolors. coolors. https://coolors.co/browser

work page

[19] [19]

Z. Cui, S. K. Badam, M. A. Yalc ¸in, and N. Elmqvist. Datasite: Proactive visual data exploration with computation of insight- based recommendations. Information Visualization, 18(2):251– 267, 2019

work page 2019

[20] [20]

De Marneffe, B

M.-C. De Marneffe, B. MacCartney, C. D. Manning, et al. Gen- erating typed dependency parses from phrase structure parses. In Proceedings of Language Resources and Evaluation Conference, pages 449–454. Genoa Italy, 2006

work page 2006

[21] [21]

Demiralp, P

C ¸ . Demiralp, P. J. Haas, S. Parthasarathy, and T. Pedapati. Fore- sight: Recommending visual insights. Proceedings of the VLDB Endowment, 10(12):1937–1940, 2017

work page 1937

[22] [22]

T. Gao, M. Dontcheva, E. Adar, Z. Liu, and K. G. Karahalios. Datatone: Managing ambiguity in natural language interfaces for data visualization. In Proceedings of the 28th Annual ACM Sym- posium on User Interface Software & Technology , pages 489–

work page

[23] [23]

Haroz, R

S. Haroz, R. Kosara, and S. L. Franconeri. Isotype visualiza- tion: Working memory, performance, and engagement with pic- tographs. In Proceedings of the 33rd annual ACM conference on human factors in computing systems , pages 1191–1200. ACM, 2015

work page 2015

[24] [24]

Harrison, K

L. Harrison, K. Reinecke, and R. Chang. Infographic aesthet- ics: Designing for the ﬁrst impression. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, pages 1187–1190. ACM, 2015

work page 2015

[25] [25]

B. Hu, Z. Lu, H. Li, and Q. Chen. Convolutional neural net- work architectures for matching natural language sentences. In Advances in neural information processing systems, pages 2042– 2050, 2014

work page 2042

[26] [26]

K. Z. Hu, M. A. Bakker, S. Li, T. Kraska, and C. A. Hidalgo. Vizml: A machine learning approach to visualization recommen- dation. arXiv preprint arXiv:1808.04819, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[27] [27]

Hullman, E

J. Hullman, E. Adar, and P. Shah. The impact of social informa- tion on visual judgments. In Proceedings of the SIGCHI Con- ference on Human Factors in Computing Systems , pages 1461–

work page

[28] [28]

Hullman, N

J. Hullman, N. Diakopoulos, and E. Adar. Contextiﬁer: auto- matic generation of annotated stock visualizations. In Proceed- ings of the SIGCHI Conference on Human Factors in Computing Systems, pages 2707–2716. ACM, 2013

work page 2013

[29] [29]

M. Ju, M. Miwa, and S. Ananiadou. A neural layered model for nested named entity recognition. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 1446–1459, 2018

work page 2018

[30] [30]

N. W. Kim, E. Schweickart, Z. Liu, M. Dontcheva, W. Li, J. Popovic, and H. Pﬁster. Data-driven guides: Supporting ex- pressive design for information graphics. IEEE transactions on visualization and computer graphics, 23(1):491–500, 2017

work page 2017

[31] [31]

D. E. Knuth and M. F. Plass. Breaking paragraphs into lines. Software: Practice and Experience, 11(11):1119–1184, 1981

work page 1981

[32] [32]

H.-K. Kong, Z. Liu, and K. Karahalios. Internal and external vi- sual cue preferences for visualizations in presentations. In Com- puter Graphics Forum, volume 36, pages 515–525. Wiley Online Library, 2017

work page 2017

[33] [33]

R. Kosara. Presentation-oriented visualization techniques. IEEE computer graphics and applications, 36(1):80–85, 2016

work page 2016

[34] [34]

M. Lamm, A. T. Chaganty, C. D. Manning, D. Jurafsky, and P. Liang. Textual analogy parsing: What’s shared and what’s compared among analogous facts. arXiv preprint arXiv:1809.02700, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[35] [35]

S. Lin, J. Fortuna, C. Kulkarni, M. Stone, and J. Heer. Selecting semantically-resonant colors for data visualization. In Computer Graphics Forum, volume 32, pages 401–410. Wiley Online Li- brary, 2013

work page 2013

[36] [36]

Z. Liu, J. Thompson, A. Wilson, M. Dontcheva, J. Delorey, S. Grigg, B. Kerr, and J. Stasko. Data illustrator: Augmenting vector design tools with lazy data binding for expressive visual- ization authoring. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 123. ACM, 2018

work page 2018

[37] [37]

Y . Luo, X. Qin, N. Tang, and G. Li. Deepeye: Towards automatic data visualization. In 2018 IEEE 34th International Conference on Data Engineering (ICDE), pages 101–112. IEEE, 2018

work page 2018

[38] [38]

Mackinlay, P

J. Mackinlay, P. Hanrahan, and C. Stolte. Show Me: Automatic presentation for visual analysis. IEEE transactions on visualiza- tion and computer graphics, 13(6):1137–1144, 2007

work page 2007

[39] [39]

Madan, Z

S. Madan, Z. Bylinskii, M. Tancik, A. Recasens, K. Zhong, S. Alsheikh, H. Pﬁster, A. Oliva, and F. Durand. Synthetically trained icon proposals for parsing and summarizing infograph- ics. arXiv preprint arXiv:1807.10441, 2018

work page arXiv 2018

[40] [40]

M `arquez and H

L. M `arquez and H. Rodr ´ıguez. Part-of-speech tagging using decision trees. In European Conference on Machine Learning , pages 25–36. Springer, 1998

work page 1998

[41] [41]

McCallum and W

A. McCallum and W. Li. Early results for named entity recogni- tion with conditional random ﬁelds, feature induction and web- enhanced lexicons. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003, pages 188–

work page 2003

[42] [42]

Association for Computational Linguistics, 2003

work page 2003

[43] [43]

G. G. M ´endez, M. A. Nacenta, and S. Vandenheste. iV oLVER: Interactive visual language for visualization extraction and re- construction. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, pages 4073–4085. ACM, 2016

work page 2016

[44] [44]

Efficient Estimation of Word Representations in Vector Space

T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efﬁcient esti- mation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[45] [45]

Mitchell

B. Mitchell. Behind the internet curtain. https://www.di gitalrealty.com/blog/behind-the-internet-c urtain, 2014

work page 2014

[46] [46]

A. V . Moere and H. Purchase. On the role of design in infor- mation visualization. Information Visualization, 10(4):356–371, 2011

work page 2011

[47] [47]

A. V . Moere, M. Tomitsch, C. Wimmer, B. Christoph, and T. Grechenig. Evaluating the effect of style in information vi- sualization. IEEE transactions on visualization and computer graphics, 18(12):2739–2748, 2012

work page 2012

[48] [48]

Moritz, C

D. Moritz, C. Wang, G. L. Nelson, H. Lin, A. M. Smith, B. Howe, and J. Heer. Formalizing visualization design knowl- edge as constraints: Actionable and extensible models in draco. IEEE transactions on visualization and computer graphics , 25(1):438–448, 2019

work page 2019

[49] [49]

Nadeau and S

D. Nadeau and S. Sekine. A survey of named entity recogni- tion and classiﬁcation. Lingvisticae Investigationes, 30(1):3–26, 2007

work page 2007

[50] [50]

H. C. Purchase, K. Isaacs, T. Bueti, B. Hastings, A. Kassam, A. Kim, and S. van Hoesen. A classiﬁcation of infographics. In International Conference on Theory and Application of Dia- grams, pages 210–218. Springer, 2018

work page 2018

[51] [51]

Mobile payments world view

Raconteur. Mobile payments world view. https://michae lrosensays.wordpress.com/tag/giving-usa-20 15-infographic, 2015

work page 2015

[52] [52]

L. A. Ramshaw and M. P. Marcus. Text chunking using transformation-based learning. In Natural language processing using very large corpora, pages 157–176. Springer, 1999

work page 1999

[53] [53]

L. F. Rau. Extracting company names from text. In Artiﬁcial In- telligence Applications, 1991. Proceedings., Seventh IEEE Con- ference on, volume 1, pages 29–32. IEEE, 1991

work page 1991

[54] [54]

D. Ren, M. Brehmer, B. Lee, T. H ¨ollerer, and E. K. Choe. Char- taccent: Annotation for data-driven storytelling. In 2017 IEEE Paciﬁc Visualization Symposium (PaciﬁcVis) , pages 230–239. IEEE, 2017

work page 2017

[55] [55]

D. Ren, T. H ¨ollerer, and X. Yuan. iVisDesigner: Expressive interactive design of information visualizations. IEEE trans- actions on visualization and computer graphics , 20(12):2092– 2101, 2014

work page 2092

[56] [56]

D. Ren, B. Lee, and M. Brehmer. Charticulator: Interactive con- struction of bespoke chart layouts. IEEE transactions on visual- ization and computer graphics, 25(1):789–799, 2019

work page 2019

[57] [57]

F. Richter. Ibm tops u.s. patent ranking for 21st consecutive year. https://www.statista.com/chart/1796/us-pat ent-ranking-2013, 2014

work page 2013

[58] [58]

M. J. Rosen. Strong american philanthropy at a record high! https://michaelrosensays.wordpress.com/tag /giving-usa-2015-infographic , 2015

work page 2015

[59] [59]

Satyanarayan and J

A. Satyanarayan and J. Heer. Lyra: An interactive visualization design environment. In Computer Graphics Forum, volume 33, pages 351–360. Wiley Online Library, 2014

work page 2014

[60] [60]

Segel and J

E. Segel and J. Heer. Narrative visualization: Telling stories with data. IEEE transactions on visualization and computer graphics, 16(6):1139–1148, 2010

work page 2010

[61] [61]

Setlur, S

V . Setlur, S. E. Battersby, M. Tory, R. Gossweiler, and A. X. Chang. Eviza: A natural language interface for visual analysis. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology, pages 365–377. ACM, 2016

work page 2016

[62] [62]

Setlur and M

V . Setlur and M. C. Stone. A linguistic approach to categorical color assignment for data visualization. IEEE transactions on visualization and computer graphics, 22(1):698–707, 2016

work page 2016

[63] [63]

W. V . Siricharoen. Infographics: the new communication tools in digital age. In The international conference on e-technologies and business on the web (ebw2013), pages 169–174. The Society of Digital Information and Wireless Communication, 2013

work page 2013

[64] [64]

Skau and R

D. Skau and R. Kosara. Readability and precision in pictorial bar charts. In Proceedings of the Eurographics/IEEE VGTC Confer- ence on Visualization: Short Papers, pages 91–95. Eurographics Association, 2017

work page 2017

[65] [65]

Srinivasan, S

A. Srinivasan, S. M. Drucker, A. Endert, and J. Stasko. Aug- menting visualizations with interactive data facts to facilitate in- terpretation and communication. IEEE transactions on visual- ization and computer graphics, 25(1):672–681, 2018

work page 2018

[66] [66]

J. Stegman. 3 way to grow your support revenue. https: //www.tsia.com/blog/infographic-3-ways-to- grow-your-support-revenue , 2015

work page 2015

[67] [67]

Y . Sun, J. Leigh, A. Johnson, and S. Lee. Articulate: A semi- automated model for translating natural language queries into meaningful visualizations. In International Symposium on Smart Graphics, pages 184–195. Springer, 2010

work page 2010

[68] [68]

E. R. Tufte. The visual display of quantitative information , vol- ume 2. Graphics press Cheshire, CT, 2001

work page 2001

[69] [69]

F. B. Viegas, M. Wattenberg, F. Van Ham, J. Kriss, and M. McK- eon. Manyeyes: a site for visualization at internet scale. IEEE transactions on visualization and computer graphics , 13(6), 2007

work page 2007

[70] [70]

Y . Wang, H. Zhang, H. Huang, X. Chen, Q. Yin, Z. Hou, D. Zhang, Q. Luo, and H. Qu. Infonice: Easy creation of infor- mation graphics. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 335. ACM, 2018

work page 2018

[71] [71]

Willingham

T. Willingham. Alligator pear: Imports and exports. https: //www.freightwaves.com/news/infographics/a lligator-pear-import-export , 2019

work page 2019

[72] [72]

Winograd

T. Winograd. Understanding natural language. Cognitive psy- chology, 3(1):1–191, 1972

work page 1972

[73] [73]

Wongsuphasawat, D

K. Wongsuphasawat, D. Moritz, A. Anand, J. Mackinlay, B. Howe, and J. Heer. V oyager: Exploratory analysis via faceted browsing of visualization recommendations. IEEE transactions on visualization and computer graphics, 22(1):649–658, 2016

work page 2016

[74] [74]

H. Xia, N. Henry Riche, F. Chevalier, B. De Araujo, and D. Wig- dor. Dataink: Direct and creative data-oriented drawing. In Pro- ceedings of the 2018 CHI Conference on Human Factors in Com- puting Systems, page 223. ACM, 2018

work page 2018