Text-to-Viz: Automatic Generation of Infographics from Proportion-Related Natural Language Statements
Pith reviewed 2026-05-24 18:25 UTC · model grok-4.3
The pith
A proof-of-concept system automatically converts natural language statements about simple proportions into sets of pre-designed infographics.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
After mapping the design space through a preliminary study, the authors built a proof-of-concept system that automatically converts statements about simple proportion-related statistics to a set of infographics with pre-designed styles.
What carries the argument
The proof-of-concept system that maps proportion statements to pre-designed infographic styles on the basis of the preliminary design-space study.
If this is right
- Casual users without design training can obtain multiple infographic options from a single proportion statement.
- The system focuses exclusively on simple proportion-related statistics rather than arbitrary data.
- Pre-designed styles replace the need for users to choose layouts or visual elements manually.
- The output set of infographics is intended to be immediately usable for communication.
Where Pith is reading between the lines
- The same pipeline might later handle statements that combine proportions with other quantitative relations if the design space is expanded.
- Voice input could replace typed statements, allowing on-the-fly generation during conversations or presentations.
- The pre-designed style library could be crowdsourced or learned from existing infographics rather than hand-crafted.
Load-bearing premise
The preliminary study sufficiently captures the design space so that pre-designed styles can produce acceptable infographics for the targeted class of statements.
What would settle it
A test in which participants consistently rate the system outputs as visually unappealing or factually misleading for the input statements would show the approach does not work.
Figures
read the original abstract
Combining data content with visual embellishments, infographics can effectively deliver messages in an engaging and memorable manner. Various authoring tools have been proposed to facilitate the creation of infographics. However, creating a professional infographic with these authoring tools is still not an easy task, requiring much time and design expertise. Therefore, these tools are generally not attractive to casual users, who are either unwilling to take time to learn the tools or lacking in proper design expertise to create a professional infographic. In this paper, we explore an alternative approach: to automatically generate infographics from natural language statements. We first conducted a preliminary study to explore the design space of infographics. Based on the preliminary study, we built a proof-of-concept system that automatically converts statements about simple proportion-related statistics to a set of infographics with pre-designed styles. Finally, we demonstrated the usability and usefulness of the system through sample results, exhibits, and expert reviews.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to have conducted a preliminary study exploring the design space of infographics, built a proof-of-concept system (Text-to-Viz) that automatically converts natural language statements about simple proportion-related statistics into infographics using pre-designed styles, and demonstrated the system's usability and usefulness via sample results, exhibits, and expert reviews.
Significance. If the described pipeline holds, the work could lower barriers for casual users to produce engaging proportion-based infographics without requiring design expertise or time-intensive authoring, contributing to automated visualization tools in HCI.
major comments (2)
- [Abstract] Abstract and overall manuscript: the central claim rests on system construction and qualitative expert feedback, yet the text supplies no implementation details, metrics, failure cases, or quantitative evaluation, preventing verification of whether the pre-designed styles reliably cover the targeted input class.
- [Preliminary Study] Preliminary study section: the claim that this study sufficiently maps the design space to enable acceptable pre-designed styles for simple proportion statements is load-bearing, but no methodology, participant details, or derivation process for the styles is provided to assess coverage or completeness.
minor comments (2)
- Add explicit discussion of related work on NL-to-vis systems and infographic authoring tools to better situate the contribution.
- Clarify the exact scope of 'simple proportion-related statistics' with examples of supported and unsupported statement types.
Simulated Author's Rebuttal
We thank the referee for the detailed feedback. The comments highlight areas where the manuscript can be strengthened with additional details on the system and study. We will revise accordingly while maintaining the proof-of-concept nature of the work.
read point-by-point responses
-
Referee: [Abstract] Abstract and overall manuscript: the central claim rests on system construction and qualitative expert feedback, yet the text supplies no implementation details, metrics, failure cases, or quantitative evaluation, preventing verification of whether the pre-designed styles reliably cover the targeted input class.
Authors: We agree that the current version provides limited implementation specifics. In the revision, we will add a dedicated system implementation section describing the NLP pipeline for parsing proportion statements, the rule-based style selection mechanism, the set of pre-designed styles, and concrete examples of both successful outputs and failure cases (e.g., ambiguous statements or unsupported proportion types). We will also include a limitations subsection discussing coverage of the input class. As the contribution is framed as a proof-of-concept rather than a production system, the evaluation remains qualitative via expert reviews; we will clarify this positioning and note that quantitative metrics (e.g., coverage rate on a held-out statement set) could be added if the reviewers consider them essential. revision: yes
-
Referee: [Preliminary Study] Preliminary study section: the claim that this study sufficiently maps the design space to enable acceptable pre-designed styles for simple proportion statements is load-bearing, but no methodology, participant details, or derivation process for the styles is provided to assess coverage or completeness.
Authors: We acknowledge that the preliminary study section is currently high-level. In the revision, we will expand it to report the study methodology (e.g., how infographic examples were collected and analyzed), participant information (number, background, recruitment), the process used to derive the design space dimensions, and the explicit mapping from study findings to the final pre-designed styles. This will allow readers to evaluate the completeness and rationale for the chosen styles. revision: yes
Circularity Check
No significant circularity identified
full rationale
The paper presents a proof-of-concept system for generating infographics from proportion-related statements. It relies on a preliminary study to explore the design space, followed by system construction with pre-designed styles, and evaluation via sample results and expert reviews. No equations, derivations, fitted parameters, predictions, or load-bearing self-citations appear in the argument. The central claim reduces to system construction and qualitative demonstration rather than any self-referential loop or imported uniqueness result. The derivation chain is self-contained against external benchmarks of system-building papers.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
- [1]
- [2]
-
[3]
M. Artacho-Ramirez, J. Diego-Mas, and J. Alcaide-Marzal. In- fluence of the mode of graphical representation on the percep- tion of product aesthetic and emotional features: An exploratory study. International Journal of Industrial Ergonomics , 38(11- 12):942–952, 2008
work page 2008
-
[4]
M. Asahara and Y . Matsumoto. Japanese named entity extraction with redundant morphological analysis. In Proceedings of the 2003 Conference of the North American Chapter of the Associ- ation for Computational Linguistics on Human Language Tech- nology, pages 8–15. Association for Computational Linguistics, 2003
work page 2003
-
[5]
B. Bach, Z. Wang, M. Farinella, D. Murray-Rust, and N. Henry Riche. Design patterns for data comics. InProceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 38. ACM, 2018
work page 2018
-
[6]
G. J. Badros, A. Borning, and P. J. Stuckey. The cassowary linear arithmetic constraint solving algorithm. ACM Transactions on Computer-Human Interaction (TOCHI), 8(4):267–306, 2001
work page 2001
-
[7]
S. Bateman, R. L. Mandryk, C. Gutwin, A. Genest, D. McDine, and C. Brooks. Useful junk?: the effects of visual embellishment on comprehension and memorability of charts. In Proceedings of the SIGCHI Conference on Human Factors in Computing Sys- tems, pages 2573–2582. ACM, 2010
work page 2010
- [8]
-
[9]
D. M. Bikel, S. Miller, R. Schwartz, and R. Weischedel. Nymble: a high-performance learning name-finder. In Proceedings of the fifth conference on Applied natural language processing , pages 194–201. Association for Computational Linguistics, 1997
work page 1997
- [10]
-
[11]
M. A. Borkin, Z. Bylinskii, N. W. Kim, C. M. Bainbridge, C. S. Yeh, D. Borkin, H. Pfister, and A. Oliva. Beyond memorabil- ity: Visualization recognition and recall. IEEE transactions on visualization and computer graphics, 22(1):519–528, 2016
work page 2016
-
[12]
M. A. Borkin, A. A. V o, Z. Bylinskii, P. Isola, S. Sunkavalli, A. Oliva, and H. Pfister. What makes a visualization memorable? IEEE Transactions on Visualization and Computer Graphics , 19(12):2306–2315, 2013
work page 2013
-
[13]
P. F. Brown, P. V . Desouza, R. L. Mercer, V . J. D. Pietra, and J. C. Lai. Class-based n-gram models of natural language. Computa- tional linguistics, 18(4):467–479, 1992
work page 1992
-
[14]
C. Bryan, K.-L. Ma, and J. Woodring. Temporal summary im- ages: An approach to narrative visualization via interactive an- notation generation and placement. IEEE transactions on visu- alization and computer graphics, 23(1):511–520, 2017
work page 2017
-
[15]
Understanding Infographics through Textual and Visual Tag Prediction
Z. Bylinskii, S. Alsheikh, S. Madan, A. Recasens, K. Zhong, H. Pfister, F. Durand, and A. Oliva. Understanding infograph- ics through textual and visual tag prediction. arXiv preprint arXiv:1709.09215, 2017
work page internal anchor Pith review Pith/arXiv arXiv 2017
- [16]
-
[17]
I. Cash. Infographic of infographics. http://www.ivan.c ash/infographic-of-infographics
-
[18]
Coolors. coolors. https://coolors.co/browser
-
[19]
Z. Cui, S. K. Badam, M. A. Yalc ¸in, and N. Elmqvist. Datasite: Proactive visual data exploration with computation of insight- based recommendations. Information Visualization, 18(2):251– 267, 2019
work page 2019
-
[20]
M.-C. De Marneffe, B. MacCartney, C. D. Manning, et al. Gen- erating typed dependency parses from phrase structure parses. In Proceedings of Language Resources and Evaluation Conference, pages 449–454. Genoa Italy, 2006
work page 2006
-
[21]
C ¸ . Demiralp, P. J. Haas, S. Parthasarathy, and T. Pedapati. Fore- sight: Recommending visual insights. Proceedings of the VLDB Endowment, 10(12):1937–1940, 2017
work page 1937
-
[22]
T. Gao, M. Dontcheva, E. Adar, Z. Liu, and K. G. Karahalios. Datatone: Managing ambiguity in natural language interfaces for data visualization. In Proceedings of the 28th Annual ACM Sym- posium on User Interface Software & Technology , pages 489–
- [23]
-
[24]
L. Harrison, K. Reinecke, and R. Chang. Infographic aesthet- ics: Designing for the first impression. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, pages 1187–1190. ACM, 2015
work page 2015
-
[25]
B. Hu, Z. Lu, H. Li, and Q. Chen. Convolutional neural net- work architectures for matching natural language sentences. In Advances in neural information processing systems, pages 2042– 2050, 2014
work page 2042
-
[26]
K. Z. Hu, M. A. Bakker, S. Li, T. Kraska, and C. A. Hidalgo. Vizml: A machine learning approach to visualization recommen- dation. arXiv preprint arXiv:1808.04819, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[27]
J. Hullman, E. Adar, and P. Shah. The impact of social informa- tion on visual judgments. In Proceedings of the SIGCHI Con- ference on Human Factors in Computing Systems , pages 1461–
-
[28]
J. Hullman, N. Diakopoulos, and E. Adar. Contextifier: auto- matic generation of annotated stock visualizations. In Proceed- ings of the SIGCHI Conference on Human Factors in Computing Systems, pages 2707–2716. ACM, 2013
work page 2013
-
[29]
M. Ju, M. Miwa, and S. Ananiadou. A neural layered model for nested named entity recognition. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1, pages 1446–1459, 2018
work page 2018
-
[30]
N. W. Kim, E. Schweickart, Z. Liu, M. Dontcheva, W. Li, J. Popovic, and H. Pfister. Data-driven guides: Supporting ex- pressive design for information graphics. IEEE transactions on visualization and computer graphics, 23(1):491–500, 2017
work page 2017
-
[31]
D. E. Knuth and M. F. Plass. Breaking paragraphs into lines. Software: Practice and Experience, 11(11):1119–1184, 1981
work page 1981
-
[32]
H.-K. Kong, Z. Liu, and K. Karahalios. Internal and external vi- sual cue preferences for visualizations in presentations. In Com- puter Graphics Forum, volume 36, pages 515–525. Wiley Online Library, 2017
work page 2017
-
[33]
R. Kosara. Presentation-oriented visualization techniques. IEEE computer graphics and applications, 36(1):80–85, 2016
work page 2016
-
[34]
M. Lamm, A. T. Chaganty, C. D. Manning, D. Jurafsky, and P. Liang. Textual analogy parsing: What’s shared and what’s compared among analogous facts. arXiv preprint arXiv:1809.02700, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[35]
S. Lin, J. Fortuna, C. Kulkarni, M. Stone, and J. Heer. Selecting semantically-resonant colors for data visualization. In Computer Graphics Forum, volume 32, pages 401–410. Wiley Online Li- brary, 2013
work page 2013
-
[36]
Z. Liu, J. Thompson, A. Wilson, M. Dontcheva, J. Delorey, S. Grigg, B. Kerr, and J. Stasko. Data illustrator: Augmenting vector design tools with lazy data binding for expressive visual- ization authoring. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 123. ACM, 2018
work page 2018
-
[37]
Y . Luo, X. Qin, N. Tang, and G. Li. Deepeye: Towards automatic data visualization. In 2018 IEEE 34th International Conference on Data Engineering (ICDE), pages 101–112. IEEE, 2018
work page 2018
-
[38]
J. Mackinlay, P. Hanrahan, and C. Stolte. Show Me: Automatic presentation for visual analysis. IEEE transactions on visualiza- tion and computer graphics, 13(6):1137–1144, 2007
work page 2007
- [39]
-
[40]
L. M `arquez and H. Rodr ´ıguez. Part-of-speech tagging using decision trees. In European Conference on Machine Learning , pages 25–36. Springer, 1998
work page 1998
-
[41]
A. McCallum and W. Li. Early results for named entity recogni- tion with conditional random fields, feature induction and web- enhanced lexicons. In Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003, pages 188–
work page 2003
-
[42]
Association for Computational Linguistics, 2003
work page 2003
-
[43]
G. G. M ´endez, M. A. Nacenta, and S. Vandenheste. iV oLVER: Interactive visual language for visualization extraction and re- construction. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, pages 4073–4085. ACM, 2016
work page 2016
-
[44]
Efficient Estimation of Word Representations in Vector Space
T. Mikolov, K. Chen, G. Corrado, and J. Dean. Efficient esti- mation of word representations in vector space. arXiv preprint arXiv:1301.3781, 2013
work page internal anchor Pith review Pith/arXiv arXiv 2013
- [45]
-
[46]
A. V . Moere and H. Purchase. On the role of design in infor- mation visualization. Information Visualization, 10(4):356–371, 2011
work page 2011
-
[47]
A. V . Moere, M. Tomitsch, C. Wimmer, B. Christoph, and T. Grechenig. Evaluating the effect of style in information vi- sualization. IEEE transactions on visualization and computer graphics, 18(12):2739–2748, 2012
work page 2012
- [48]
-
[49]
D. Nadeau and S. Sekine. A survey of named entity recogni- tion and classification. Lingvisticae Investigationes, 30(1):3–26, 2007
work page 2007
-
[50]
H. C. Purchase, K. Isaacs, T. Bueti, B. Hastings, A. Kassam, A. Kim, and S. van Hoesen. A classification of infographics. In International Conference on Theory and Application of Dia- grams, pages 210–218. Springer, 2018
work page 2018
-
[51]
Raconteur. Mobile payments world view. https://michae lrosensays.wordpress.com/tag/giving-usa-20 15-infographic, 2015
work page 2015
-
[52]
L. A. Ramshaw and M. P. Marcus. Text chunking using transformation-based learning. In Natural language processing using very large corpora, pages 157–176. Springer, 1999
work page 1999
-
[53]
L. F. Rau. Extracting company names from text. In Artificial In- telligence Applications, 1991. Proceedings., Seventh IEEE Con- ference on, volume 1, pages 29–32. IEEE, 1991
work page 1991
-
[54]
D. Ren, M. Brehmer, B. Lee, T. H ¨ollerer, and E. K. Choe. Char- taccent: Annotation for data-driven storytelling. In 2017 IEEE Pacific Visualization Symposium (PacificVis) , pages 230–239. IEEE, 2017
work page 2017
-
[55]
D. Ren, T. H ¨ollerer, and X. Yuan. iVisDesigner: Expressive interactive design of information visualizations. IEEE trans- actions on visualization and computer graphics , 20(12):2092– 2101, 2014
work page 2092
-
[56]
D. Ren, B. Lee, and M. Brehmer. Charticulator: Interactive con- struction of bespoke chart layouts. IEEE transactions on visual- ization and computer graphics, 25(1):789–799, 2019
work page 2019
-
[57]
F. Richter. Ibm tops u.s. patent ranking for 21st consecutive year. https://www.statista.com/chart/1796/us-pat ent-ranking-2013, 2014
work page 2013
-
[58]
M. J. Rosen. Strong american philanthropy at a record high! https://michaelrosensays.wordpress.com/tag /giving-usa-2015-infographic , 2015
work page 2015
-
[59]
A. Satyanarayan and J. Heer. Lyra: An interactive visualization design environment. In Computer Graphics Forum, volume 33, pages 351–360. Wiley Online Library, 2014
work page 2014
-
[60]
E. Segel and J. Heer. Narrative visualization: Telling stories with data. IEEE transactions on visualization and computer graphics, 16(6):1139–1148, 2010
work page 2010
- [61]
-
[62]
V . Setlur and M. C. Stone. A linguistic approach to categorical color assignment for data visualization. IEEE transactions on visualization and computer graphics, 22(1):698–707, 2016
work page 2016
-
[63]
W. V . Siricharoen. Infographics: the new communication tools in digital age. In The international conference on e-technologies and business on the web (ebw2013), pages 169–174. The Society of Digital Information and Wireless Communication, 2013
work page 2013
-
[64]
D. Skau and R. Kosara. Readability and precision in pictorial bar charts. In Proceedings of the Eurographics/IEEE VGTC Confer- ence on Visualization: Short Papers, pages 91–95. Eurographics Association, 2017
work page 2017
-
[65]
A. Srinivasan, S. M. Drucker, A. Endert, and J. Stasko. Aug- menting visualizations with interactive data facts to facilitate in- terpretation and communication. IEEE transactions on visual- ization and computer graphics, 25(1):672–681, 2018
work page 2018
-
[66]
J. Stegman. 3 way to grow your support revenue. https: //www.tsia.com/blog/infographic-3-ways-to- grow-your-support-revenue , 2015
work page 2015
-
[67]
Y . Sun, J. Leigh, A. Johnson, and S. Lee. Articulate: A semi- automated model for translating natural language queries into meaningful visualizations. In International Symposium on Smart Graphics, pages 184–195. Springer, 2010
work page 2010
-
[68]
E. R. Tufte. The visual display of quantitative information , vol- ume 2. Graphics press Cheshire, CT, 2001
work page 2001
-
[69]
F. B. Viegas, M. Wattenberg, F. Van Ham, J. Kriss, and M. McK- eon. Manyeyes: a site for visualization at internet scale. IEEE transactions on visualization and computer graphics , 13(6), 2007
work page 2007
-
[70]
Y . Wang, H. Zhang, H. Huang, X. Chen, Q. Yin, Z. Hou, D. Zhang, Q. Luo, and H. Qu. Infonice: Easy creation of infor- mation graphics. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, page 335. ACM, 2018
work page 2018
-
[71]
T. Willingham. Alligator pear: Imports and exports. https: //www.freightwaves.com/news/infographics/a lligator-pear-import-export , 2019
work page 2019
- [72]
-
[73]
K. Wongsuphasawat, D. Moritz, A. Anand, J. Mackinlay, B. Howe, and J. Heer. V oyager: Exploratory analysis via faceted browsing of visualization recommendations. IEEE transactions on visualization and computer graphics, 22(1):649–658, 2016
work page 2016
-
[74]
H. Xia, N. Henry Riche, F. Chevalier, B. De Araujo, and D. Wig- dor. Dataink: Direct and creative data-oriented drawing. In Pro- ceedings of the 2018 CHI Conference on Human Factors in Com- puting Systems, page 223. ACM, 2018
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.