Chat Modeling: Interaction-Enhanced Agent Framework for Visualizing Literature-Grounded Biological Structures

Donggang Jia; Ivan Viola; Yunhai Wang

arxiv: 2404.01063 · v2 · pith:MG2WRLLOnew · submitted 2024-04-01 · 💻 cs.HC · cs.GR

Chat Modeling: Interaction-Enhanced Agent Framework for Visualizing Literature-Grounded Biological Structures

Donggang Jia , Yunhai Wang , Ivan Viola This is my paper

Pith reviewed 2026-05-24 02:28 UTC · model grok-4.3

classification 💻 cs.HC cs.GR

keywords 3D modelingLLM agentsbiological visualizationinteractive agentsmodeling memorychat interfaceliterature to modelhuman-computer interaction

0 comments

The pith

Collaborative LLM agents convert natural language from biology papers into structured 3D modeling operations and final models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents a framework that lets bioscientists build 3D models of structures described in research publications by feeding text inputs to a team of agents. These agents handle planning and execution while supporting new chat-based interactions that combine typed commands with mouse actions and generate dynamic widgets. A specialized modeling memory tracks past work, collects feedback, and stores skills so that modeling accuracy rises with continued use. The system outputs structured JSON commands that drive the modeling software and produces the final 3D results. Evaluation on a collected dataset plus user studies and expert interviews indicate the approach works for scientific visualization tasks.

Core claim

The framework transforms user inputs including natural language descriptions, research publication content, and textual descriptions of the existing scene into modeling operations in a structured JSON format and final 3D results. The major technical contribution lies in the collaborative agent design that simultaneously supports model planning, execution, and novel user interaction design such as interactive modeling execution and dynamic widget generation that fuse text and mouse interaction within the chat window. The framework further incorporates a customized modeling memory to enhance user interaction, featuring components such as personalized memory management, feedback collection, and

What carries the argument

Collaborative agent design for simultaneous model planning and execution plus customized modeling memory with personalized management, feedback collection, and skill library.

If this is right

Bioscientists can produce 3D visualizations without first mastering complex modeling software.
Interactive modeling execution and dynamic widget generation become available inside the chat window.
Modeling performance improves over time as the memory accumulates personalized data and skills.
The system supports direct use of publication content as input for model construction.
Quantitative results on the collected dataset confirm the framework produces usable 3D models.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The memory and skill-library design could support sharing of modeling expertise across multiple users or research groups.
The same agent pattern might transfer to other domains that require turning textual descriptions into geometric models.
Live connection to experimental data streams could let the system update models when new measurements appear in the literature.
Automated checks against known geometric constraints of biological structures could be added to catch errors before rendering.

Load-bearing premise

LLM-based agents can reliably turn natural language descriptions of biological structures, publication content, and current scene states into correct structured JSON modeling operations without hallucinations or errors.

What would settle it

Provide the framework with a specific, well-documented biological structure from a publication and check whether the output 3D model contains geometry errors or invalid operations traceable to incorrect JSON commands.

Figures

Figures reproduced from arXiv: 2404.01063 by Donggang Jia, Ivan Viola, Yunhai Wang.

**Figure 1.** Figure 1: Chat Modeling is a procedural modeling system that takes users’ textual input and completes corresponding [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 2.** Figure 2: Examples of rule types: for each rule, the left image shows rule creation and the right image shows the [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: The framework starts with user text input, processed by the Modeling Translator, which creates prompts for [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗

**Figure 4.** Figure 4: The Modeling Translator consists of the code generator and the code interpreter. The code generator creates [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗

**Figure 5.** Figure 5: User-refined few-shot prompting setting. The prompt includes a task description, initial examples, and user [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 6.** Figure 6: The code correction operation involves an iterative process for syntax analysis and error fixing. [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗

**Figure 7.** Figure 7: Automatic mode generates the blood plasma model in six steps, each with a rule created from an LLM [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗

**Figure 8.** Figure 8: The step-by-step mode is demonstrated by the biology structure modeling, together with the whole conversa [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗

read the original abstract

Bioscientists frequently seek to visualize the biological systems they have empirically characterized and reported in the literature. Realizing such visualizations requires biological structure modeling, an inherently complex process that demands both biological and geometric understanding. This paper addresses the problem of constructing such 3D models for visualization. In this paper, we introduce a novel agent framework that mitigates the challenges of operating 3D modeling software by transforming user inputs, including natural language descriptions, research publication content, and textual descriptions of the existing objects and structures in the current scene, into modeling operations in a structured JSON format and final 3D results. The major technical contribution lies in the collaborative agent design that simultaneously supports model planning, execution, and novel user interaction design, such as interactive modeling execution and dynamic widget generation that fuse text and mouse interaction within the chat window. The framework further incorporates a customized modeling memory to enhance user interaction, featuring components such as personalized memory management, feedback collection, and skill library design. This modeling memory is leveraged to enable improved 3D modeling performance over time. The quantitative evaluation on our collected dataset showcases the effectiveness of our framework. We also develop a prototype tool, Chat Modeling, and demonstrate its usage through two modeling case studies. Our user study and expert interviews highlight the potential of our approach for use in scientific workflows.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper builds a multi-agent chat prototype for turning literature into 3D biological models with dynamic widgets and memory, but the untested LLM-to-JSON reliability undercuts the performance claims.

read the letter

The core contribution is a working prototype that routes natural language from papers and users through collaborative agents to produce 3D models, with added dynamic widgets that combine text and mouse input inside the chat and a modeling memory that includes a skill library for reuse. Those interaction details and the memory design are the concrete new elements beyond standard multi-agent setups. They shipped case studies, a user study, and expert interviews, which gives the work some grounding in actual use rather than pure description. The quantitative evaluation is mentioned but not detailed in the abstract, so the strength of the results is still unclear from what is shown. The load-bearing step is the agents mapping scene descriptions and paper excerpts into precise JSON modeling commands. No validation, sandbox checks, or error-recovery loops are described, and the memory improvement argument assumes the base outputs are already usable enough to build on. Without those safeguards or reported error rates, the reliability claim rests on an assumption that current LLMs handle structured biological geometry tasks cleanly. This paper is for HCI researchers working on agent-supported scientific tools and for visualization groups that want examples of chat-plus-widget interfaces. A reader focused on practical agent memory designs would pick up usable ideas from the skill library and feedback components. It deserves peer review because the prototype and human evaluation are real deliverables that can be examined and extended, even if the evaluation needs more metrics and baselines to stand up.

Referee Report

3 major / 2 minor

Summary. The paper introduces Chat Modeling, a collaborative multi-agent LLM framework that converts natural-language descriptions of biological structures from publications and scene states into structured JSON modeling operations for 3D visualization. Key components include agents for planning and execution, dynamic widget generation fusing text and mouse input in the chat interface, and a customized modeling memory (personalized management, feedback collection, skill library) intended to improve performance over time. The work is supported by quantitative evaluation on a collected dataset, two case studies, a user study, and expert interviews.

Significance. If the core LLM-to-JSON pipeline proves reliable, the system could meaningfully reduce the expertise barrier for bioscientists to produce literature-grounded 3D models. The interactive design elements and memory-augmented agents represent a concrete advance in applied agent frameworks for scientific HCI. The provision of a working prototype and multi-method evaluation (dataset, cases, users, experts) strengthens the applied contribution.

major comments (3)

[Abstract and §4] Abstract and §4 (Evaluation): The central claim that the modeling memory yields 'improved 3D modeling performance over time' rests on the untested assumption that the LLM agents produce sufficiently accurate JSON outputs. No metrics, baselines, error rates, or robustness analysis are reported for JSON schema adherence, coordinate accuracy, or biological validity; without these, the memory improvement cannot be isolated from initial hallucination failures.
[§3] §3 (System Design): The framework description does not specify any JSON schema validation, execution sandboxing, or repair loops for malformed or hallucinated outputs. Given that a single incorrect field or reference produces an invalid 3D model, the absence of these safeguards is load-bearing for the claimed reliability of the planning-execution pipeline.
[§4] §4 (Quantitative Evaluation): The abstract states that a 'collected dataset' is used to showcase effectiveness, yet no details are provided on dataset construction, size, annotation process, chosen metrics (e.g., JSON validity rate, geometric error), or comparison against non-agent baselines. This omission prevents assessment of whether the collaborative-agent design actually outperforms simpler prompting approaches.

minor comments (2)

[Abstract] The abstract and introduction would benefit from a concise statement of the exact JSON schema fields and an example of a successful transformation to ground the technical contribution.
[Case Studies] Figure captions and the case-study section should explicitly link each illustrated widget or memory component to the corresponding system module described in §3.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback, which identifies key areas where additional detail and analysis will strengthen the manuscript. We address each major comment below and will incorporate revisions to improve rigor and transparency.

read point-by-point responses

Referee: [Abstract and §4] Abstract and §4 (Evaluation): The central claim that the modeling memory yields 'improved 3D modeling performance over time' rests on the untested assumption that the LLM agents produce sufficiently accurate JSON outputs. No metrics, baselines, error rates, or robustness analysis are reported for JSON schema adherence, coordinate accuracy, or biological validity; without these, the memory improvement cannot be isolated from initial hallucination failures.

Authors: We agree that the evaluation would be strengthened by explicit metrics isolating JSON output quality. The reported quantitative results measure end-to-end task success on the dataset, which requires correct JSON generation, but separate breakdown of schema adherence, coordinate accuracy, and error rates was not included. We will revise §4 to add these metrics, robustness analysis, and non-agent baselines so the contribution of modeling memory can be more clearly isolated. revision: yes
Referee: [§3] §3 (System Design): The framework description does not specify any JSON schema validation, execution sandboxing, or repair loops for malformed or hallucinated outputs. Given that a single incorrect field or reference produces an invalid 3D model, the absence of these safeguards is load-bearing for the claimed reliability of the planning-execution pipeline.

Authors: The current manuscript emphasizes the agent collaboration and interaction mechanisms rather than low-level error handling. The prototype does perform JSON parsing with basic validation and error recovery, but these were not described. We will expand §3 to document the JSON schema, validation procedures, any sandboxing, and repair loops used to mitigate malformed outputs. revision: yes
Referee: [§4] §4 (Quantitative Evaluation): The abstract states that a 'collected dataset' is used to showcase effectiveness, yet no details are provided on dataset construction, size, annotation process, chosen metrics (e.g., JSON validity rate, geometric error), or comparison against non-agent baselines. This omission prevents assessment of whether the collaborative-agent design actually outperforms simpler prompting approaches.

Authors: We acknowledge that dataset and evaluation details were insufficiently specified. The dataset was built from literature-derived natural-language descriptions of biological structures paired with ground-truth modeling operations. We will revise §4 to include dataset size, construction and annotation process, full metric definitions (including JSON validity and geometric error), and direct comparisons against simpler prompting baselines. revision: yes

Circularity Check

0 steps flagged

No circularity: applied systems paper with no derivations or fitted predictions

full rationale

The paper describes a built prototype (Chat Modeling) that uses LLM agents to map natural-language inputs to JSON modeling commands for 3D biological structures. No equations, first-principles derivations, parameter fitting, or predictions appear anywhere in the manuscript. The claimed improvements from modeling memory and collaborative agents are presented as engineering outcomes evaluated via case studies and user interviews, not as quantities derived from or equivalent to the framework's own inputs. No self-citation chains or ansatzes are invoked to justify core claims. This matches the default expectation for non-theoretical systems work.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 2 invented entities

The central claim rests on the unverified reliability of LLM agents for structured output generation and the effectiveness of the newly introduced memory components; these are postulated without independent evidence outside the paper's own evaluation.

axioms (1)

domain assumption Large language models can be reliably prompted to convert complex biological descriptions and scene states into valid structured JSON modeling operations.
Invoked throughout the framework description as the mechanism for transforming user inputs into executable commands.

invented entities (2)

Collaborative multi-agent system with planning, execution, and interaction components plus dynamic widget generation no independent evidence
purpose: To handle model planning, execution, and novel text-mouse fused interactions in the chat window
New design element introduced to address challenges of 3D modeling software operation.
Customized modeling memory with personalized management, feedback collection, and skill library no independent evidence
purpose: To enhance user interaction and enable improved 3D modeling performance over time
Custom component proposed as a major technical contribution for long-term improvement.

pith-pipeline@v0.9.0 · 5769 in / 1564 out tokens · 35622 ms · 2026-05-24T02:28:58.097339+00:00 · methodology

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Raiven: LLM-Based Visualization Authoring via Domain-Specific Language Mediation
cs.HC 2026-04 unverdicted novelty 7.0

Raiven mediates LLM visualization authoring via a formally defined DSL that unifies scientific and information visualization, producing deterministic, verifiable code from metadata-only inputs.

Reference graph

Works this paper leans on

50 extracted references · 50 canonical work pages · cited by 1 Pith paper · 2 internal anchors

[1]

Research and scholarly methods: Semi-structured interviews

Omolola A Adeoye-Olatunde and Nicole L Olenik. Research and scholarly methods: Semi-structured interviews. Journal of the american college of clinical pharmacy, 4(10):1358–1367, 2021

work page 2021
[2]

Maya - 3D Computer Animation, Modeling, Simulation, and Rendering Software

Autodesk, Inc. Maya - 3D Computer Animation, Modeling, Simulation, and Rendering Software. https: //www.autodesk.ae/products/maya/overview, 2024. Accessed: 2024-03-18

work page 2024
[3]

Chloroplast Model

Mohammed Zia Baig, Ondrej Strnad, Ivan Viola, and Deng Luo. Chloroplast Model. https://www.nanovis. org/Chloroplast-model.html, Access Year. Accessed on 13/04/2023

work page 2023
[4]

Hybrid Tactile/Tangible Interaction for 3D Data Exploration

Lonni Besanc ¸on, Paul Issartel, Mehdi Ammi, and Tobias Isenberg. Hybrid Tactile/Tangible Interaction for 3D Data Exploration. IEEE Transactions on Visualization and Computer Graphics, 23(1):881–890, 2017

work page 2017
[5]

Biau, Bruno Frachet, Virginie Pineau, El Hadi Sariali, Marc Soubeyrand, Rabah Taouachi, Tobias Isenberg, and Pierre Dragicevic

Lonni Besanc ¸on, Amir Semmo, David J. Biau, Bruno Frachet, Virginie Pineau, El Hadi Sariali, Marc Soubeyrand, Rabah Taouachi, Tobias Isenberg, and Pierre Dragicevic. Reducing Affective Responses to Surgical Images and Videos Through Stylization. Computer Graphics Forum, 39(1):462–483, January 2020

work page 2020
[6]

Blender - a 3D modelling and rendering package

Blender Foundation. Blender - a 3D modelling and rendering package. https://www.blender.org/, 2024. Accessed: 2024-03-18

work page 2024
[7]

Language models are few-shot learners

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901, 2020

work page 1901
[8]

Local Standards for Sample Size at CHI

Kelly Caine. Local Standards for Sample Size at CHI. In Proc. CHI, CHI ’16, pages 981–992, New York, NY , USA, 2016. ACM

work page 2016
[9]

Layoutgpt: Compositional visual planning and generation with large language models

Weixi Feng, Wanrong Zhu, Tsu-jui Fu, Varun Jampani, Arjun Akula, Xuehai He, Sugato Basu, Xin Eric Wang, and William Yang Wang. Layoutgpt: Compositional visual planning and generation with large language models. Advances in Neural Information Processing Systems, 36, 2024

work page 2024
[10]

Procedural generation of roads

Eric Galin, Adrien Peytavie, Nicolas Mar ´echal, and Eric Gu ´erin. Procedural generation of roads. In Computer Graphics Forum, volume 29, pages 429–438. Wiley Online Library, 2010

work page 2010
[11]

Inverse procedural modeling of branching structures by inferring l-systems

Jianwei Guo, Haiyong Jiang, Bedrich Benes, Oliver Deussen, Xiaopeng Zhang, Dani Lischinski, and Hui Huang. Inverse procedural modeling of branching structures by inferring l-systems. ACM Transactions on Graphics (TOG), 39(5):1–13, 2020

work page 2020
[12]

SpyDirect: A Novel Biofunctionalization Method for High Stability and Longevity of Electronic Biosensors

Keying Guo, Raik Gr¨unberg, Yuxiang Ren, Tianrui Chang, Shofarul Wustoni, Ondrej Strnad, Anil Koklu, Escarlet D´ıaz-Galicia, Jessica Parrado Agudelo, Victor Druet, et al. SpyDirect: A Novel Biofunctionalization Method for High Stability and Longevity of Electronic Biosensors. Advanced Science, page 2306716

work page
[13]

Visual programming: Compositional visual reasoning without training

Tanmay Gupta and Aniruddha Kembhavi. Visual programming: Compositional visual reasoning without training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14953–14962, 2023

work page 2023
[14]

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

Wenlong Huang, Chen Wang, Ruohan Zhang, Yunzhu Li, Jiajun Wu, and Li Fei-Fei. V oxposer: Composable 3d value maps for robotic manipulation with language models. arXiv preprint arXiv:2307.05973, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[15]

Isenberg, P

T. Isenberg, P. Isenberg, J. Chen, M. Sedlmair, and T. M¨oller. A Systematic Review on the Practice of Evaluating Visualization. IEEE Transactions on Visualization and Computer Graphics, 19(12):2818–2827, Dec 2013. 16 Chat Modeling A PREPRINT

work page 2013
[16]

VOICE: Visual Oracle for Interaction, Conversation, and Explanation, 2024

Donggang Jia, Alexandra Irger, Lonni Besancon, Ondrej Strnad, Deng Luo, Johanna Bjorklund, Anders Ynnerman, and Ivan Viola. VOICE: Visual Oracle for Interaction, Conversation, and Explanation, 2024

work page 2024
[17]

The kit object models database: An object model database for object recognition, localization and manipulation in service robotics

Alexander Kasper, Zhixing Xue, and R ¨udiger Dillmann. The kit object models database: An object model database for object recognition, localization and manipulation in service robotics. The International Journal of Robotics Research, 31(8):927–934, 2012

work page 2012
[18]

An efficient method for fully automatic 3D digitization of unknown objects

Souhaiel Khalfaoui, Ralph Seulin, Yohan Fougerolle, and David Fofi. An efficient method for fully automatic 3D digitization of unknown objects. Computers in Industry, 64(9):1152–1160, 2013

work page 2013
[19]

How many participants do researchers recruit? A look at 678 UX/HCI studies

Lisa Koeman. How many participants do researchers recruit? A look at 678 UX/HCI studies. Online. Last visited 06 January 2019, 2018

work page 2019
[20]

Molecumentary: Adaptable narrated documentaries using molecular visualization

David Kou ˇril, Ond ˇrej Strnad, Peter Mindek, Sarkis Halladjian, Tobias Isenberg, M Eduard Gr ¨oller, and Ivan Viola. Molecumentary: Adaptable narrated documentaries using molecular visualization. IEEE Transactions on Visualization and Computer Graphics, 29(3):1733–1747, 2021

work page 2021
[21]

Autonomous 3D modeling of unknown objects for active scene exploration

Simon Kriegel. Autonomous 3D modeling of unknown objects for active scene exploration. PhD thesis, Technis- che Universit¨at M¨unchen (TUM), 2015

work page 2015
[22]

VIRD: Immersive Match Video Analysis for High-Performance Badminton Coaching

Tica Lin, Alexandre Aouididi, Zhutian Chen, Johanna Beyer, Hanspeter Pfister, and Jui-Hsien Wang. VIRD: Immersive Match Video Analysis for High-Performance Badminton Coaching. IEEE transactions on visualization and computer graphics, 2023

work page 2023
[23]

Advisor: Automatic visualization answer for natural-language question on tabular data

Can Liu, Yun Han, Ruike Jiang, and Xiaoru Yuan. Advisor: Automatic visualization answer for natural-language question on tabular data. In 2021 IEEE 14th Pacific Visualization Symposium (PacificVis), pages 11–20, 2021

work page 2021
[24]

X-mesh: Towards fast and accurate text-driven 3d stylization via dynamic textual guidance

Yiwei Ma, Xiaoqing Zhang, Xiaoshuai Sun, Jiayi Ji, Haowei Wang, Guannan Jiang, Weilin Zhuang, and Rongrong Ji. X-mesh: Towards fast and accurate text-driven 3d stylization via dynamic textual guidance. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 2749–2760, October 2023

work page 2023
[25]

Chat2VIS: Fine-Tuning Data Visualisations using Multilingual Natural Language Text and Pre-Trained Large Language Models

Paula Maddigan and Teo Susnjak. Chat2VIS: Fine-Tuning Data Visualisations using Multilingual Natural Language Text and Pre-Trained Large Language Models. arXiv preprint arXiv:2303.14292, 2023

work page arXiv 2023
[26]

Model Synthesis: A General Procedural Modeling Algorithm

Paul Merrell and Dinesh Manocha. Model Synthesis: A General Procedural Modeling Algorithm. IEEE Transactions on Visualization and Computer Graphics, 17(6):715–728, 2011

work page 2011
[27]

Object 3dit: Language-guided 3d-aware image editing

Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, and Tanmay Gupta. Object 3dit: Language-guided 3d-aware image editing. Advances in Neural Information Processing Systems, 36, 2024

work page 2024
[28]

Facilitating conversational interaction in natural language interfaces for visualization

Rishab Mitra, Arpit Narechania, Alex Endert, and John Stasko. Facilitating conversational interaction in natural language interfaces for visualization. In 2022 IEEE Visualization and Visual Analytics (VIS), pages 6–10. IEEE, 2022

work page 2022
[29]

Clip-mesh: Generating textured meshes from text using pretrained image-text models

Nasir Mohammad Khalid, Tianhao Xie, Eugene Belilovsky, and Tiberiu Popa. Clip-mesh: Generating textured meshes from text using pretrained image-text models. In SIGGRAPH Asia 2022 Conference Papers, SA ’22. ACM, November 2022

work page 2022
[30]

Procedural modeling of buildings

Pascal M¨uller, Peter Wonka, Simon Haegler, Andreas Ulmer, and Luc Van Gool. Procedural modeling of buildings. In ACM SIGGRAPH 2006 Papers, pages 614–623. 2006

work page 2006
[31]

NL4DV: A toolkit for generating analytic specifications for data visualization from natural language queries

Arpit Narechania, Arjun Srinivasan, and John Stasko. NL4DV: A toolkit for generating analytic specifications for data visualization from natural language queries. IEEE Transactions on Visualization and Computer Graphics, 27(2):369–379, 2020

work page 2020
[32]

Modeling in the time of COVID-19: Statistical and rule-based mesoscale models

Ngan Nguyen, Ondˇrej Strnad, Tobias Klein, Deng Luo, Ruwayda Alharbi, Peter Wonka, Martina Maritan, Peter Mindek, Ludovic Autin, David S Goodsell, et al. Modeling in the time of COVID-19: Statistical and rule-based mesoscale models. IEEE transactions on visualization and computer graphics, 27(2):722–732, 2020

work page 2020
[33]

Procedural urban forestry

Till Niese, S¨oren Pirk, Matthias Albrecht, Bedrich Benes, and Oliver Deussen. Procedural urban forestry. ACM Transactions on Graphics (TOG), 41(2):1–18, 2022

work page 2022
[34]

Numeracy from literacy: Data science as an emergent skill from large language models

David Noever and Forrest McKee. Numeracy from literacy: Data science as an emergent skill from large language models. arXiv preprint arXiv:2301.13382, 2023

work page arXiv 2023
[35]

GPT-4 Technical Report, 2024

OpenAI. GPT-4 Technical Report, 2024

work page 2024
[36]

Procedural modeling of cities

Yoav IH Parish and Pascal M ¨uller. Procedural modeling of cities. In Proceedings of the 28th annual conference on Computer graphics and interactive techniques, pages 301–308, 2001. 17 Chat Modeling A PREPRINT

work page 2001
[37]

Working with Forensic Practitioners to Understand the Opportunities and Challenges for Mixed-Reality Digital Autopsy

Vahid Pooryousef, Maxime Cordeil, Lonni Besan c ¸on, Christophe Hurter, Tim Dwyer, and Richard Bassed. Working with Forensic Practitioners to Understand the Opportunities and Challenges for Mixed-Reality Digital Autopsy. In Proc. CHI, CHI ’23, New York, NY , USA, 2023. Association for Computing Machinery

work page 2023
[38]

The algorithmic beauty of plants

Przemyslaw Prusinkiewicz and Aristid Lindenmayer. The algorithmic beauty of plants. Springer Science & Business Media, 2012

work page 2012
[39]

Robust speech recognition via large-scale weak supervision

Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, and Ilya Sutskever. Robust speech recognition via large-scale weak supervision. In International Conference on Machine Learning, pages 28492–28518. PMLR, 2023

work page 2023
[40]

Infinite photorealistic worlds using procedural generation

Alexander Raistrick, Lahav Lipson, Zeyu Ma, Lingjie Mei, Mingzhe Wang, Yiming Zuo, Karhan Kayan, Hongyu Wen, Beining Han, Yihan Wang, et al. Infinite photorealistic worlds using procedural generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12630–12641, 2023

work page 2023
[41]

Vis-SPLIT: Interactive Hierarchical Modeling for mRNA Expression Classification

Braden Roper, James C Mathews, Saad Nadeem, and Ji Hwan Park. Vis-SPLIT: Interactive Hierarchical Modeling for mRNA Expression Classification. In 2023 IEEE Visualization and Visual Analytics (VIS), pages 106–110. IEEE, 2023

work page 2023
[42]

Advanced procedural modeling of architecture

Michael Schwarz and Pascal M ¨uller. Advanced procedural modeling of architecture. ACM Transactions on Graphics (TOG), 34(4):1–12, 2015

work page 2015
[43]

3d-gpt: Procedural 3d modeling with large language models

Chunyi Sun, Junlin Han, Weijian Deng, Xinlong Wang, Zishan Qin, and Stephen Gould. 3d-gpt: Procedural 3d modeling with large language models. arXiv preprint arXiv:2310.12945, 2023

work page arXiv 2023
[44]

ViperGPT: Visual Inference via Python Execution for Reasoning

D´ıdac Sur´ıs, Sachit Menon, and Carl V ondrick. Vipergpt: Visual inference via python execution for reasoning. arXiv preprint arXiv:2303.08128, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[45]

Qt — Tools for Each Stage of Software Development Lifecycle

The Qt Company. Qt — Tools for Each Stage of Software Development Lifecycle. https://www.qt.io/, 2024. Accessed: 2024-03-23

work page 2024
[46]

Prompt a robot to walk with large language models

Yen-Jen Wang, Bike Zhang, Jianyu Chen, and Koushil Sreenath. Prompt a robot to walk with large language models. arXiv preprint arXiv:2309.09969, 2023

work page arXiv 2023
[47]

Towards natural language-based visualization authoring

Yun Wang, Zhitao Hou, Leixian Shen, Tongshuang Wu, Jiaqi Wang, He Huang, Haidong Zhang, and Dongmei Zhang. Towards natural language-based visualization authoring. IEEE Transactions on Visualization and Computer Graphics, 29(1):1222–1232, 2022

work page 2022
[48]

Procedural cloudscapes

Antoine Webanck, Yann Cortial, Eric Gu ´erin, and Eric Galin. Procedural cloudscapes. In Computer Graphics Forum, volume 37, pages 431–442. Wiley Online Library, 2018

work page 2018
[49]

Chain-of-thought prompting elicits reasoning in large language models

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Fei Xia, Ed Chi, Quoc V Le, Denny Zhou, et al. Chain-of-thought prompting elicits reasoning in large language models. Advances in Neural Information Processing Systems, 35:24824–24837, 2022

work page 2022
[50]

T4 model

John Winfer, Aeliya Syed, Paul Ekers, Leon Thistle, Ngan Nguyen, Ondrej Strnad, David Goodsell, Ivan Viola, and Deng Luo. T4 model. https://www.nanovis.org/T4-model.html. Accessed on 09/07/2021. 18

work page 2021

[1] [1]

Research and scholarly methods: Semi-structured interviews

Omolola A Adeoye-Olatunde and Nicole L Olenik. Research and scholarly methods: Semi-structured interviews. Journal of the american college of clinical pharmacy, 4(10):1358–1367, 2021

work page 2021

[2] [2]

Maya - 3D Computer Animation, Modeling, Simulation, and Rendering Software

Autodesk, Inc. Maya - 3D Computer Animation, Modeling, Simulation, and Rendering Software. https: //www.autodesk.ae/products/maya/overview, 2024. Accessed: 2024-03-18

work page 2024

[3] [3]

Chloroplast Model

Mohammed Zia Baig, Ondrej Strnad, Ivan Viola, and Deng Luo. Chloroplast Model. https://www.nanovis. org/Chloroplast-model.html, Access Year. Accessed on 13/04/2023

work page 2023

[4] [4]

Hybrid Tactile/Tangible Interaction for 3D Data Exploration

Lonni Besanc ¸on, Paul Issartel, Mehdi Ammi, and Tobias Isenberg. Hybrid Tactile/Tangible Interaction for 3D Data Exploration. IEEE Transactions on Visualization and Computer Graphics, 23(1):881–890, 2017

work page 2017

[5] [5]

Biau, Bruno Frachet, Virginie Pineau, El Hadi Sariali, Marc Soubeyrand, Rabah Taouachi, Tobias Isenberg, and Pierre Dragicevic

Lonni Besanc ¸on, Amir Semmo, David J. Biau, Bruno Frachet, Virginie Pineau, El Hadi Sariali, Marc Soubeyrand, Rabah Taouachi, Tobias Isenberg, and Pierre Dragicevic. Reducing Affective Responses to Surgical Images and Videos Through Stylization. Computer Graphics Forum, 39(1):462–483, January 2020

work page 2020

[6] [6]

Blender - a 3D modelling and rendering package

Blender Foundation. Blender - a 3D modelling and rendering package. https://www.blender.org/, 2024. Accessed: 2024-03-18

work page 2024

[7] [7]

Language models are few-shot learners

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. Advances in neural information processing systems, 33:1877–1901, 2020

work page 1901

[8] [8]

Local Standards for Sample Size at CHI

Kelly Caine. Local Standards for Sample Size at CHI. In Proc. CHI, CHI ’16, pages 981–992, New York, NY , USA, 2016. ACM

work page 2016

[9] [9]

Layoutgpt: Compositional visual planning and generation with large language models

Weixi Feng, Wanrong Zhu, Tsu-jui Fu, Varun Jampani, Arjun Akula, Xuehai He, Sugato Basu, Xin Eric Wang, and William Yang Wang. Layoutgpt: Compositional visual planning and generation with large language models. Advances in Neural Information Processing Systems, 36, 2024

work page 2024

[10] [10]

Procedural generation of roads

Eric Galin, Adrien Peytavie, Nicolas Mar ´echal, and Eric Gu ´erin. Procedural generation of roads. In Computer Graphics Forum, volume 29, pages 429–438. Wiley Online Library, 2010

work page 2010

[11] [11]

Inverse procedural modeling of branching structures by inferring l-systems

Jianwei Guo, Haiyong Jiang, Bedrich Benes, Oliver Deussen, Xiaopeng Zhang, Dani Lischinski, and Hui Huang. Inverse procedural modeling of branching structures by inferring l-systems. ACM Transactions on Graphics (TOG), 39(5):1–13, 2020

work page 2020

[12] [12]

SpyDirect: A Novel Biofunctionalization Method for High Stability and Longevity of Electronic Biosensors

Keying Guo, Raik Gr¨unberg, Yuxiang Ren, Tianrui Chang, Shofarul Wustoni, Ondrej Strnad, Anil Koklu, Escarlet D´ıaz-Galicia, Jessica Parrado Agudelo, Victor Druet, et al. SpyDirect: A Novel Biofunctionalization Method for High Stability and Longevity of Electronic Biosensors. Advanced Science, page 2306716

work page

[13] [13]

Visual programming: Compositional visual reasoning without training

Tanmay Gupta and Aniruddha Kembhavi. Visual programming: Compositional visual reasoning without training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14953–14962, 2023

work page 2023

[14] [14]

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models

Wenlong Huang, Chen Wang, Ruohan Zhang, Yunzhu Li, Jiajun Wu, and Li Fei-Fei. V oxposer: Composable 3d value maps for robotic manipulation with language models. arXiv preprint arXiv:2307.05973, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[15] [15]

Isenberg, P

T. Isenberg, P. Isenberg, J. Chen, M. Sedlmair, and T. M¨oller. A Systematic Review on the Practice of Evaluating Visualization. IEEE Transactions on Visualization and Computer Graphics, 19(12):2818–2827, Dec 2013. 16 Chat Modeling A PREPRINT

work page 2013

[16] [16]

VOICE: Visual Oracle for Interaction, Conversation, and Explanation, 2024

Donggang Jia, Alexandra Irger, Lonni Besancon, Ondrej Strnad, Deng Luo, Johanna Bjorklund, Anders Ynnerman, and Ivan Viola. VOICE: Visual Oracle for Interaction, Conversation, and Explanation, 2024

work page 2024

[17] [17]

The kit object models database: An object model database for object recognition, localization and manipulation in service robotics

Alexander Kasper, Zhixing Xue, and R ¨udiger Dillmann. The kit object models database: An object model database for object recognition, localization and manipulation in service robotics. The International Journal of Robotics Research, 31(8):927–934, 2012

work page 2012

[18] [18]

An efficient method for fully automatic 3D digitization of unknown objects

Souhaiel Khalfaoui, Ralph Seulin, Yohan Fougerolle, and David Fofi. An efficient method for fully automatic 3D digitization of unknown objects. Computers in Industry, 64(9):1152–1160, 2013

work page 2013

[19] [19]

How many participants do researchers recruit? A look at 678 UX/HCI studies

Lisa Koeman. How many participants do researchers recruit? A look at 678 UX/HCI studies. Online. Last visited 06 January 2019, 2018

work page 2019

[20] [20]

Molecumentary: Adaptable narrated documentaries using molecular visualization

David Kou ˇril, Ond ˇrej Strnad, Peter Mindek, Sarkis Halladjian, Tobias Isenberg, M Eduard Gr ¨oller, and Ivan Viola. Molecumentary: Adaptable narrated documentaries using molecular visualization. IEEE Transactions on Visualization and Computer Graphics, 29(3):1733–1747, 2021

work page 2021

[21] [21]

Autonomous 3D modeling of unknown objects for active scene exploration

Simon Kriegel. Autonomous 3D modeling of unknown objects for active scene exploration. PhD thesis, Technis- che Universit¨at M¨unchen (TUM), 2015

work page 2015

[22] [22]

VIRD: Immersive Match Video Analysis for High-Performance Badminton Coaching

Tica Lin, Alexandre Aouididi, Zhutian Chen, Johanna Beyer, Hanspeter Pfister, and Jui-Hsien Wang. VIRD: Immersive Match Video Analysis for High-Performance Badminton Coaching. IEEE transactions on visualization and computer graphics, 2023

work page 2023

[23] [23]

Advisor: Automatic visualization answer for natural-language question on tabular data

Can Liu, Yun Han, Ruike Jiang, and Xiaoru Yuan. Advisor: Automatic visualization answer for natural-language question on tabular data. In 2021 IEEE 14th Pacific Visualization Symposium (PacificVis), pages 11–20, 2021

work page 2021

[24] [24]

X-mesh: Towards fast and accurate text-driven 3d stylization via dynamic textual guidance

Yiwei Ma, Xiaoqing Zhang, Xiaoshuai Sun, Jiayi Ji, Haowei Wang, Guannan Jiang, Weilin Zhuang, and Rongrong Ji. X-mesh: Towards fast and accurate text-driven 3d stylization via dynamic textual guidance. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 2749–2760, October 2023

work page 2023

[25] [25]

Chat2VIS: Fine-Tuning Data Visualisations using Multilingual Natural Language Text and Pre-Trained Large Language Models

Paula Maddigan and Teo Susnjak. Chat2VIS: Fine-Tuning Data Visualisations using Multilingual Natural Language Text and Pre-Trained Large Language Models. arXiv preprint arXiv:2303.14292, 2023

work page arXiv 2023

[26] [26]

Model Synthesis: A General Procedural Modeling Algorithm

Paul Merrell and Dinesh Manocha. Model Synthesis: A General Procedural Modeling Algorithm. IEEE Transactions on Visualization and Computer Graphics, 17(6):715–728, 2011

work page 2011

[27] [27]

Object 3dit: Language-guided 3d-aware image editing

Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Aniruddha Kembhavi, and Tanmay Gupta. Object 3dit: Language-guided 3d-aware image editing. Advances in Neural Information Processing Systems, 36, 2024

work page 2024

[28] [28]

Facilitating conversational interaction in natural language interfaces for visualization

Rishab Mitra, Arpit Narechania, Alex Endert, and John Stasko. Facilitating conversational interaction in natural language interfaces for visualization. In 2022 IEEE Visualization and Visual Analytics (VIS), pages 6–10. IEEE, 2022

work page 2022

[29] [29]

Clip-mesh: Generating textured meshes from text using pretrained image-text models

Nasir Mohammad Khalid, Tianhao Xie, Eugene Belilovsky, and Tiberiu Popa. Clip-mesh: Generating textured meshes from text using pretrained image-text models. In SIGGRAPH Asia 2022 Conference Papers, SA ’22. ACM, November 2022

work page 2022

[30] [30]

Procedural modeling of buildings

Pascal M¨uller, Peter Wonka, Simon Haegler, Andreas Ulmer, and Luc Van Gool. Procedural modeling of buildings. In ACM SIGGRAPH 2006 Papers, pages 614–623. 2006

work page 2006

[31] [31]

NL4DV: A toolkit for generating analytic specifications for data visualization from natural language queries

Arpit Narechania, Arjun Srinivasan, and John Stasko. NL4DV: A toolkit for generating analytic specifications for data visualization from natural language queries. IEEE Transactions on Visualization and Computer Graphics, 27(2):369–379, 2020

work page 2020

[32] [32]

Modeling in the time of COVID-19: Statistical and rule-based mesoscale models

Ngan Nguyen, Ondˇrej Strnad, Tobias Klein, Deng Luo, Ruwayda Alharbi, Peter Wonka, Martina Maritan, Peter Mindek, Ludovic Autin, David S Goodsell, et al. Modeling in the time of COVID-19: Statistical and rule-based mesoscale models. IEEE transactions on visualization and computer graphics, 27(2):722–732, 2020

work page 2020

[33] [33]

Procedural urban forestry

Till Niese, S¨oren Pirk, Matthias Albrecht, Bedrich Benes, and Oliver Deussen. Procedural urban forestry. ACM Transactions on Graphics (TOG), 41(2):1–18, 2022

work page 2022

[34] [34]

Numeracy from literacy: Data science as an emergent skill from large language models

David Noever and Forrest McKee. Numeracy from literacy: Data science as an emergent skill from large language models. arXiv preprint arXiv:2301.13382, 2023

work page arXiv 2023

[35] [35]

GPT-4 Technical Report, 2024

OpenAI. GPT-4 Technical Report, 2024

work page 2024

[36] [36]

Procedural modeling of cities

Yoav IH Parish and Pascal M ¨uller. Procedural modeling of cities. In Proceedings of the 28th annual conference on Computer graphics and interactive techniques, pages 301–308, 2001. 17 Chat Modeling A PREPRINT

work page 2001

[37] [37]

Working with Forensic Practitioners to Understand the Opportunities and Challenges for Mixed-Reality Digital Autopsy

Vahid Pooryousef, Maxime Cordeil, Lonni Besan c ¸on, Christophe Hurter, Tim Dwyer, and Richard Bassed. Working with Forensic Practitioners to Understand the Opportunities and Challenges for Mixed-Reality Digital Autopsy. In Proc. CHI, CHI ’23, New York, NY , USA, 2023. Association for Computing Machinery

work page 2023

[38] [38]

The algorithmic beauty of plants

Przemyslaw Prusinkiewicz and Aristid Lindenmayer. The algorithmic beauty of plants. Springer Science & Business Media, 2012

work page 2012

[39] [39]

Robust speech recognition via large-scale weak supervision

Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, and Ilya Sutskever. Robust speech recognition via large-scale weak supervision. In International Conference on Machine Learning, pages 28492–28518. PMLR, 2023

work page 2023

[40] [40]

Infinite photorealistic worlds using procedural generation

Alexander Raistrick, Lahav Lipson, Zeyu Ma, Lingjie Mei, Mingzhe Wang, Yiming Zuo, Karhan Kayan, Hongyu Wen, Beining Han, Yihan Wang, et al. Infinite photorealistic worlds using procedural generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 12630–12641, 2023

work page 2023

[41] [41]

Vis-SPLIT: Interactive Hierarchical Modeling for mRNA Expression Classification

Braden Roper, James C Mathews, Saad Nadeem, and Ji Hwan Park. Vis-SPLIT: Interactive Hierarchical Modeling for mRNA Expression Classification. In 2023 IEEE Visualization and Visual Analytics (VIS), pages 106–110. IEEE, 2023

work page 2023

[42] [42]

Advanced procedural modeling of architecture

Michael Schwarz and Pascal M ¨uller. Advanced procedural modeling of architecture. ACM Transactions on Graphics (TOG), 34(4):1–12, 2015

work page 2015

[43] [43]

3d-gpt: Procedural 3d modeling with large language models

Chunyi Sun, Junlin Han, Weijian Deng, Xinlong Wang, Zishan Qin, and Stephen Gould. 3d-gpt: Procedural 3d modeling with large language models. arXiv preprint arXiv:2310.12945, 2023

work page arXiv 2023

[44] [44]

ViperGPT: Visual Inference via Python Execution for Reasoning

D´ıdac Sur´ıs, Sachit Menon, and Carl V ondrick. Vipergpt: Visual inference via python execution for reasoning. arXiv preprint arXiv:2303.08128, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[45] [45]

Qt — Tools for Each Stage of Software Development Lifecycle

The Qt Company. Qt — Tools for Each Stage of Software Development Lifecycle. https://www.qt.io/, 2024. Accessed: 2024-03-23

work page 2024

[46] [46]

Prompt a robot to walk with large language models

Yen-Jen Wang, Bike Zhang, Jianyu Chen, and Koushil Sreenath. Prompt a robot to walk with large language models. arXiv preprint arXiv:2309.09969, 2023

work page arXiv 2023

[47] [47]

Towards natural language-based visualization authoring

Yun Wang, Zhitao Hou, Leixian Shen, Tongshuang Wu, Jiaqi Wang, He Huang, Haidong Zhang, and Dongmei Zhang. Towards natural language-based visualization authoring. IEEE Transactions on Visualization and Computer Graphics, 29(1):1222–1232, 2022

work page 2022

[48] [48]

Procedural cloudscapes

Antoine Webanck, Yann Cortial, Eric Gu ´erin, and Eric Galin. Procedural cloudscapes. In Computer Graphics Forum, volume 37, pages 431–442. Wiley Online Library, 2018

work page 2018

[49] [49]

Chain-of-thought prompting elicits reasoning in large language models

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Fei Xia, Ed Chi, Quoc V Le, Denny Zhou, et al. Chain-of-thought prompting elicits reasoning in large language models. Advances in Neural Information Processing Systems, 35:24824–24837, 2022

work page 2022

[50] [50]

T4 model

John Winfer, Aeliya Syed, Paul Ekers, Leon Thistle, Ngan Nguyen, Ondrej Strnad, David Goodsell, Ivan Viola, and Deng Luo. T4 model. https://www.nanovis.org/T4-model.html. Accessed on 09/07/2021. 18

work page 2021