Exploring LLM Agent Designs and Interaction Modalities for Scientific Visualization

· 2026 · cs.AI · arXiv 2604.27996

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open full Pith review browse 3 citing papers arXiv PDF

abstract

This paper examines how large language model (LLM) agents perform on scientific visualization (SciVis) tasks that require generating visualization workflows from natural-language instructions. We compare three representative agent designs: domain-specific agents with structured tool use, computer-use agents, and general-purpose coding agents, across 15 benchmark tasks, evaluating visualization quality, efficiency, robustness, computational cost, and the impact of persistent memory. We further study interaction modalities, including code scripts, model context protocol (MCP) or API calls, command-line interfaces (CLI), and graphical user interfaces (GUI). Our goal is to characterize the tradeoffs among representative SciVis agent configurations used in practice. The results reveal clear tradeoffs across agent designs and interaction modalities. General-purpose coding agents achieve the highest task success rates but incur greater computational cost, whereas domain-specific agents are more efficient and stable but less flexible. Computer-use agents perform well on individual operations but struggle with multi-step workflows. Across both CLI- and GUI-based settings, persistent memory improves performance over repeated trials, but its effectiveness depends on the interaction mode and the quality of feedback. These findings suggest that future SciVis systems should combine structured tool use, interactive capabilities, and adaptive memory mechanisms to balance performance, robustness, and flexibility.

representative citing papers

SciVisAgentSkills: Design and Evaluation of Agent Skills for Scientific Data Analysis and Visualization

cs.AI · 2026-06-04 · unverdicted · novelty 6.0

SciVisAgentSkills provides reusable agent skills that raise mean task scores on a 108-task SciVis benchmark when paired with Codex and Claude Code agents.

Exploring LLM Agent Designs and Interaction Modalities for Scientific Visualization

cs.AI · 2026-04-30 · unverdicted · novelty 5.0

Empirical comparison of domain-specific, computer-use, and general-purpose LLM agents plus CLI/GUI modalities on SciVis tasks reveals general-purpose agents highest in success rate but costliest, domain-specific agents more efficient, and persistent memory beneficial depending on mode.

HiLSVA: Design and Evaluation of a Human-in-the-Loop Agentic System for Scientific Visualization

cs.HC · 2026-06-25 · unverdicted · novelty 4.0

HiLSVA introduces a plan-first multi-agent LLM system for scientific visualization that incorporates explicit human oversight, stepwise provenance, and learn-at-test-time adaptation, evaluated via case studies and a 12-participant user study.

citing papers explorer

Showing 2 of 2 citing papers after filters.

SciVisAgentSkills: Design and Evaluation of Agent Skills for Scientific Data Analysis and Visualization cs.AI · 2026-06-04 · unverdicted · none · ref 43 · internal anchor
SciVisAgentSkills provides reusable agent skills that raise mean task scores on a 108-task SciVis benchmark when paired with Codex and Claude Code agents.
Exploring LLM Agent Designs and Interaction Modalities for Scientific Visualization cs.AI · 2026-04-30 · unverdicted · none · ref 1 · internal anchor
Empirical comparison of domain-specific, computer-use, and general-purpose LLM agents plus CLI/GUI modalities on SciVis tasks reveals general-purpose agents highest in success rate but costliest, domain-specific agents more efficient, and persistent memory beneficial depending on mode.

Exploring LLM Agent Designs and Interaction Modalities for Scientific Visualization

fields

years

verdicts

representative citing papers

citing papers explorer