A program synthesis system models collaborative physical activities from narrated demonstrations as editable programs, enabling users to teach, inspect, and correct them, with a study showing 70% success in refining soccer tactics programs.
hub Canonical reference
How much do you understand how image generation AI models work?
Canonical reference. 100% of citing Pith papers cite this work as background.
hub tools
citation-role summary
citation-polarity summary
roles
background 6polarities
background 6representative citing papers
IdeaBlocks modularizes divergent intents into Exploration Blocks with multi-level reuse options, enabling 2.13 times more images explored and 12.5% greater visual diversity than baseline in a comparative user study.
JARVIS delivers VLM-powered contextual AR guidance with state verification for cross-reality tasks, improving usability and success rates over baselines in a 14-person study.
Intent Lenses infer capture-time user intent from photos via LLMs to create dynamic, reusable interactive objects that generate and organize structured visual notes for later sensemaking.
Mixed-Initiative Context reconceptualizes interaction context as a dynamic, jointly manageable structure that humans and AI can actively organize according to task needs.
Adaptive Prompt Elicitation (APE) uses an information-theoretic framework to generate visual queries that elicit and compile user intent into better prompts for text-to-image models, showing improved alignment in benchmarks and a user study.
Pista decomposes AI agent actions in spreadsheets into auditable steps, enabling real-time user intervention that improves task outcomes, user comprehension, agent perception, and sense of co-ownership over baseline agents.
SpatialBalancing is a system that turns revision trade-offs into spatial navigation so writers can iteratively balance scientific exposition and narrative engagement with LLM assistance.
XARP provides a WebSocket-based remote-procedure system that lets Python code and AI agents control Unity XR clients, with benchmarks and user studies showing faster iteration than conventional XR workflows.
Semantic Reality maintains a persistent connectivity graph of objects in AR via multimodal reasoning and action recognition, then visualizes relationships to aid understanding and task guidance.
A workshop proposal to reflect on HCI's core identity and the importance of human elements in the era of generative AI.
citing papers explorer
-
Interactive Program Synthesis for Modeling Collaborative Physical Activities from Narrated Demonstrations
A program synthesis system models collaborative physical activities from narrated demonstrations as editable programs, enabling users to teach, inspect, and correct them, with a study showing 70% success in refining soccer tactics programs.
-
IdeaBlocks: Expressing and Reusing Divergent Intents for Graphic Design Exploration using Generative AI
IdeaBlocks modularizes divergent intents into Exploration Blocks with multi-level reuse options, enabling 2.13 times more images explored and 12.5% greater visual diversity than baseline in a comparative user study.
-
JARVIS: A Just-in-Time Augmented Reality VLM-Powered Instruction System for Cross-Reality Task Guidance
JARVIS delivers VLM-powered contextual AR guidance with state verification for cross-reality tasks, improving usability and success rates over baselines in a 14-person study.
-
Intent Lenses: Inferring Capture-Time Intent to Transform Opportunistic Photo Captures into Structured Visual Notes
Intent Lenses infer capture-time user intent from photos via LLMs to create dynamic, reusable interactive objects that generate and organize structured visual notes for later sensemaking.
-
Mixed-Initiative Context: Structuring and Managing Context for Human-AI Collaboration
Mixed-Initiative Context reconceptualizes interaction context as a dynamic, jointly manageable structure that humans and AI can actively organize according to task needs.
-
Adaptive Prompt Elicitation for Text-to-Image Generation
Adaptive Prompt Elicitation (APE) uses an information-theoretic framework to generate visual queries that elicit and compile user intent into better prompts for text-to-image models, showing improved alignment in benchmarks and a user study.
-
Auditing and Controlling AI Agent Actions in Spreadsheets
Pista decomposes AI agent actions in spreadsheets into auditable steps, enabling real-time user intervention that improves task outcomes, user comprehension, agent perception, and sense of co-ownership over baseline agents.
-
Spatial Balancing: Harnessing Spatial Reasoning to Balance Scientific Exposition and Narrative Engagement in LLM-assisted Science Communication Writing
SpatialBalancing is a system that turns revision trade-offs into spatial navigation so writers can iteratively balance scientific exposition and narrative engagement with LLM assistance.
-
XARP Tools: An Extended Reality Platform for Humans and AI Agents
XARP provides a WebSocket-based remote-procedure system that lets Python code and AI agents control Unity XR clients, with benchmarks and user studies showing faster iteration than conventional XR workflows.
-
Semantic Reality: Interactive Context-Aware Visualization of Inter-Object Relationships in Augmented Reality
Semantic Reality maintains a persistent connectivity graph of objects in AR via multimodal reasoning and action recognition, then visualizes relationships to aid understanding and task guidance.
-
What is (H)CI: Why Does the "Human'' Matter?
A workshop proposal to reflect on HCI's core identity and the importance of human elements in the era of generative AI.