hub Canonical reference

InProceedings of the 2023 CHI Conference on Human Factors in Computing Systems(Hamburg, Germany)(CHI ’23)

Why Johnny Can’t Prompt: How Non-AI Experts Try (and Fail) to Design LLM Prompts · 2023 · DOI 10.1145/3544548

Canonical reference. 82% of citing Pith papers cite this work as background.

30 Pith papers citing it

6 external citations · Crossref

Background 82% of classified citations

open at publisher browse 30 citing papers

hub tools

JSON dossier citing papers JSON publisher DOI

citation-role summary

background 11

citation-polarity summary

background 9 support 1 unclear 1

representative citing papers

Priming, Path-dependence, and Plasticity: Understanding the molding of user-LLM interaction and its implications from (many) chat logs in the wild

cs.HC · 2026-05-07 · unverdicted · novelty 7.0

Large-scale analysis of wild LLM chat logs finds that user interaction patterns stabilize quickly after initial use and correlate with long-term outcomes like retention, creating an agency paradox of limited exploration in unconstrained systems.

PersonaTeaming: Supporting Persona-Driven Red-Teaming for Generative AI

cs.HC · 2026-05-07 · unverdicted · novelty 7.0 · 2 refs

Persona-driven workflow and interface improve automated and human-AI red-teaming of generative AI by incorporating diverse perspectives into adversarial prompt creation.

Training Computer Use Agents to Assess the Usability of Graphical User Interfaces

cs.CL · 2026-04-28 · unverdicted · novelty 7.0

uxCUA is a trained computer use agent that assesses GUI usability more accurately than larger models by learning to prioritize and execute important user interactions on labeled interface datasets.

Point & Grasp: Flexible Selection of Out-of-Reach Objects Through Probabilistic Cue Integration

cs.HC · 2026-04-24 · unverdicted · novelty 7.0

Point&Grasp probabilistically integrates pointing and grasp gestures for out-of-reach object selection in MR, trained on a new ORG dataset, and outperforms single-cue baselines in user studies.

Beyond One Output: Visualizing and Comparing Distributions of Language Model Generations

cs.AI · 2026-04-20 · conditional · novelty 7.0

GROVE visualizes distributions of language model generations as overlapping paths through a text graph, with user studies showing that graph summaries aid structural judgments like diversity assessment while raw outputs remain better for details.

Beyond Chat and Clicks: GUI Agents for In-Situ Assistance via Live Interface Transformation

cs.HC · 2026-04-16 · unverdicted · novelty 7.0

GUI agents can transform live web interfaces in real-time via DOM manipulations to deliver contextual assistance directly within the application.

Interactive Program Synthesis for Modeling Collaborative Physical Activities from Narrated Demonstrations

cs.AI · 2025-09-29 · accept · novelty 7.0

A program synthesis system models collaborative physical activities from narrated demonstrations as editable programs, enabling users to teach, inspect, and correct them, with a study showing 70% success in refining soccer tactics programs.

Evalet: Evaluating Large Language Models through Functional Fragmentation

cs.HC · 2025-09-14 · conditional · novelty 7.0

Evalet applies functional fragmentation to deliver fragment-level qualitative analysis of LLM evaluations, with a user study showing 48% more misalignment detections than holistic scoring.

IdeaBlocks: Expressing and Reusing Divergent Intents for Graphic Design Exploration using Generative AI

cs.HC · 2025-07-29 · unverdicted · novelty 7.0

IdeaBlocks modularizes divergent intents into Exploration Blocks with multi-level reuse options, enabling 2.13 times more images explored and 12.5% greater visual diversity than baseline in a comparative user study.

xSense Design Cards: Guiding the Design of Multisensory Experiences

cs.ET · 2026-06-07 · unverdicted · novelty 6.0

Introduces xSense Design Cards with four types (Experience, Sensory, Technology, Exploration) to guide multisensory experience design and evaluation in HCI.

Babel: Jailbreaking Safety Attention via Obfuscation Distribution Optimized Sampling

cs.CR · 2026-05-18 · unverdicted · novelty 6.0

Babel is an efficient black-box jailbreaking framework that formalizes sparse safety attention heads via a mathematical obfuscation model and uses iterative distribution refinement to achieve higher attack success rates on models like GPT-4o and Claude-3-5-haiku with around 40 queries.

Conversations in Space: Structuring Non-Linear LLM Interactions on a Canvas

cs.HC · 2026-05-15 · unverdicted · novelty 6.0

CanvasConvo presents a spatial canvas interface for branching LLM conversations, evaluated in a 5-7 day field study with 24 participants that found support for exploratory workflows.

Making Abstraction Concrete: A Design Space and Interaction Model of Abstraction in Interactive Systems

cs.HC · 2026-05-11 · unverdicted · novelty 6.0

A survey of 457 papers yields a six-dimensional design space for abstraction in interactive systems that reframes gulfs of execution and evaluation while articulating cognitive and design processes for bridging abstraction gaps.

Cripping AI: Reimagining AI Through Lived Disability Experiences

cs.HC · 2026-05-03 · unverdicted · novelty 6.0

Cripping AI is a proposed framework that dismantles ableist assumptions in AI by centering disabled ways of knowing and respecting disabled labor in co-creation.

MicroVRide: Exploring 4-in-1 Virtual Reality Micromobility Simulator

cs.HC · 2026-04-12 · unverdicted · novelty 6.0

A modular VR simulator supports four distinct micromobility vehicles on one hardware setup and a preliminary study finds unique riding experiences for each.

JARVIS: A Just-in-Time Augmented Reality VLM-Powered Instruction System for Cross-Reality Task Guidance

cs.HC · 2026-04-11 · unverdicted · novelty 6.0 · 2 refs

JARVIS delivers VLM-powered contextual AR guidance with state verification for cross-reality tasks, improving usability and success rates over baselines in a 14-person study.

Beyond Compliance: How AI Could Help Creative Writers by Refusing Them

cs.HC · 2026-04-03 · unverdicted · novelty 6.0

A qualitative study with 22 creative writers finds that the reflective value of AI refusals depends on alignment with users' situational thinking phases, cognitive beliefs, and views of AI roles.

Radical Gender Neutrality: Agender Euphoria in Gaming and Play Experiences

cs.HC · 2026-03-10 · unverdicted · novelty 6.0

A critical incident technique study with 142 participants identifies mechanisms by which games create or block agender euphoria and supplies empirically grounded design criteria for gender-neutral play.

Adaptive Prompt Elicitation for Text-to-Image Generation

cs.HC · 2026-02-04 · unverdicted · novelty 6.0

Adaptive Prompt Elicitation (APE) uses an information-theoretic framework to generate visual queries that elicit and compile user intent into better prompts for text-to-image models, showing improved alignment in benchmarks and a user study.

Polite But Boring? Trade-offs Between Engagement and Psychological Reactance to Chatbot Feedback Styles

cs.HC · 2026-01-28 · unverdicted · novelty 6.0

Polite chatbot feedback lowers psychological reactance and boosts behavioral intentions but lacks engagement, whereas verbal leakage heightens surprise and engagement at the expense of increased reactance.

Teaching Prompt-Based Programming with LLMs: A 45-Minute Lesson with Guided Practice for End-User Programmers

cs.CY · 2026-06-29 · conditional · novelty 5.0

A randomized trial found that a 45-minute prompt-based programming lesson produced modest non-significant performance gains and significant self-efficacy gains compared to code tracing.

Bridging Predictions and Interventions: An Integrated Framework for Automated Decision-Systems

cs.CY · 2026-06-24 · unverdicted · novelty 5.0

Perspective paper proposing an integrated framework for automated decision systems that shifts priority from prediction accuracy to accounting for changes in organizational workflows and intervention effects.

AnimationDiff: A Visual Comparison Tool for Generated 3D Character Animations

cs.HC · 2026-05-01 · unverdicted · novelty 5.0

AnimationDiff is a visual comparison tool that combines contextual scene viewing, overlay/side-by-side modes, filtering, and temporal lenses to help users select among generated 3D character animations.

OOPrompt: Reifying Intents into Structured Artifacts for Modular and Iterative Prompting

cs.HC · 2026-04-21 · unverdicted · novelty 5.0

OOPrompt reifies user intents into structured manipulable artifacts to enable modular and iterative prompting in LLM-based interactive systems.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Beyond One Output: Visualizing and Comparing Distributions of Language Model Generations cs.AI · 2026-04-20 · conditional · none · ref 56
GROVE visualizes distributions of language model generations as overlapping paths through a text graph, with user studies showing that graph summaries aid structural judgments like diversity assessment while raw outputs remain better for details.
Evalet: Evaluating Large Language Models through Functional Fragmentation cs.HC · 2025-09-14 · conditional · none · ref 70
Evalet applies functional fragmentation to deliver fragment-level qualitative analysis of LLM evaluations, with a user study showing 48% more misalignment detections than holistic scoring.
Teaching Prompt-Based Programming with LLMs: A 45-Minute Lesson with Guided Practice for End-User Programmers cs.CY · 2026-06-29 · conditional · none · ref 33
A randomized trial found that a 45-minute prompt-based programming lesson produced modest non-significant performance gains and significant self-efficacy gains compared to code tracing.

InProceedings of the 2023 CHI Conference on Human Factors in Computing Systems(Hamburg, Germany)(CHI ’23)

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer