hub Mixed citations

Stepanova, John Desnoyers-Stewart, Kristina Höök, and Bernhard E

Tongshuang Wu, Michael Terry, Carrie Jun Cai · 2022 · arXiv 1102.351758

Mixed citation behavior: 13 Pith papers cite this work, and the most common role is background (62% of classified citations).

citation-role summary

background: 6 · dataset: 1 · method: 1

citation-polarity summary

background: 5 · unclear: 1 · use dataset: 1 · use method: 1

representative citing papers

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

cs.CL · 2022-01-28 · accept · novelty 9.0

Chain-of-thought prompting, by including intermediate reasoning steps in few-shot examples, elicits strong reasoning abilities in large language models on arithmetic, commonsense, and symbolic tasks.

Fast-Food Intimacy: How Chinese Women Navigate Soul's AI Boyfriend

cs.HC · 2026-05-09 · conditional · novelty 7.0

Users experience fast-food intimacy with Soul's AI boyfriend that conflicts with gradual cultural expectations, introduces technical uncertainty, and shifts emotional labor onto women.

Beyond One Output: Visualizing and Comparing Distributions of Language Model Generations

cs.AI · 2026-04-20 · conditional · novelty 7.0

GROVE visualizes distributions of language model generations as overlapping paths through a text graph, with user studies showing that graph summaries aid structural judgments like diversity assessment while raw outputs remain better for details.

Evalet: Evaluating Large Language Models through Functional Fragmentation

cs.HC · 2025-09-14 · conditional · novelty 7.0

Evalet applies functional fragmentation to deliver fragment-level qualitative analysis of LLM evaluations, with a user study showing 48% more misalignment detections than holistic scoring.

Journeys of Parents with LGBTQ+ Children: How Trauma and Healing Reshape Identity and (Mis)Informating Practices

cs.HC · 2026-05-19 · unverdicted · novelty 6.0

A qualitative study of South Korean parents shows that trauma and healing after learning a child is LGBTQ+ leads to identity reconstruction as supportive parents and more critical, protective informating practices.

Conversations in Space: Structuring Non-Linear LLM Interactions on a Canvas

cs.HC · 2026-05-15 · unverdicted · novelty 6.0

CanvasConvo presents a spatial canvas interface for branching LLM conversations, evaluated in a 5-7 day field study with 24 participants that found support for exploratory workflows.

From Words to Widgets for Controllable LLM Generation

cs.HC · 2026-04-13 · unverdicted · novelty 6.0

Malleable Prompting reifies subjective preferences from natural language into GUI widgets and modulates LLM token probabilities during decoding to enable controllable generation, with a user study showing improved precision and perceived controllability over standard prompting.

CogInstrument: Modeling Cognitive Processes for Bidirectional Human-LLM Alignment in Planning Tasks

cs.HC · 2026-04-12 · unverdicted · novelty 6.0

CogInstrument represents human reasoning as revisable cognitive motifs in graphical form to support iterative alignment with LLMs during planning tasks, with an N=12 study indicating gains in targeted revision, agency, and trust over standard dialogue interfaces.

Narrix: Remixing Narrative Strategies from Examples for Story Writing

cs.HC · 2026-04-08 · unverdicted · novelty 6.0

Narrix helps novices identify and reuse narrative strategies from examples through visualization and strategy-steered generation, improving retention, confidence, and adaptation over chat interfaces in a 12-person study.

OOPrompt: Reifying Intents into Structured Artifacts for Modular and Iterative Prompting

cs.HC · 2026-04-21 · unverdicted · novelty 5.0

OOPrompt reifies user intents into structured manipulable artifacts to enable modular and iterative prompting in LLM-based interactive systems.

HeartSway: Exploring Biodata as Poetic Traces in Public Space

cs.HC · 2026-04-13 · unverdicted · novelty 5.0

An interactive public hammock captures and replays biodata as embodied traces, with a field study of ten users indicating it fosters anonymous connection and appreciation for shared vitality.

High-quality generation of dynamic game content via small language models: A proof of concept

cs.AI · 2026-01-30 · conditional · novelty 5.0

A proof of concept shows that fine-tuned small language models achieve adequate quality for real-time game content generation in a scoped RPG loop, using retry-until-success generation and LLM-as-judge evaluation.

Benchmark Data Contamination of Large Language Models: A Survey

cs.CL · 2024-06-06 · unverdicted · novelty 3.0

A survey reviewing benchmark data contamination in LLMs, its impact on evaluation, and alternative assessment approaches.

citing papers explorer

Showing 13 of 13 citing papers.

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models · cs.CL · 2022-01-28 · accept · none · ref 77
Fast-Food Intimacy: How Chinese Women Navigate Soul's AI Boyfriend · cs.HC · 2026-05-09 · conditional · none · ref 133
Beyond One Output: Visualizing and Comparing Distributions of Language Model Generations · cs.AI · 2026-04-20 · conditional · none · ref 53
Evalet: Evaluating Large Language Models through Functional Fragmentation · cs.HC · 2025-09-14 · conditional · none · ref 92
Journeys of Parents with LGBTQ+ Children: How Trauma and Healing Reshape Identity and (Mis)Informating Practices · cs.HC · 2026-05-19 · unverdicted · none · ref 99
Conversations in Space: Structuring Non-Linear LLM Interactions on a Canvas · cs.HC · 2026-05-15 · unverdicted · none · ref 27
From Words to Widgets for Controllable LLM Generation · cs.HC · 2026-04-13 · unverdicted · none · ref 53
CogInstrument: Modeling Cognitive Processes for Bidirectional Human-LLM Alignment in Planning Tasks · cs.HC · 2026-04-12 · unverdicted · none · ref 73
Narrix: Remixing Narrative Strategies from Examples for Story Writing · cs.HC · 2026-04-08 · unverdicted · none · ref 97
OOPrompt: Reifying Intents into Structured Artifacts for Modular and Iterative Prompting · cs.HC · 2026-04-21 · unverdicted · none · ref 50
HeartSway: Exploring Biodata as Poetic Traces in Public Space · cs.HC · 2026-04-13 · unverdicted · none · ref 81
High-quality generation of dynamic game content via small language models: A proof of concept · cs.AI · 2026-01-30 · conditional · none · ref 34
Benchmark Data Contamination of Large Language Models: A Survey · cs.CL · 2024-06-06 · unverdicted · none · ref 165
