Evalet applies functional fragmentation to deliver fragment-level qualitative analysis of LLM evaluations, with a user study showing 48% more misalignment detections than holistic scoring.
Title resolution pending
5 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 2polarities
background 2representative citing papers
Schemex is an interactive three-stage AI workflow for schema induction that user studies show produces more actionable schemas than a frontier baseline without loss of generalizability.
Post-generation control in AI-assisted math visual creation yields higher teacher ratings for predictability and correctness than pre- or mid-generation control, with qualitative trade-offs in agency and effort.
Mixed-Initiative Context reconceptualizes interaction context as a dynamic, jointly manageable structure that humans and AI can actively organize according to task needs.
MIRAGE improves VLM analysis of multi-figure art by inserting a verifiable structured representation of micro-interactions between spatial grounding and narrative output.
citing papers explorer
-
Evalet: Evaluating Large Language Models through Functional Fragmentation
Evalet applies functional fragmentation to deliver fragment-level qualitative analysis of LLM evaluations, with a user study showing 48% more misalignment detections than holistic scoring.
-
Schemex: Discovering Structural Abstractions from Examples
Schemex is an interactive three-stage AI workflow for schema induction that user studies show produces more actionable schemas than a frontier baseline without loss of generalizability.
-
When Should Teachers Control AI Generation for Mathematics Visuals?
Post-generation control in AI-assisted math visual creation yields higher teacher ratings for predictability and correctness than pre- or mid-generation control, with qualitative trade-offs in agency and effort.
-
Mixed-Initiative Context: Structuring and Managing Context for Human-AI Collaboration
Mixed-Initiative Context reconceptualizes interaction context as a dynamic, jointly manageable structure that humans and AI can actively organize according to task needs.
-
MIRAGE: A Micro-Interaction Relational Architecture for Grounded Exploration in Multi-Figure Artworks
MIRAGE improves VLM analysis of multi-figure art by inserting a verifiable structured representation of micro-interactions between spatial grounding and narrative output.