McKee, Daniel Gillick, et al

Ivan Jurenka, Matthias Kunesch, Kyle R · 2024 · arXiv 2407.12687

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

read on arXiv browse 8 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Evaluating Answer Leakage Robustness of LLM Tutors against Adversarial Student Attacks

cs.CR · 2026-04-20 · unverdicted · novelty 7.0

LLM tutors leak answers under adversarial student attacks, but a fine-tuned jailbreak agent and simple defenses can benchmark and improve robustness.

Evaluating Multi-turn Human-AI Interaction

cs.HC · 2026-05-18 · unverdicted · novelty 6.0

Introduces the TCR framework to evaluate educational LLM assistants on transparency, consistency, and refinement in multi-turn interactions, complementing aggregate metrics.

The Missing Evaluation Axis: What 10,000 Student Submissions Reveal About AI Tutor Effectiveness

cs.CY · 2026-05-07 · conditional · novelty 6.0

Behavioral signals from how students use AI tutor feedback in 10k code submissions reveal differences between tutors and correlate more strongly with perceived helpfulness than pedagogical quality alone.

Beyond the AI Tutor: Social Learning with LLM Agents

cs.HC · 2026-04-03 · unverdicted · novelty 6.0

Two controlled experiments show multi-agent LLM configurations with both tutors and peers deliver higher learning gains and less homogeneous outputs than single-LLM tutoring in math problem-solving and essay writing.

"Would You Want an AI Tutor?" Understanding Stakeholder Perceptions of LLM-based Systems in the Classroom

cs.CY · 2025-02-02 · unverdicted · novelty 6.0

The paper proposes the Co-PALE framework connecting educational context, responsible AI principles, and perception categories to guide adoption decisions for LLM-based educational tools.

Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval-Augmented Generation (RAG)

cs.CL · 2025-05-22 · unverdicted · novelty 5.0

LC-RAG augments standard RAG by incorporating environment logs to contextualize student discourse, yielding better retrieval and more relevant guidance from the Copa agent in the C2STEM modeling environment.

Ceci n'est pas une explication: Evaluating Explanation Failures as Explainability Pitfalls in Language Learning Systems

cs.HC · 2026-04-28 · unverdicted · novelty 4.0 · 2 refs

Introduces L2-Bench benchmark for AI feedback in language education across six dimensions and identifies explainability pitfalls in AI-generated explanations that appear helpful but are flawed.

Latency and Cost of Multi-Agent Intelligent Tutoring at Scale

cs.CY · 2026-04-27 · unverdicted · novelty 3.0

Priority PayGo keeps multi-agent tutoring responses under 4 seconds even at 50 concurrent users, while costs stay below textbook prices per student.

citing papers explorer

Showing 8 of 8 citing papers.

Evaluating Answer Leakage Robustness of LLM Tutors against Adversarial Student Attacks cs.CR · 2026-04-20 · unverdicted · none · ref 69
LLM tutors leak answers under adversarial student attacks, but a fine-tuned jailbreak agent and simple defenses can benchmark and improve robustness.
Evaluating Multi-turn Human-AI Interaction cs.HC · 2026-05-18 · unverdicted · none · ref 23
Introduces the TCR framework to evaluate educational LLM assistants on transparency, consistency, and refinement in multi-turn interactions, complementing aggregate metrics.
The Missing Evaluation Axis: What 10,000 Student Submissions Reveal About AI Tutor Effectiveness cs.CY · 2026-05-07 · conditional · none · ref 7
Behavioral signals from how students use AI tutor feedback in 10k code submissions reveal differences between tutors and correlate more strongly with perceived helpfulness than pedagogical quality alone.
Beyond the AI Tutor: Social Learning with LLM Agents cs.HC · 2026-04-03 · unverdicted · none · ref 39
Two controlled experiments show multi-agent LLM configurations with both tutors and peers deliver higher learning gains and less homogeneous outputs than single-LLM tutoring in math problem-solving and essay writing.
"Would You Want an AI Tutor?" Understanding Stakeholder Perceptions of LLM-based Systems in the Classroom cs.CY · 2025-02-02 · unverdicted · none · ref 40
The paper proposes the Co-PALE framework connecting educational context, responsible AI principles, and perception categories to guide adoption decisions for LLM-based educational tools.
Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval-Augmented Generation (RAG) cs.CL · 2025-05-22 · unverdicted · none · ref 13
LC-RAG augments standard RAG by incorporating environment logs to contextualize student discourse, yielding better retrieval and more relevant guidance from the Copa agent in the C2STEM modeling environment.
Ceci n'est pas une explication: Evaluating Explanation Failures as Explainability Pitfalls in Language Learning Systems cs.HC · 2026-04-28 · unverdicted · none · ref 19 · 2 links
Introduces L2-Bench benchmark for AI feedback in language education across six dimensions and identifies explainability pitfalls in AI-generated explanations that appear helpful but are flawed.
Latency and Cost of Multi-Agent Intelligent Tutoring at Scale cs.CY · 2026-04-27 · unverdicted · none · ref 24
Priority PayGo keeps multi-agent tutoring responses under 4 seconds even at 50 concurrent users, while costs stay below textbook prices per student.

McKee, Daniel Gillick, et al

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer