LearnLM: Improving Gemini for Learning. arXiv preprint arXiv:2412.16429

LearnLM Team · 2024 · arXiv 2412.16429

5 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 2

citation-polarity summary

background 1 · unclear 1

representative citing papers

Are Agents Ready to Teach? A Multi-Stage Benchmark for Real-World Teaching Workflows

cs.AI · 2026-05-14 · unverdicted · novelty 6.0

EduAgentBench is a new source-grounded benchmark that evaluates tutor agents across pedagogical judgment, situated multi-turn tutoring, and Canvas-style workflow completion, finding frontier models capable of basic judgment but inadequate for professional teaching standards.

Behavior Latticing: Inferring User Motivations from Unstructured Interactions

cs.HC · 2026-04-08 · unverdicted · novelty 6.0

Behavior latticing synthesizes connections across unstructured user interactions to generate insights into underlying motivations, yielding deeper and more accurate user understanding than task-only models.

Mitigating LLM biases toward spurious social contexts using direct preference optimization

cs.AI · 2026-04-02 · unverdicted · novelty 6.0

Debiasing-DPO reduces bias toward spurious social contexts by 84% and improves predictive accuracy by 52% on average for LLMs evaluating U.S. classroom transcripts.

Large Language Models as Students Who Think Aloud: Overly Coherent, Verbose, and Confident

cs.CL · 2026-02-01 · unverdicted · novelty 6.0

LLMs simulating student think-alouds in multi-step chemistry tutoring produce overly coherent, verbose, and confident reasoning that overestimates learner success compared to 630 human utterances.

Ceci n'est pas une explication: Evaluating Explanation Failures as Explainability Pitfalls in Language Learning Systems

cs.HC · 2026-04-28 · unverdicted · novelty 4.0 · 2 refs

Introduces L2-Bench, a benchmark for AI feedback in language education across six dimensions, and identifies explainability pitfalls in AI-generated explanations that appear helpful but are flawed.

citing papers explorer

Showing 5 of 5 citing papers.

Are Agents Ready to Teach? A Multi-Stage Benchmark for Real-World Teaching Workflows
cs.AI · 2026-05-14 · unverdicted · none · ref 6

Behavior Latticing: Inferring User Motivations from Unstructured Interactions
cs.HC · 2026-04-08 · unverdicted · none · ref 97

Mitigating LLM biases toward spurious social contexts using direct preference optimization
cs.AI · 2026-04-02 · unverdicted · none · ref 8

Large Language Models as Students Who Think Aloud: Overly Coherent, Verbose, and Confident
cs.CL · 2026-02-01 · unverdicted · none · ref 19

Ceci n'est pas une explication: Evaluating Explanation Failures as Explainability Pitfalls in Language Learning Systems
cs.HC · 2026-04-28 · unverdicted · none · ref 25 · 2 links
