Model predictions were generated using greedy decoding

Separate models were trained for each of the 10 unique grids, 2 policies per grid, for a total of 20 trained models · 2026

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Representing expertise accelerates learning from pedagogical interaction data

cs.CL · 2026-04-14 · unverdicted · novelty 5.0

Transformer models trained on synthetic pedagogical interaction data in spatial navigation achieve more robust expert-like performance than those trained only on expert demonstrations, particularly when they can distinguish epistemic states of expert and novice agents.

citing papers explorer

Showing 1 of 1 citing paper.

Representing expertise accelerates learning from pedagogical interaction data cs.CL · 2026-04-14 · unverdicted · none · ref 5
Transformer models trained on synthetic pedagogical interaction data in spatial navigation achieve more robust expert-like performance than those trained only on expert demonstrations, particularly when they can distinguish epistemic states of expert and novice agents.

Model predictions were generated using greedy decoding

fields

years

verdicts

representative citing papers

citing papers explorer