Exposía: Teaching and Assessment of Academic Writing Skills for Research Project Proposals and Peer Feedback

· 2026 · cs.CL · arXiv 2601.06536

2 Pith papers cite this work. Polarity classification is still indexing.

abstract

We present Exposía, the first public dataset that connects writing and feedback in higher education, enabling research on educationally grounded computational approaches to teaching and evaluating academic writing. Exposía includes student research project proposals and peer and instructor feedback consisting of comments and free-text reviews. The dataset was collected in the "Introduction to Scientific Work" course of the Computer Science. Exposía reflects the multi-stage nature of the academic writing process that includes drafting, receiving feedback, and revising the writing based on the feedback received. Both the project proposals and peer feedback are accompanied by human assessment scores based on a fine-grained, pedagogically-grounded schema for writing and feedback assessment that we develop. We use Exposía to benchmark state-of-the-art large language models (LLMs) on two tasks: automated scoring of (1) the proposals and (2) the student reviews. We find that the two tasks benefit from different LLMs. Furthermore, closed-source models consistently outperform open-weight models, motivating further research on improving the performance of open-weight models preferred in classroom settings. Finally, we establish that a prompting strategy that scores multiple aspects of the writing together is the most effective, an important finding for classroom deployment.

representative citing papers

FOXGLOVE: Understanding Goal-Oriented and Anchored Writing Feedback from Experts and LLMs on Argumentative Essays

cs.CL · 2026-06-04 · unverdicted · novelty 6.0

The FOXGLOVE dataset of 2,340 comments shows that LLMs and instructors align on feedback goals and positions but diverge on sentence selection. LLMs use more complex language and ask fewer questions, and higher quality ratings are driven by comment length.

Feedback-to-Rubrics: Can We Learn Expert Criteria from Inline Comments?

cs.LG · 2026-05-28 · unverdicted · novelty 6.0

The method infers and refines natural-language rubrics from inline comments on artifacts, using LLM-based prediction mismatches, and is evaluated in real-world and controlled settings to support comment prediction and revision.

