Codev-Bench: How do LLMs understand developer-centric code completion?
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.AI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers:
Leveraging BART to Assess CS1 C++ Programming Assignments using Rubric-based Criteria
Multitask BART fine-tuning with rubric context and boundary-based soft labels yields lower mean absolute error and better grade-distribution alignment than single-task or code-only baselines on multi-semester CS1 C++ data.