CoEdIT: Text Editing by Task-Specific Instruction Tuning

Dhruv Kumar; Dongyeop Kang; Ryan Koo; Vipul Raheja

arxiv: 2305.09857 · v2 · pith:VLLU2UR7new · submitted 2023-05-17 · 💻 cs.CL · cs.AI

CoEdIT: Text Editing by Task-Specific Instruction Tuning

Vipul Raheja , Dhruv Kumar , Ryan Koo , Dongyeop Kang This is my paper

classification 💻 cs.CL cs.AI

keywords textinstructionscoediteditingstate-of-the-artavailableeditmodel

0 comments

read the original abstract

We introduce CoEdIT, a state-of-the-art text editing system for writing assistance. CoEdIT takes instructions from the user specifying the attributes of the desired text, such as "Make the sentence simpler" or "Write it in a more neutral style," and outputs the edited text. We present a large language model fine-tuned on a diverse collection of task-specific instructions for text editing (a total of 82K instructions). Our model (1) achieves state-of-the-art performance on various text editing benchmarks, (2) is competitive with publicly available largest-sized LLMs trained on instructions while being nearly 60x smaller, (3) is capable of generalizing to unseen edit instructions, and (4) exhibits abilities to generalize to composite instructions containing different combinations of edit actions. Through extensive qualitative and quantitative analysis, we show that writers prefer the edits suggested by CoEdIT relative to other state-of-the-art text editing models. Our code, data, and models are publicly available at https://github.com/vipulraheja/coedit.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

EditPropBench: Measuring Factual Edit Propagation in Scientific Manuscripts
cs.CL 2026-05 unverdicted novelty 7.0

EditPropBench evaluates LLM editors on propagating factual edits to dependent claims in synthetic scientific manuscripts, showing that even the strongest systems miss roughly 30% of required updates on hard cases.
LLMs Corrupt Your Documents When You Delegate
cs.CL 2026-04 unverdicted novelty 6.0

LLMs corrupt an average of 25% of document content during long delegated editing workflows across 52 domains, even frontier models, and agentic tools do not mitigate the issue.
Echo: Learning from Experience Data via User-Driven Refinement
cs.AI 2026-05 unverdicted novelty 5.0

Echo is a framework that harvests user-driven refinements of agent proposals as training signals to align models with real-world needs, demonstrated by raising code completion acceptance from 25.7% to 35.7% in production.