arxiv: 2606.01820 · v1 · pith:YO3LCODMnew · submitted 2026-06-01 · 💻 cs.CL

TalkTag: Fine-Grained Morphosyntactic Error Annotation for Transcribed Speech

classification 💻 cs.CL
keywords annotation, error, talktag, morphosyntactic, data, fine-grained, linguistic, alternative

Fine-grained morphosyntactic error annotation is important in clinical and developmental language research, yet it is labour-intensive, expert-dependent, and difficult to scale. We present TalkTag, a lightweight LLM-based tool fine-tuned to automate CHAT-style error annotation of spoken-language transcripts. Developed under conditions of extreme data scarcity using children's narrative data, the system shows that such linguistic analysis is feasible in low-resource settings. Our evaluation demonstrates that TalkTag produces encouragingly precise annotations while effectively identifying instances where linguistic ambiguity makes automated tagging genuinely complex. In summary, TalkTag offers a scalable alternative to, and practically viable support for, manual morphosyntactic error annotation.
