TalkTag: Fine-Grained Morphosyntactic Error Annotation for Transcribed Speech 文章

ArXiv CS.CL2026-06-02NEWSen作者: Shamira Venturini (Karlsruhe Institute of Technology, Karlsruhe University of Applied Sciences), Oliver Hennh\"ofer (Karlsruhe University of Applied Sciences), Steffen Kinkel (Karlsruhe University of Applied Sciences), Jannik Str\"otgen (Karlsruhe University of Applied Sciences)

摘要

arXiv:2606.01820v1 Announce Type: new Abstract: Fine-grained morphosyntactic error annotation is important in clinical and developmental language research, yet it is labour-intensive, expert-dependent, and difficult to scale. We present TalkTag, an LLM-based lightweight tool fine-tuned to automate CHAT-style error annotation in spoken-language transcripts. Developed under conditions of extreme data scarcity using children's narrative data, the system shows the feasibility of linguistic analysis in low-resource settings. Our evaluation demonstrates that TalkTag produces encouragingly precise annotation while effectively identifying instances where linguistic ambiguity makes automated tagging genuinely complex. In summary, with TalkTag, we provide a scalable alternative to manual error annotation and practically viable support for morphosyntactic error annotation.

相关公司

暂无数据

相关人物

暂无数据