STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media 事件

ACQUISITION2026-05-26影响: HIGH

STREAM: A Data-Centric Framework for Mining High-Value Task-Oriented Dialogues from Streaming Media arXiv:2605.25162v1 Announce Type: new Abstract: Large language models for vertical domains are bottlenecked by the scarcity of complex, domain-specific task-oriented dialogues. Existing data acquisition pipelines face a persistent trilemma: expert annotation is expensive, real-world service conversations are constrained by privacy and commercial restrictions, and static corpora quickly become tem