Benchmarking and Learning Real-World Customer Service Dialogue 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Benchmarking and Learning Real-World Customer Service Dialogue arXiv:2510.22143v3 Announce Type: replace Abstract: Existing benchmarks and training pipelines for industrial intelligent customer service (ICS) remain misaligned with real-world dialogue requirements, overemphasizing verifiable task success while under-measuring subjective service quality and realistic failure modes, leaving a gap between offline gains and deployable dialogue behavior. We close this gap with a benchmark-to-optimiza