Stop Listening to Me! How Multi-turn Conversations Can Degrade LLM Reliability
Event: PRODUCT_LAUNCH | Date: 2026-05-27 | Impact: MEDIUM
arXiv:2603.11394v3 Announce Type: replace
Abstract: Large language models (LLMs) excel on static benchmarks, but their performance across multi-turn conversations, which better reflect real-world usage, remains understudied. Addressing this gap is critical in high-stakes settings like healthcare, where patients and clinicians are turning to LLM chatbots to address their medical inquiries. Here, we introduce the "stic