MIRA: A Bilingual Benchmark for Medical Information Response Audit
Event: PRODUCT_LAUNCH, 2026-05-28, Impact: MEDIUM
arXiv:2605.28025v1 (announce type: cross)
Abstract: Large language models (LLMs) are increasingly used to provide public-facing health information, yet existing safety evaluations overlook whether responses preserve comparable medical information across different user phrasings of the same question. To address this, we introduce the Medical Information Response Audit (MIRA), a bilingual, controlled benchmark that assesses whether
Source: arXiv cs.CL, 2026-05-28