MIRA: A Bilingual Benchmark for Medical Information Response Audit 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

MIRA: A Bilingual Benchmark for Medical Information Response Audit arXiv:2605.28025v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used to provide public-facing health information, yet existing safety evaluations overlook whether responses preserve comparable medical information across different user phrasings of the same question. To address this, we introduce the Medical Information Response Audit (MIRA), a bilingual, controlled benchmark that assesses whether