Beyond English benchmarks: clinical LLM evaluation in Brazilian Portuguese

Event: PRODUCT_LAUNCH | 2026-06-09 | Impact: MEDIUM

arXiv:2606.07853v1 | Announce Type: cross

Abstract: Large language models are transforming clinical decision support and its application in real-world settings. Yet most benchmarks are conducted in English, and cross-lingual evaluation is needed to address the language gaps in global access. We introduce ClinicalBr, the first bilingual benchmark for clinical decision-making built from real Brazilian case reports. The corpus