Beyond English benchmarks: clinical llm evaluation in Brazilian Portuguese 事件

PRODUCT_LAUNCH2026-06-09影响: MEDIUM

Beyond English benchmarks: clinical llm evaluation in Brazilian Portuguese arXiv:2606.07853v1 Announce Type: cross Abstract: Large Language Models are transforming the support for clinical decision and their application in real scenarios. Yet, most benchmarks are conducted in English, and cross-lingual evaluation is needed to tackle the language gaps in global access. We introduce ClinicalBr, the first bilingual benchmark for clinical decision built from real Brazilian case reports. The corpus

Beyond English benchmarks: clinical llm evaluation in Brazilian Portuguese · 相关人物