MedMosaic: A Challenging Large Scale Benchmark of Diverse Medical Audio 文章

ArXiv CS.CL2026-05-29NEWSen作者: Harshit Rajgarhia, Shuubham Ojha, Asif Shaik, Akhil Pothanapalli, Rachuri Lokesh, Abhishek Mukherji, Prasanna Desikan

查看原文 →

关系图谱

摘要

arXiv:2605.00969v2 Announce Type: replace-cross Abstract: Medical audio data is difficult to collect due to privacy regulations and high annotation costs arising from domain expertise. Thus, existing benchmarks tend to underrepresent complex medical audio scenarios. To address this challenge, we present MedMosaic, a medical audio question-answering dataset designed to benchmark language and audio reasoning models under realistic clinical constraints. MedMosaic features a diverse range of medical audio types, including condition-related physiological sounds, carefully constructed synthetic voices to mimic speech with artifacts as well as real short and long length clinical conversations to model varying context lengths. The dataset also features a total of 46,701 question-answer pairs, spanning categories such as multiple-choice, sequential multi-turn, and open-ended question-answers, enabling systematic evaluation of multi-hop reasoning and answer generation capabilities.

MedMosaic: A Challenging Large Scale Benchmark of Diverse Medical Audio 文章

摘要

相关事件查看全部 (2)

相关公司

相关人物

相关产品查看全部 (2)

相关技术