MCBench: A Multicontext Safety Assessment Benchmark for Omni Large Language Models
Event: PRODUCT_LAUNCH | Date: 2026-06-05 | Impact: MEDIUM
arXiv:2606.05177v1. Announce Type: new.
Abstract: Existing multimodal safety benchmarks focus solely on visual inputs and cannot assess Omni Large Language Models (LLMs) that process vision, audio, and text. We introduce MCBench, a benchmark with 1196 scenarios spanning four safety categories that require integrating multiple modalities for accurate safety assessment. Each unsafe scenario is paired with a minimally
Related coverage (1)
MCBench: A Multicontext Safety Assessment Benchmark for Omni Large Language Models
ArXiv CS.CL, 2026-06-05