MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization 事件
PRODUCT_LAUNCH2026-05-29影响: MEDIUM
MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization arXiv:2605.29951v1 Announce Type: cross Abstract: Understanding how harm emerges from interaction between otherwise benign image-text pairs requires intent-aware cross-modal reasoning beyond surface-level features. Existing vision-language models (VLMs) excel at literal reasoning over perceptual cues but often fail to derive harmful semantics that rely on implicit, context-dependent reasoning. To ev