MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization 文章

ArXiv CS.CL2026-05-29NEWSen作者: Anisha Saha, Varsha Suresh, Teodora Kamova, Sophia Wiedmann, Timothy Hospedales, Vera Demberg

MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization · 相关技术