VLESA: Vision-Language Embodied Safety Agent for Human Activity Monitoring 事件

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

VLESA: Vision-Language Embodied Safety Agent for Human Activity Monitoring arXiv:2606.03954v1 Announce Type: new Abstract: As AI systems increasingly assist humans in physical tasks, ensuring safety becomes paramount -- physical actions carry immediate and irreversible consequences that digital errors do not. We introduce the Vision-Language Embodied Safety Agent (VLESA), a framework that monitors human activities from egocentric video and triggers real-time safety interventions when dangerous