Vision-language Models for Driver Monitoring Systems: A Driver Activity Description Dataset 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Vision-language Models for Driver Monitoring Systems: A Driver Activity Description Dataset arXiv:2606.02273v1 Announce Type: new Abstract: Understanding subtle driver actions is essential for building reliable driver monitoring systems. Existing visionlanguage models (VLMs) are trained on general datasets and struggle to recognize fine distinctions in driver behaviors. This paper addresses this limitation by creating a detailed natural language version of the Drive&Act dataset. We evaluate thr