Interpretable Modeling of Driver Attention Shifts with a Vision--Language Model 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

Interpretable Modeling of Driver Attention Shifts with a Vision--Language Model arXiv:2508.05852v2 Announce Type: replace Abstract: Driver gaze is commonly modeled as a spatial heatmap, but heatmaps alone are difficult for humans to interpret because they do not explain which road object or region is being monitored or why an attention shift may matter. This study examines whether minimal human-grounded supervision can steer a vision--language model toward interpretable descriptions of driver a

Interpretable Modeling of Driver Attention Shifts with a Vision--Language Model · 相关技术