LiteViLNet: Lightweight Vision-LiDAR Fusion Network for Efficient Road Segmentation 文章

ArXiv CS.CV2026-06-01NEWSen作者: Daojie Peng, Bingtao Wang, Fulong Ma, Liang Zhang, Jun Ma

摘要

arXiv:2605.21007v2 Announce Type: replace Abstract: Road segmentation is a fundamental perception task for autonomous driving and intelligent robotic systems, requiring both high accuracy and real-time inference, especially for deployment on resource-constrained edge devices. Existing multi-modal road segmentation methods often rely on heavy transformer-based encoders to achieve state-of-the-art performance, but their enormous computational cost prohibits real-time deployment on embedded platforms. To address this dilemma, we propose LiteViLNet, a lightweight multi-modal network that fuses RGB texture information and LiDAR geometric information for efficient road segmentation. Specifically, we design a dual-stream lightweight encoder and depth-wise separable convolutions to extract hierarchical features from both modalities with minimal parameters.