SAM-Enhanced Segmentation on Road Datasets: Balancing Critical Classes in Autonomous Driving 文章

ArXiv CS.CV2026-05-28NEWSen作者: Toomas Tahves, Mauro Bellone, Junyi Gu, Raivo Sell

详细信息

来源站点
ArXiv CS.CV
作者
Toomas Tahves, Mauro Bellone, Junyi Gu, Raivo Sell
文章类型
NEWS
语言
en
发布日期
2026-05-28

摘要

arXiv:2605.28136v1 Announce Type: new Abstract: Dense semantic segmentation is essential for autonomous driving, yet many multi-modal datasets lack pixel-level annotations. The Zenseact Open Dataset (ZOD) provides rich multi-sensor data but only bounding-box labels, limiting its use for segmentation research. Our primary contribution is a Segment Anything Model (SAM)-based annotation pipeline that produces dense, pixel-level annotations for ZOD by converting bounding boxes into semantic masks. In this pilot study, we process over 100,000 frames and manually curate a 2,300-frame subset (36% acceptance rate) to establish a reliable baseline. Using these annotations, we evaluate transformer-based CLFT and CNN-based DeepLabV3+ architectures across diverse weather conditions, achieving up to 48.1% mIoU with CLFT-Hybrid. To address extreme class imbalance, where pedestrians, cyclists, and signs constitute less than 1% of pixels, we explore specialized models targeting rare classes.