ATV-Net: Adaptive Triple-View Network with Dynamic Feature Fusion 文章

ArXiv CS.CV2026-05-26NEWSen作者: Hsin-Jui Pan, Sheng-Wei Chan, Meng-Qian Li, Chun-Po Shen

摘要

arXiv:2605.25803v1 Announce Type: new Abstract: Recent semantic segmentation research has increasingly moved toward stronger context modeling, dense attention, and transformer-based architectures. Although these models achieve impressive performance, classical CNN-based segmentation pipelines remain attractive because of their simplicity, efficiency, and ease of implementation. This paper revisits a practical question: how far can a ResNet-based segmentation model be improved by only modifying the segmentation head? We propose ATV-Net, an Adaptive Triple-View Network that strengthens a ResNet-101 backbone using three simple but complementary receptive-field views. The micro view captures point-wise semantic responses, the local view models neighborhood structures and object boundaries, and the scout view provides enlarged contextual cues.