SAM3 Self-Distillation for Fine-Grained GOOSE 2D Semantic Segmentation 文章

ArXiv CS.CV2026-06-19NEWSen作者: Xuesong Wang

详细信息

来源站点
ArXiv CS.CV
作者
Xuesong Wang
文章类型
NEWS
语言
en
发布日期
2026-06-19

摘要

arXiv:2606.20130v1 Announce Type: new Abstract: We describe our 4th-place entry to the ICRA 2026 GOOSE 2D Fine-Grained Semantic Segmentation Challenge, which reached a composite mean Intersection-over-Union (mIoU) of 69.73% on the official 1,815-image test set. Our model adapts the image encoder of a recent visual foundation model, Segment Anything Model 3 (SAM3), with a lightweight decoder. Beyond this, we contribute two techniques and one empirical finding: (i) a self-distillation scheme that re-uses SAM3 itself, prompted with ground-truth boxes, as a teacher on the classes where it outperforms our own model; (ii) an image-level multi-scale test-time augmentation scheme that restores multi-scale inference for a fixed-input-size model by rescaling the image rather than the model input; and (iii) the finding that an aggressive photometric distortion from a winning 2025 GOOSE 2D entry, transplanted onto our pipeline, is its single largest source of improvement.