Sub-Semantic Image Segmentation 文章

ArXiv CS.CV2026-06-16NEWSen作者: Aviad Cohen Zada, Nadav Orenstein, Shai Avidan, Gal Oren

详细信息

来源站点: ArXiv CS.CV
作者: Aviad Cohen Zada, Nadav Orenstein, Shai Avidan, Gal Oren
文章类型: NEWS
语言: en
发布日期: 2026-06-16

摘要

arXiv:2606.14754v1 Announce Type: new Abstract: Images can be segmented based on visual cues (i.e., texture segmentation) or into objects (i.e., semantic segmentation). We propose a new category of sub-semantic image segmentation that blurs the line between the two. In sub-semantic image segmentation, language is not used to name whole objects. Instead, it is used to partition an image into stable appearance patterns that can be described by language. To do that, we couple a general-purpose vision-language model to SAM 3, a promptable segmentation backbone whose native text pathway can ground rich descriptions into masks. Simple coupling fails for a number of reasons that we identify in the paper, and we overcome them by introducing DETECTURE that resolves three concrete failure modes -- language leakage between texture regions, prompt competition inside the segmentation backbone, and semantic distortion at the language-to-mask interface.

Sub-Semantic Image Segmentation 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (10)

相关技术查看全部 (2)