Sub-Semantic Image Segmentation 文章

ArXiv CS.CV2026-06-16NEWSen作者: Aviad Cohen Zada, Nadav Orenstein, Shai Avidan, Gal Oren

详细信息

来源站点
ArXiv CS.CV
作者
Aviad Cohen Zada, Nadav Orenstein, Shai Avidan, Gal Oren
文章类型
NEWS
语言
en
发布日期
2026-06-16

摘要

arXiv:2606.14754v1 Announce Type: new Abstract: Images can be segmented based on visual cues (i.e., texture segmentation) or into objects (i.e., semantic segmentation). We propose a new category of sub-semantic image segmentation that blurs the line between the two. In sub-semantic image segmentation, language is not used to name whole objects. Instead, it is used to partition an image into stable appearance patterns that can be described by language. To do that, we couple a general-purpose vision-language model to SAM 3, a promptable segmentation backbone whose native text pathway can ground rich descriptions into masks. Simple coupling fails for a number of reasons that we identify in the paper, and we overcome them by introducing DETECTURE that resolves three concrete failure modes -- language leakage between texture regions, prompt competition inside the segmentation backbone, and semantic distortion at the language-to-mask interface.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据