Enhancing MedSAM with a Lightweight Box Predictor for Medical Image Segmentation 文章

ArXiv CS.CV2026-06-04NEWSen作者: Amirhossein Movahedisefat, Amirreza Fateh, Mohammad Reza Mohammadi

摘要

arXiv:2606.04705v1 Announce Type: new Abstract: Semantic segmentation in medical imaging is a critical yet challenging task due to data scarcity and high variability across modalities. While foundation models like the Segment Anything Model (SAM) show promise, they often struggle with medical images without specific adaptation. Moreover, point prompts, despite being the most natural form of user interaction, provide insufficient spatial context for reliable segmentation, particularly when target structures are irregular or poorly contrasted. In this paper, we propose an enhanced segmentation framework that integrates a lightweight Box Predictor module into the MedSAM architecture. The Box Predictor estimates an approximate bounding box from a single user click using localized image embedding features, providing spatial guidance that reduces the ambiguity of point prompts, while introducing only 1.6M additional parameters and negligible inference overhead.

Enhancing MedSAM with a Lightweight Box Predictor for Medical Image Segmentation 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (2)

相关技术查看全部 (1)