Improving Visual Grounding in Remote Sensing via Cluster-Guided Refinement and Model Ensemble Voting 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

Improving Visual Grounding in Remote Sensing via Cluster-Guided Refinement and Model Ensemble Voting arXiv:2606.00556v1 Announce Type: new Abstract: Visual grounding aims to locate image regions that correspond to natural language descriptions and is a key component of interpretable vision systems. In remote sensing imagery, grounding is particularly challenging due to complex scenes, small objects, and large variations in scale. Relying on a single model is often insufficient to address these