详细信息
- 来源站点
- ArXiv CS.CV
- 作者
- Sol\`ene Debuys\`ere, Nicolas Trouv\'e, Nathan Letheule, Elise Colin, Georgia Channing
- 文章类型
- PAPER
- 语言
- en
- 发布日期
- 2026-06-19
摘要
arXiv:2606.20523v1 Announce Type: new Abstract: Multimodal foundation models have advanced rapidly thanks to large optical benchmarks, but comparable resources for synthetic aperture radar (SAR) remain limited. Existing SAR--optical datasets largely rely on low-resolution, intensity-only Ground Range Detected~(GRD) products and do not preserve complex-valued SAR measurements or native acquisition geometry, which restricts physically grounded multimodal learning. In particular, large-scale public datasets combining very-high-resolution (VHR) SAR SLC, aligned optical imagery, and natural-language descriptions are still lacking. We present a VHR SAR--optical--text dataset built from open-access Umbra spotlight acquisitions distributed as Sensor Independent Complex Data (SICD). From around 2,500 worldwide scenes (VV/HH, 20cm--2m native resolution), we standardize all SAR data to an 80cm slant-range grid via band-limited FFT resampling and tile the imagery into 1024 by 1024 patches.
相关事件
暂无数据
相关人物
暂无数据