Geometry-Preserving Unsupervised Alignment for Heterogeneous Foundation Models 事件

PRODUCT_LAUNCH2026-06-04影响: MEDIUM

Geometry-Preserving Unsupervised Alignment for Heterogeneous Foundation Models arXiv:2606.04385v1 Announce Type: new Abstract: Foundation models have driven rapid progress in computer vision, yet the two dominant paradigms, vision-language foundation models (VLMs) and vision-only foundation models (VFMs), remain only partially compatible. VLMs offer language-grounded semantic alignment but are often visually coarse, while VFMs learn discriminative perceptual geometry but lack semantic grounding