LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence arXiv:2605.25979v1 Announce Type: new Abstract: We introduce LLaVA-OneVision-2 (LLaVA-OV-2), the most capable vision-language model in the LLaVA-OneVision series to date, achieving superior performance across a broad range of multimodal benchmarks. The model builds on a native OneVision-Encoder and incorporates Windowed Attention for efficient local computation while maintaining native resolution. Its key advance is codec-stream