LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence · Event
PRODUCT_LAUNCH · 2026-05-26 · Impact: MEDIUM
arXiv:2605.25979v1 · Announce Type: new
Abstract: We introduce LLaVA-OneVision-2 (LLaVA-OV-2), the most capable vision-language model in the LLaVA-OneVision series to date, achieving superior performance across a broad range of multimodal benchmarks. The model builds on a native OneVision-Encoder and incorporates Windowed Attention for efficient local computation while maintaining native resolution. Its key advance is codec-stream
Related coverage
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence
ArXiv CS.CV · 2026-05-26