Cosmos 3: Omnimodal World Models for Physical AI (Event)
OPEN_SOURCE | 2026-06-03 | Impact: MEDIUM
arXiv:2606.02800v1 (Announce Type: new)
Abstract: We introduce Cosmos 3, a family of omnimodal world models designed to jointly process and generate language, image, video, audio, and action sequences within a unified mixture-of-transformers architecture. By supporting highly flexible input-output configurations, Cosmos 3 seamlessly unifies critical modalities for Physical AI -- effectively subsuming vision-language models, video generators, world
Related coverage (1): Cosmos 3: Omnimodal World Models for Physical AI (ArXiv CS.CV, 2026-06-03)