Unified Video-Action Joint Denoising for Dexterous Action and Data Generation 事件

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

Unified Video-Action Joint Denoising for Dexterous Action and Data Generation arXiv:2606.03868v1 Announce Type: new Abstract: Recent world action models leverage video foundation models by aligning broad visual-dynamics priors with executable robot actions. We revisit this alignment from a distributional perspective. Existing formulations typically narrow the aligned prior into an observation-conditioned policy distribution over future actions. In contrast, we keep the distribution broader by m