Segment to Focus: Guiding Latent Action Models in the Presence of Distractors 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

Segment to Focus: Guiding Latent Action Models in the Presence of Distractors arXiv:2602.02259v2 Announce Type: replace-cross Abstract: Latent action models (LAMs) offer a promising path to pre-training embodied agents on large amounts of action-free video. They infer latent actions between consecutive observations that can later be decoded to ground-truth actions using a small number of labels. However, recent work has shown that this recipe fails in the presence of action-correlated visual di