SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning 文章

ArXiv CS.CV2026-05-27NEWSen作者: Philip Schroeder, Thomas Weng, Karl Schmeckpeper, Eric Rosen, Stephen Hart, Ondrej Biza

SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning · 相关技术