SUSD: Structured Unsupervised Skill Discovery through State Factorization 文章

ArXiv CS.AI2026-06-04NEWSen作者: Seyed Mohammad Hadi Hosseini, Mahdieh Soleymani Baghshah

摘要

arXiv:2602.01619v2 Announce Type: replace-cross Abstract: Unsupervised Skill Discovery (USD) aims to autonomously learn a diverse set of skills without relying on extrinsic rewards. One of the most common USD approaches is to maximize the Mutual Information (MI) between skill latent variables and states. However, MI-based methods tend to favor simple, static skills due to their invariance properties, limiting the discovery of dynamic, task-relevant behaviors. Distance-Maximizing Skill Discovery (DSD) promotes more dynamic skills by leveraging state-space distances, yet still fall short in encouraging comprehensive skill sets that engage all controllable factors or entities in the environment. In this work, we introduce SUSD, a novel framework that harnesses the compositional structure of environments by factorizing the state space into independent components (e.g., objects or controllable entities).