Physics-Guided Policy Optimization with Self-Distillation

ArXiv CS.AI · 2026-06-03 · Authors: Ke Wang, Yuning Wu, Haoran Liu, Chaoqun Jia, Devin Chen, Kai Wei

Physics-Guided Policy Optimization with Self-Distillation · Related Technologies