Learning to Search and Searching to Learn for Generalization in Planning 文章

ArXiv CS.AI2026-05-26NEWSen作者: Michael Aichm\"uller, Yannik Hesse, Hector Geffner

详细信息

来源站点: ArXiv CS.AI
作者: Michael Aichm\"uller, Yannik Hesse, Hector Geffner
文章类型: NEWS
语言: en
发布日期: 2026-05-26

摘要

arXiv:2605.25720v1 Announce Type: new Abstract: Combinatorial generalization remains a central challenge in Deep Reinforcement Learning (DRL). Classical planning provides a simple yet challenging setting to study this problem through explicit relational descriptions, without requiring learning from perception. In sparse-reward domains, standard RL exploration via real-time search is ineffective, and learning-based planning methods often rely on expert demonstrations, hindsight relabeling, or random walks from the goal state. In contrast, planners rely on best-first search methods such as $\mathrm{A}^\star$ to solve problems from scratch. We propose a self-improving $\mathrm{WA}^\star$ learning framework in combination with a value heuristic represented by a Relational Graph Neural Network: the heuristic guides search, and the resulting search data updates the heuristic via $Q$-learning.

Learning to Search and Searching to Learn for Generalization in Planning 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术查看全部 (7)