Learning to Search and Searching to Learn for Generalization in Planning 文章

ArXiv CS.AI2026-05-26NEWSen作者: Michael Aichm\"uller, Yannik Hesse, Hector Geffner

详细信息

来源站点
ArXiv CS.AI
作者
Michael Aichm\"uller, Yannik Hesse, Hector Geffner
文章类型
NEWS
语言
en
发布日期
2026-05-26

摘要

arXiv:2605.25720v1 Announce Type: new Abstract: Combinatorial generalization remains a central challenge in Deep Reinforcement Learning (DRL). Classical planning provides a simple yet challenging setting to study this problem through explicit relational descriptions, without requiring learning from perception. In sparse-reward domains, standard RL exploration via real-time search is ineffective, and learning-based planning methods often rely on expert demonstrations, hindsight relabeling, or random walks from the goal state. In contrast, planners rely on best-first search methods such as $\mathrm{A}^\star$ to solve problems from scratch. We propose a self-improving $\mathrm{WA}^\star$ learning framework in combination with a value heuristic represented by a Relational Graph Neural Network: the heuristic guides search, and the resulting search data updates the heuristic via $Q$-learning.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据