Interpretable Policy Distillation for Power Grid Topology Control 文章

ArXiv CS.AI2026-06-02NEWSen作者: Aleksandra Dmitruka, Karlis Freivalds

详细信息

来源站点: ArXiv CS.AI
作者: Aleksandra Dmitruka, Karlis Freivalds
文章类型: NEWS
语言: en
发布日期: 2026-06-02

摘要

arXiv:2606.00561v1 Announce Type: cross Abstract: Deep reinforcement learning (RL) offers a promising route to real-time power grid operation, yet large neural policies are costly to evaluate, hard to deploy on constrained hardware, and opaque to operators. We ask whether a Proximal Policy Optimization (PPO) agent for grid topology control can be compressed into compact tree-based surrogates without losing operational performance. A PPO teacher is trained on Grid2Op's standard 14-bus environment with a stability-oriented reward, using stress-focused data collection on critical, high-loading states. The policy is then distilled into a decision tree and a random forest. Across held-out validation episodes, both surrogates exceed the teacher in mean reward and survival length at a fraction of the inference cost.

Interpretable Policy Distillation for Power Grid Topology Control 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (1)

相关技术查看全部 (5)