Curriculum-Adapted Robust Reinforcement Learning for UAV Deconfliction in Adversarial Environments 文章

ArXiv CS.AI2026-06-03NEWSen作者: Deepak Kumar Panda, Adolfo Perrusquia, Weisi Guo

摘要

arXiv:2506.21129v2 Announce Type: replace-cross Abstract: Autonomous unmanned aerial vehicles (UAVs) increasingly rely on reinforcement learning (RL) for navigation. However, global navigation satellite system (GNSS) spoofing attacks can induce out-of-distribution observation shifts that corrupt value estimation and degrade mission performance. Existing robust RL approaches typically improve resilience against specific attack models but often fail to generalize to attacks not encountered during training. To address this limitation, we propose a curriculum-guided adaptation framework that progressively exposes a robust policy to gradient-based adversarial observation perturbations of increasing intensity while aligning temporal-difference (TD) error distributions across curriculum stages. Rather than adapting to a particular attack model, the proposed approach preserves TD-error consistency to promote transferability across attack conditions.