Latent Q-Barrier Shielding for Safe In-Context Reinforcement Learning 文章

ArXiv CS.AI2026-05-26NEWSen作者: Minjae Kwon, Amir Moeini, Shangtong Zhang, Lu Feng

详细信息

来源站点: ArXiv CS.AI
作者: Minjae Kwon, Amir Moeini, Shangtong Zhang, Lu Feng
文章类型: NEWS
语言: en
发布日期: 2026-05-26

摘要

arXiv:2605.25267v1 Announce Type: cross Abstract: Safe in-context reinforcement learning (ICRL) adapts online from interaction history without test-time parameter updates while controlling episode cost under a safety budget. Under out-of-distribution (OOD) deployment shifts, pretraining-only safe ICRL can give poor reward-safety tradeoffs because the remaining budget affects behavior only through frozen policy conditioning, not an explicit action-level check against predicted future cost. We propose a latent Q-Barrier shield that learns a context representation, latent dynamics, and an ensemble cost critic before deployment. Without parameter updates, the shield infers context from history and filters or softly reweights candidate actions using the remaining budget and predicted future cost.

Latent Q-Barrier Shielding for Safe In-Context Reinforcement Learning 文章

详细信息

摘要

相关事件

相关公司查看全部 (5)

相关人物

相关产品查看全部 (4)

相关技术查看全部 (20)