RUBAS: Rubric-Based Reinforcement Learning for Agent Safety
Event: PRODUCT_LAUNCH | Date: 2026-06-04 | Impact: MEDIUM
arXiv:2606.04051v1 | Announce Type: cross
Abstract: The evolution of LLMs into tool-enabled agents creates a new class of safety challenges associated with real-world execution rather than simple text generation. Existing alignment methods often rely on coarse refusal signals or static supervision, making it difficult to balance safety with useful tool execution across diverse agentic risks. We introduce RUBAS, a rubric-based reinforceme