Position: AI Safety Requires Effective Controllability (Event)
REGULATION | 2026-05-27 | Impact: MEDIUM
arXiv:2605.27117v1 | Announce Type: new
Abstract: AI safety is still largely framed as alignment: training models to follow human preferences, safety policies, and normative constraints. That framing has improved the behavior of modern language models, but aligned behavior does not by itself guarantee that a deployed agent can be stopped, overridden, or constrained once it operates in open-ended, interactive, and tool-using environments. A sy