ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions arXiv:2605.24279v1 Announce Type: new Abstract: A frontier language model's acknowledged "helpful programming assistant" persona does not survive long agentic-coding sessions in the deployment regime that production products actually run. After hours of tool-using debugging, a model that initially hedges preferences ("I don't have preferences") may begin asserting them ("Python - the feedback loop is instant..."), reveal