ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions 事件
PRODUCT_LAUNCH2026-05-26影响: MEDIUM
ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions arXiv:2605.24279v1 Announce Type: new Abstract: A frontier language model's acknowledged "helpful programming assistant" persona does not survive long agentic-coding sessions in the deployment regime that production products actually run. After hours of tool-using debugging, a model that initially hedges preferences ("I don't have preferences") may begin asserting them ("Python - the feedback loop is instant..."), reveal
相关产品查看全部 (10)
相关报道查看全部 (1)
ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions
ArXiv CS.CL2026-05-26