Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations 文章

ArXiv CS.CL2026-05-28NEWSen作者: Sachin Kumar

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations · 相关技术

相关技术