Conceptual Steganography 事件
PRODUCT_LAUNCH2026-05-27影响: MEDIUM
Conceptual Steganography arXiv:2605.26537v1 Announce Type: new Abstract: Language Models (LMs) emit Chains-of-Thought (CoTs) that drive much of their capability. However, the same sequence that carries useful reasoning can also covertly convey messages: a misaligned model may embed covert information in its CoT that slips through human supervision, a form of steganography known as encoded reasoning. Prior LM steganography schemes operate in the token or lexical space, and a content-preserving p
相关产品查看全部 (10)
相关报道查看全部 (1)
Conceptual Steganography
ArXiv CS.CL2026-05-27