Janus: A Benchmark for Goal-Conditioned Information Distortion in LLMs 事件
PRODUCT_LAUNCH2026-06-10影响: MEDIUM
Janus: A Benchmark for Goal-Conditioned Information Distortion in LLMs arXiv:2606.10852v1 Announce Type: new Abstract: LLM deception is often evaluated through direct markers such as fabricated claims, explicit lies, or strategic concealment. However, many real-world misleading communications do not depend on false statements, rather, they arise from selective treatment of true material facts: omitting adverse evidence, softening unfavorable details, emphasizing favorable details, or replacing
相关产品查看全部 (10)
相关报道查看全部 (1)
Janus: A Benchmark for Goal-Conditioned Information Distortion in LLMs
ArXiv CS.CL2026-06-10