AuAu: A Benchmark for Auditing Authoritarian Alignment in Large Language Models 文章

ArXiv CS.CL2026-06-16NEWSen作者: Andreas Einwiller, Max Klabunde, Florian Lemmerich

AuAu: A Benchmark for Auditing Authoritarian Alignment in Large Language Models · 相关技术

相关技术