Event: Code as a Weapon: A Consensus-Labeled Prompt Bank for Measuring Coding-Model Compliance with Malicious-Code Requests

REGULATION · 2026-05-28 · Impact: MEDIUM

arXiv:2605.28734v1 · Announce Type: cross

Abstract: A general-purpose language model that answers a harmful question returns text; a coding model that complies with a malicious request can return a working weapon -- a keylogger, a ransomware stub, an exploit that runs as written. This asymmetry in the severity of a single act of compliance implies coding-specialized models should c

Related companies

SEC (GOVERNMENT)
PUR (COMPANY)
arXiv (NONPROFIT)
GLE (NONPROFIT)
HuMAN (NONPROFIT)
Lowe (COMPANY)
ACT (NONPROFIT)
Actua (NONPROFIT)
Ratio (RESEARCH_INSTITUTE)