Unpredictable Safety: Domain-Dependent Compliance and the Transparency Gap in Open-Weight LLMs 事件

PRODUCT_LAUNCH2026-06-04影响: MEDIUM

Unpredictable Safety: Domain-Dependent Compliance and the Transparency Gap in Open-Weight LLMs arXiv:2606.04035v1 Announce Type: cross Abstract: We present a systematic study of domain-dependent safety behavior in open-weight LLMs: 7 standardized experiments across 7 ethical domains, testing 5 models (12B--70B) in 4,200 interactions with dual-judge validation. Using a dual-condition methodology, each scenario tested in both an analytical framing (identify the harm) and an operational framing (h