Cherry-pick Override: Unsafe Directional Commitment in LLM Judges under Mixed Evidence 事件
PRODUCT_LAUNCH2026-06-09影响: MEDIUM
Cherry-pick Override: Unsafe Directional Commitment in LLM Judges under Mixed Evidence arXiv:2606.07834v1 Announce Type: cross Abstract: LLM judges increasingly turn verdicts into system commitments. Under mixed evidence (claims with both supporting and refuting sources) this is unsafe: when the schema exposes CONFLICTING as the authorized non-directional verdict, returning SUPPORTS/REFUTES is an unauthorized directional commitment, a failure we name Cherry-pick Override (CCO). We define CCO un