Do Gender Cues Affect LLM Value Trade-offs? Evidence from a Controlled Decision Benchmark 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Do Gender Cues Affect LLM Value Trade-offs? Evidence from a Controlled Decision Benchmark arXiv:2606.02214v1 Announce Type: new Abstract: Large language models are increasingly used in value-sensitive decision settings, where irrelevant demographic cues should not alter judgments. We construct the Realistic Value Decision Benchmark (RVDB), a controlled benchmark that varies only the role-gender configuration while holding the scenario, ordered value pair, roles, candidate decisions, Value Dista