Event: Neuron-Level Interventions for Gendered and Gender-Neutral Generation in Language Models

PRODUCT_LAUNCH · 2026-06-01 · Impact: MEDIUM

arXiv:2605.30717v1 (Announce Type: new)

Abstract: Language models (LMs) can produce gendered language and stereotypes even when given neutral prompts. Most prior work on gender bias in LMs examines gender through a binary lens (feminine vs. masculine), with limited attention to gender-neutral forms, such as they/them pronouns or neutrally phrased job titles. How gender-related signals are encoded in
