Epistemic Injustice in Language Models: An Audit of Pretraining Filters and Guardrails 事件

PRODUCT_LAUNCH2026-06-05影响: MEDIUM

Epistemic Injustice in Language Models: An Audit of Pretraining Filters and Guardrails arXiv:2606.05936v1 Announce Type: new Abstract: Modern language models rely on pretraining filters to remove undesirable content from training corpora and inference-time guardrails to suppress undesirable outputs during deployment. In this paper, we examine how these filtering and moderation decisions produce forms of epistemic erasure and reveal tensions both across automated systems and between these system