Obfuscation Rules for Detecting and Detoxifying Korean Toxicity 事件

PRODUCT_LAUNCH2026-05-29影响: MEDIUM

Obfuscation Rules for Detecting and Detoxifying Korean Toxicity arXiv:2510.10961v3 Announce Type: replace Abstract: As language models become increasingly deployed in online environments, toxicity detection and detoxification have received growing attention. Existing studies primarily focus on non-obfuscated text, which limits robustness when users intentionally disguise toxic expressions. In particular, Korean toxic expressions can be easily disguised through agglutinative morphology and Hange