Obfuscation Rules for Detecting and Detoxifying Korean Toxicity 事件

PRODUCT_LAUNCH2026-05-29影响: MEDIUM

Obfuscation Rules for Detecting and Detoxifying Korean Toxicity arXiv:2510.10961v3 Announce Type: replace Abstract: As language models become increasingly deployed in online environments, toxicity detection and detoxification have received growing attention. Existing studies primarily focus on non-obfuscated text, which limits robustness when users intentionally disguise toxic expressions. In particular, Korean toxic expressions can be easily disguised through agglutinative morphology and Hange

Obfuscation Rules for Detecting and Detoxifying Korean Toxicity · 相关报道