From Automation to Collaboration: Human-in-the-Loop Methods for Safe and Trustworthy NLP 事件
PRODUCT_LAUNCH2026-05-26影响: MEDIUM
From Automation to Collaboration: Human-in-the-Loop Methods for Safe and Trustworthy NLP arXiv:2605.25226v1 Announce Type: new Abstract: Large language models are widely deployed in high-stakes NLP tasks, yet risks such as bias, hallucination, adversarial vulnerability and unreliable generalization remain. Probe-based auditing reveals inconsistencies in model behavior. Adversarial text generation uncovers robustness gaps, especially in lower-resourced languages with limited benchmarks. Enterpri