NormEval: A Unified Multi-Metric Framework for Evaluating Semantic Fidelity in Text Normalization 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

NormEval: A Unified Multi-Metric Framework for Evaluating Semantic Fidelity in Text Normalization arXiv:2511.20409v2 Announce Type: replace Abstract: Text normalization methods such as stemming and lemmatization are fundamental components of NLP pipelines. As new normalization tools are developed for diverse languages, evaluation methodologies remain fragmented, relying on Compression Ratio, downstream accuracy, or sequence-to-sequence prediction scores in isolation, failing to distinguish betw