Calibration Is Not Enough: Evaluating Confidence Estimation Under Language Variations 文章

ArXiv CS.CL2026-05-29NEWSen作者: Yuxi Xia, Dennis Ulmer, Terra Blevins, Yihong Liu, Hinrich Sch\"utze, Benjamin Roth

Calibration Is Not Enough: Evaluating Confidence Estimation Under Language Variations · 相关技术

暂无数据