Grounding Text Embeddings in Stakeholder Associations 文章

ArXiv CS.CL2026-05-27NEWSen作者: Jonathan Rystr{\o}m, Sofie Burgos-Thorsen, Zihao Fu, Johan Irving S{\o}ltoft, Kenneth C. Enevoldsen, Chris Russell

摘要

arXiv:2605.27168v1 Announce Type: new Abstract: Text embeddings are widely used to analyse large corpora of complex texts. However, it is unclear whether the embeddings capture the same semantic distances as the human experts using them. Ensuring alignment between embedding representations and human intentions is essential for valid analyses. We present the Stakeholder Grounding Exercise, a method for making expert associations explicit and grounding embedding model results in human understanding. In our primary case study on Danish policy issues, we find that neural text embeddings are substantially less reliable than human experts (19-26 pp gap), and that this misalignment propagates to downstream clustering performance (Spearman $\rho=0.9$ between exercise ranking and cluster quality).

Grounding Text Embeddings in Stakeholder Associations 文章

摘要

相关事件查看全部 (1)

相关公司查看全部 (4)

相关人物

相关产品查看全部 (9)

相关技术查看全部 (18)