Position: Text Embeddings Should Capture Implicit Semantics, Not Just Surface Meaning 文章

ArXiv CS.CL2026-05-29NEWSen作者: Yiqun Sun, Qiang Huang, Anthony K. H. Tung, Jun Yu

摘要

arXiv:2506.08354v2 Announce Type: replace Abstract: This position paper argues that text embedding research should move beyond surface meaning and embrace implicit semantics as a central modeling objective. Text embeddings are a foundational component of modern NLP, underpinning a wide range of applications and driving sustained research progress. Despite rapid progress, most embedding models remain narrowly focused on surface-level semantics, whereas linguistic theory emphasizes that much of human meaning is implicit, shaped by pragmatics, speaker intent, and sociocultural context. Current models are typically trained on datasets that lack such depth and evaluated using benchmarks that reward surface similarity. As a result, they struggle with tasks that require interpretive reasoning, stance recognition, or socially grounded understanding.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据