A Method for Learning Large-Scale Computational Construction Grammars from Semantically Annotated Corpora 文章

ArXiv CS.CL2026-05-27NEWSen作者: Paul Van Eecke, Katrien Beuls

摘要

arXiv:2603.12754v2 Announce Type: replace Abstract: We present a method for learning large-scale, broad-coverage construction grammars from corpora of language use. Starting from utterances annotated with constituency structure and semantic frames, the method facilitates the learning of human-interpretable computational construction grammars that capture the intricate relationship between syntactic structures and the semantic relations they express. The resulting grammars consist of networks of tens of thousands of constructions formalised within the Fluid Construction Grammar framework. Not only do these grammars support the frame-semantic analysis of open-domain text, they also house a trove of information about the syntactico-semantic usage patterns present in the data they were learnt from.