Bundesrecht: An Open Library and Corpus for German Statutory Reference Processing 文章

ArXiv CS.CL2026-06-01NEWSen作者: Harshil Darji, Martin Heckelmann, Christina Kratsch, Gerard de Melo

摘要

arXiv:2605.31338v1 Announce Type: new Abstract: Statutory references are central to legal language understanding, but are difficult to process automatically, as they appear in compact and variable surface forms, may combine multiple targets, use special abbreviations, and often point to lower-level units. Existing tools for German focus either on parsing references from legal documents or accessing statutory text once citations are explicit. This paper introduces bundesrecht, an open resource for German statutory reference processing, consisting of a software library and a structured corpus of German federal law. The library parses, normalizes, and resolves German statutory references, mapping raw citation strings to structured objects, expanding compact references into canonical forms, and linking them to statutory provisions. The accompanying dataset preserves the internal hierarchy of statutes from laws to fine-granular subclauses.

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据