Schema-Grounded LLM Extraction for FHIR Patient Digital Twins 文章

ArXiv CS.CL2026-05-26NEWSen作者: Rafael Brens, Yuqiao Meng, Luoxi Tang, Zhaohan Xi

摘要

arXiv:2601.05847v2 Announce Type: replace Abstract: We revisit the problem of constructing interoperable patient digital twins from unstructured electronic health records (EHRs) and argue that the task is better cast not as a cascade of extraction modules but as constrained generation of a valid FHIR bundle. We introduce SG-LLM, a schema-grounded LLM extractor that (i) augments the prompt with candidate SNOMED-CT, RxNorm, and LOINC codes retrieved through a SapBERT index, (ii) decodes under a JSON Schema derived directly from FHIR R4 StructureDefinitions, and (iii) closes a validator-in-the-loop repair stage whose diagnostics are fed back as structured error messages. We argue that the twin's usefulness, not only span-level F1, is the right object of evaluation, and operationalize this with a clinical-utility experiment that measures the gap in 30-day readmission AUROC between classifiers trained on SG-LLM-generated FHIR bundles versus expert-curated ones.

相关事件查看全部 (1)

相关公司

暂无数据

相关人物

暂无数据