eCream-MedCorpus A Large-Scale Corpus of Clinical Notes for Italian 文章

ArXiv CS.CL2026-07-03PAPERen作者: Tiziano Labruna, Guido Bertolini, Pietro Ferrazzi, Bernardo Magnini

详细信息

来源站点
ArXiv CS.CL
作者
Tiziano Labruna, Guido Bertolini, Pietro Ferrazzi, Bernardo Magnini
文章类型
PAPER
语言
en
发布日期
2026-07-03

别名

eCream-MedCorpus A Large-Scale Corpus of Clinical Notes for Italian

摘要

arXiv:2606.12569v2 Announce Type: replace Abstract: We present eCream-MedCorpus, a new and unique large-scale dataset of clinical notes produced in Emergency Departments of Italian hospitals. The corpus, in its current version, is composed of approximately 4 million clinical notes fully anonymized, covering diverse phases of patient care during the stay in the emergency department. In addition, a subset of about six thousand notes has been manually annotated by clinical experts through a structured Case Report Form (CRF) containing 132 items relevant for two patient situations in emergency departments, dyspnea and loss of consciousness. Items may assume numerical values (e.g., for blood saturation), categorical (e.g., for level of consciousness ), binary (e.g., for presence of traumas), and mixed value types.

相关事件

暂无数据

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据