详细信息
- 来源站点
- ArXiv CS.CL
- 作者
- Tiziano Labruna, Guido Bertolini, Pietro Ferrazzi, Bernardo Magnini
- 文章类型
- PAPER
- 语言
- en
- 发布日期
- 2026-07-03
别名
摘要
arXiv:2606.12569v2 Announce Type: replace Abstract: We present eCream-MedCorpus, a new and unique large-scale dataset of clinical notes produced in Emergency Departments of Italian hospitals. The corpus, in its current version, is composed of approximately 4 million clinical notes fully anonymized, covering diverse phases of patient care during the stay in the emergency department. In addition, a subset of about six thousand notes has been manually annotated by clinical experts through a structured Case Report Form (CRF) containing 132 items relevant for two patient situations in emergency departments, dyspnea and loss of consciousness. Items may assume numerical values (e.g., for blood saturation), categorical (e.g., for level of consciousness ), binary (e.g., for presence of traumas), and mixed value types.