A Multi-Probe Audit of Clinical-Interview Depression Detection Benchmarks

PRODUCT_LAUNCH · 2026-05-26 · Impact: MEDIUM

arXiv:2605.23977v1 (announce type: new)

Abstract: This paper audits benchmark evaluation in clinical-interview depression detection through four complementary probes across DAIC/E-DAIC, CMDC, ANDROIDS, MODMA, and PDCH. First, we re-evaluate E-DAIC under strict subject-disjoint leave-one-subject-out cross-validation. A lightweight hybrid text-plus-LLM-score model reaches macro-F1 = 0.723, the highest reported under this pro
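To make the evaluation protocol named in the abstract concrete, the following is a minimal sketch of subject-disjoint leave-one-subject-out cross-validation scored with macro-F1. It is not the paper's pipeline: the toy data, the one-dimensional feature, the `fit_threshold` classifier, and all function names are hypothetical, and pooling predictions across folds before scoring is an assumption about how the metric is aggregated.

```python
from collections import defaultdict


def loso_splits(subject_ids):
    """Yield (held_out_subject, train_idx, test_idx), one fold per subject.

    All samples of the held-out subject go to the test side, so no subject
    ever appears in both train and test (subject-disjoint).
    """
    by_subject = defaultdict(list)
    for i, s in enumerate(subject_ids):
        by_subject[s].append(i)
    for s in sorted(by_subject):
        test_idx = by_subject[s]
        held = set(test_idx)
        train_idx = [i for i in range(len(subject_ids)) if i not in held]
        yield s, train_idx, test_idx


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores."""
    classes = sorted(set(y_true) | set(y_pred))
    f1s = []
    for c in classes:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        denom = 2 * tp + fp + fn
        f1s.append(2 * tp / denom if denom else 0.0)
    return sum(f1s) / len(f1s)


def fit_threshold(xs, ys):
    """Hypothetical toy classifier: midpoint between the two class means."""
    pos = [x for x, y in zip(xs, ys) if y == 1]
    neg = [x for x, y in zip(xs, ys) if y == 0]
    return (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2


# Hypothetical toy data: two segments per subject, one scalar feature each.
subjects = ["s1", "s1", "s2", "s2", "s3", "s3", "s4", "s4"]
features = [0.9, 0.8, 0.2, 0.1, 0.7, 0.4, 0.3, 0.6]
labels = [1, 1, 0, 0, 1, 1, 0, 0]

y_true, y_pred, folds = [], [], []
for held_out, train_idx, test_idx in loso_splits(subjects):
    folds.append((held_out, train_idx, test_idx))
    # Fit only on the training subjects, then predict the held-out subject.
    thr = fit_threshold([features[i] for i in train_idx],
                        [labels[i] for i in train_idx])
    for i in test_idx:
        y_true.append(labels[i])
        y_pred.append(1 if features[i] > thr else 0)

# Score once on the pooled out-of-fold predictions (an assumed convention).
score = macro_f1(y_true, y_pred)
print(f"LOSO macro-F1 on toy data: {score:.3f}")
```

Pooling before scoring is used here because a single held-out subject usually carries only one class, which makes a per-fold macro-F1 ill-defined. If scikit-learn is available, `LeaveOneGroupOut` and `f1_score(..., average="macro")` cover the same two steps.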
