FedMental: Evaluating Federated Learning for Mental Health Detection from Social Media Data 文章

ArXiv CS.CL2026-06-04NEWSen作者: Nuredin Ali Abdelkadir, Anjali Ratnam, Zeerak Talat, Stevie Chancellor

摘要

arXiv:2605.18936v2 Announce Type: replace-cross Abstract: Social media text data are often used to train Machine Learning (ML) models to identify users exhibiting high-risk mental health behaviors. However, sharing this sensitive data poses privacy risks and limits the growth of benchmark datasets. We comprehensively evaluate whether privacy-preserving ML techniques can enable safer data sharing while preserving performance. Specifically, we apply federated learning (FL) and Differentially Private FL for two widely-studied mental health prediction tasks: depression detection on X (Twitter) and suicide crisis detection on Reddit. We simulate realistic data-sharing scenarios by treating each user as a client in a non-IID setting, evaluating across different client fractions, aggregation strategies, and privacy budgets. While FL achieves comparable performance to centralized training (centralized F1 = 85.63; best FL model F1 = 83.