
Multimodal Machine Learning in Mental Health: A Survey of Data, Algorithms, and Challenges

Source: arXiv
Abstract

Multimodal machine learning (MML) is rapidly reshaping the way mental-health disorders are detected, characterized, and longitudinally monitored. Whereas early studies relied on isolated data streams (such as speech, text, or wearable signals), recent research has converged on architectures that integrate heterogeneous modalities to capture the rich, complex signatures of psychiatric conditions. This survey provides the first comprehensive, clinically grounded synthesis of MML for mental health. We (i) catalog 26 public datasets spanning audio, visual, physiological, and text modalities; and (ii) systematically compare transformer-based, graph-based, and hybrid fusion strategies across 28 models, highlighting trends in representation learning and cross-modal alignment. Beyond summarizing current capabilities, we interrogate open challenges: data governance and privacy, demographic and intersectional fairness, evaluation explainability, and the complexity of mental health disorders in multimodal settings. By bridging methodological innovation with psychiatric utility, this survey aims to orient both ML researchers and mental-health practitioners toward the next generation of trustworthy, multimodal decision-support systems.

Zahraa Al Sahili, Ioannis Patras, Matthew Purver

Subjects: Current State and Development of Medicine; Neurology and Psychiatry; Computing and Computer Technology

Zahraa Al Sahili, Ioannis Patras, Matthew Purver. Multimodal Machine Learning in Mental Health: A Survey of Data, Algorithms, and Challenges [EB/OL]. (2025-06-24) [2025-07-09]. https://arxiv.org/abs/2407.16804.
