Exploring Machine Learning and Language Models for Multimodal Depression Detection

  • 2025-08-28 14:07:07
  • Javier Si Zhao Hong, Timothy Zoe Delaya, Sherwyn Chan Yin Kit, Pai Chet Ng, Xiaoxiao Miao
  • 0

Abstract

This paper presents our approach to the first Multimodal Personality-AwareDepression Detection Challenge, focusing on multimodal depression detectionusing machine learning and deep learning models. We explore and compare theperformance of XGBoost, transformer-based architectures, and large languagemodels (LLMs) on audio, video, and text features. Our results highlight thestrengths and limitations of each type of model in capturing depression-relatedsignals across modalities, offering insights into effective multimodalrepresentation strategies for mental health prediction.

 

Quick Read (beta)

loading the full paper ...