Abstract
This paper presents our approach to the first Multimodal Personality-Aware Depression Detection Challenge, focusing on multimodal depression detection using machine learning and deep learning models. We explore and compare the performance of XGBoost, transformer-based architectures, and large language models (LLMs) on audio, video, and text features. Our results highlight the strengths and limitations of each type of model in capturing depression-related signals across modalities, offering insights into effective multimodal representation strategies for mental health prediction.