Abstract
Learning from multimodal datasets can leverage complementary information andimprove performance in prediction tasks. A commonly used strategy to accountfor feature correlations in high-dimensional datasets is the latent variableapproach. Several latent variable methods have been proposed for multimodaldatasets. However, these methods either focus on extracting the sharedcomponent across all modalities or on extracting both a shared component andindividual components specific to each modality. To address this gap, wepropose a Multi-Modal Fission Learning (MMFL) model that simultaneouslyidentifies globally joint, partially joint, and individual componentsunderlying the features of multimodal datasets. Unlike existing latent variablemethods, MMFL uses supervision from the response variable to identifypredictive latent components and has a natural extension for incorporatingincomplete multimodal data. Through simulation studies, we demonstrate thatMMFL outperforms various existing multimodal algorithms in both complete andincomplete modality settings. We applied MMFL to a real-world case study forearly prediction of Alzheimers Disease using multimodal neuroimaging andgenomics data from the Alzheimers Disease Neuroimaging Initiative (ADNI)dataset. MMFL provided more accurate predictions and better insights intowithin- and across-modality correlations compared to existing methods.