Abstract
Multiple Instance Learning (MIL) is a cornerstone approach in computationalpathology (CPath) for generating clinically meaningful slide-level embeddingsfrom gigapixel tissue images. However, MIL often struggles with small, weaklysupervised clinical datasets. In contrast to fields such as NLP andconventional computer vision, where transfer learning is widely used to addressdata scarcity, the transferability of MIL models remains poorly understood. Inthis study, we systematically evaluate the transfer learning capabilities ofpretrained MIL models by assessing 11 models across 21 pretraining tasks formorphological and molecular subtype prediction. Our results show thatpretrained MIL models, even when trained on different organs than the targettask, consistently outperform models trained from scratch. Moreover,pretraining on pancancer datasets enables strong generalization across organsand tasks, outperforming slide foundation models while using substantially lesspretraining data. These findings highlight the robust adaptability of MILmodels and demonstrate the benefits of leveraging transfer learning to boostperformance in CPath. Lastly, we provide a resource which standardizes theimplementation of MIL models and collection of pretrained model weights onpopular CPath tasks, available at https://github.com/mahmoodlab/MIL-Lab