Medico 2025: Visual Question Answering for Gastrointestinal Imaging

Abstract

The Medico 2025 challenge addresses Visual Question Answering (VQA) forGastrointestinal (GI) imaging, organized as part of the MediaEval task series.The challenge focuses on developing Explainable Artificial Intelligence (XAI)models that answer clinically relevant questions based on GI endoscopy imageswhile providing interpretable justifications aligned with medical reasoning. Itintroduces two subtasks: (1) answering diverse types of visual questions usingthe Kvasir-VQA-x1 dataset, and (2) generating multimodal explanations tosupport clinical decision-making. The Kvasir-VQA-x1 dataset, created from 6,500images and 159,549 complex question-answer (QA) pairs, serves as the benchmarkfor the challenge. By combining quantitative performance metrics andexpert-reviewed explainability assessments, this task aims to advancetrustworthy Artificial Intelligence (AI) in medical image analysis.Instructions, data access, and an updated guide for participation are availablein the official competition repository:https://github.com/simula/MediaEval-Medico-2025

Quick Read (beta)

loading the full paper ...