Abstract
Cardiac diffusion tensor imaging (DTI) offers unique insights into cardiomyocyte arrangements, bridging the gap between microscopic and macroscopic cardiac function. However, its clinical utility is limited by technical challenges, including a low signal-to-noise ratio, aliasing artefacts, and the need for accurate quantitative fidelity. To address these limitations, we introduce RSFR (Reconstruction, Segmentation, Fusion & Refinement), a novel framework for cardiac diffusion-weighted image reconstruction. RSFR employs a coarse-to-fine strategy, leveraging zero-shot semantic priors via the Segment Anything Model and a robust Vision Mamba-based reconstruction backbone. Our framework integrates semantic features effectively to mitigate artefacts and enhance fidelity, achieving state-of-the-art reconstruction quality and accurate DT parameter estimation under high undersampling rates. Extensive experiments and ablation studies demonstrate the superior performance of RSFR compared to existing methods, highlighting its robustness, scalability, and potential for clinical translation in quantitative cardiac DTI.