Abstract
Heatwaves, prolonged periods of extreme heat, have intensified in frequencyand severity due to climate change, posing substantial risks to public health,ecosystems, and infrastructure. Despite advancements in Machine Learning (ML)modeling, accurate heatwave forecasting at weather scales (1--15 days) remainschallenging due to the non-linear interactions between atmospheric drivers andthe rarity of these extreme events. Traditional models relying on heuristicfeature engineering often fail to generalize across diverse climates andcapture the complexities of heatwave dynamics. This study introduces theDistribution-Informed Graph Neural Network (DI-GNN), a novel framework thatintegrates principles from Extreme Value Theory (EVT) into the graph neuralnetwork architecture. DI-GNN incorporates Generalized Pareto Distribution(GPD)-derived descriptors into the feature space, adjacency matrix, and lossfunction to enhance its sensitivity to rare heatwave occurrences. Byprioritizing the tails of climatic distributions, DI-GNN addresses thelimitations of existing methods, particularly in imbalanced datasets wheretraditional metrics like accuracy are misleading. Empirical evaluations usingweather station data from British Columbia, Canada, demonstrate the superiorperformance of DI-GNN compared to baseline models. DI-GNN achieved significantimprovements in balanced accuracy, recall, and precision, with high AUC andaverage precision scores, reflecting its robustness in distinguishing heatwaveevents.