diffIRM: A Diffusion-Augmented Invariant Risk Minimization Framework for Spatiotemporal Prediction over Graphs
Journal:
arXiv
Published Date:
Dec 31, 2024
Abstract
Spatiotemporal prediction over graphs (STPG) is challenging, because
real-world data suffers from the Out-of-Distribution (OOD) generalization
problem, where test data follow different distributions from training ones. To
address this issue, Invariant Risk Minimization (IRM) has emerged as a
promising approach for learning invariant representations across different
environments. However, IRM and its variants are originally designed for
Euclidean data like images, and may not generalize well to graph-structure data
such as spatiotemporal graphs due to spatial correlations in graphs. To
overcome the challenge posed by graph-structure data, the existing graph OOD
methods adhere to the principles of invariance existence, or environment
diversity. However, there is little research that combines both principles in
the STPG problem. A combination of the two is crucial for efficiently
distinguishing between invariant features and spurious ones. In this study, we
fill in this research gap and propose a diffusion-augmented invariant risk
minimization (diffIRM) framework that combines these two principles for the
STPG problem. Our diffIRM contains two processes: i) data augmentation and ii)
invariant learning. In the data augmentation process, a causal mask generator
identifies causal features and a graph-based diffusion model acts as an
environment augmentor to generate augmented spatiotemporal graph data. In the
invariant learning process, an invariance penalty is designed using the
augmented data, and then serves as a regularizer for training the
spatiotemporal prediction model. The real-world experiment uses three human
mobility datasets, i.e. SafeGraph, PeMS04, and PeMS08. Our proposed diffIRM
outperforms baselines.