Towards a robust R2D2 paradigm for radio-interferometric imaging: revisiting DNN training and architecture
Journal:
arXiv
Published Date:
Mar 4, 2025
Abstract
The R2D2 Deep Neural Network (DNN) series was recently introduced for image
formation in radio interferometry. It can be understood as a learned version of
CLEAN, whose minor cycles are substituted with DNNs. We revisit R2D2 on the
grounds of series convergence, training methodology, and DNN architecture,
improving its robustness in terms of generalisability beyond training
conditions, capability to deliver high data fidelity, and epistemic
uncertainty. Firstly, while still focusing on telescope-specific training, we
enhance the learning process by randomising Fourier sampling integration times,
incorporating multi-scan multi-noise configurations, and varying imaging
settings, including pixel resolution and visibility-weighting scheme. Secondly,
we introduce a convergence criterion whereby the reconstruction process stops
when the data residual is compatible with noise, rather than simply using all
available DNNs. This not only increases the reconstruction efficiency by
reducing its computational cost, but also refines training by pruning out the
data/image pairs for which optimal data fidelity is reached before training the
next DNN. Thirdly, we substitute R2D2's early U-Net DNN with a novel
architecture (U-WDSR) combining U-Net and WDSR, which leverages wide
activation, dense connections, weight normalisation, and low-rank convolution
to improve feature reuse and reconstruction precision. As previously, R2D2 was
trained for monochromatic intensity imaging with the Very Large Array (VLA) at
fixed $512 \times 512$ image size. Simulations on a wide range of inverse
problems and a case study on real data reveal that the new R2D2 model
consistently outperforms its earlier version in image reconstruction quality,
data fidelity, and epistemic uncertainty.