Distilling Knowledge for Designing Computational Imaging Systems
Journal:
arXiv
Published Date:
Jan 29, 2025
Abstract
Designing the physical encoder is crucial for accurate image reconstruction
in computational imaging (CI) systems. Currently, these systems are designed
via end-to-end (E2E) optimization, where the encoder is modeled as a neural
network layer and is jointly optimized with the decoder. However, the
performance of E2E optimization is significantly reduced by the physical
constraints imposed on the encoder. Also, since the E2E learns the parameters
of the encoder by backpropagating the reconstruction error, it does not promote
optimal intermediate outputs and suffers from gradient vanishing. To address
these limitations, we reinterpret the concept of knowledge distillation (KD)
for designing a physically constrained CI system by transferring the knowledge
of a pretrained, less-constrained CI system. Our approach involves three steps:
(1) Given the original CI system (student), a teacher system is created by
relaxing the constraints on the student's encoder. (2) The teacher is optimized
to solve a less-constrained version of the student's problem. (3) The teacher
guides the training of the student through two proposed knowledge transfer
functions, targeting both the encoder and the decoder feature space. The
proposed method can be employed to any imaging modality since the relaxation
scheme and the loss functions can be adapted according to the physical
acquisition and the employed decoder. This approach was validated on three
representative CI modalities: magnetic resonance, single-pixel, and compressive
spectral imaging. Simulations show that a teacher system with an encoder that
has a structure similar to that of the student encoder provides effective
guidance. Our approach achieves significantly improved reconstruction
performance and encoder design, outperforming both E2E optimization and
traditional non-data-driven encoder designs.