FSDM: An efficient video super-resolution method based on Frames-Shift Diffusion Model.

Journal: Neural networks : the official journal of the International Neural Network Society

Published Date: Apr 3, 2025

Abstract

Video super-resolution is a fundamental task aimed at enhancing video quality through intricate modeling techniques. Recent advancements in diffusion models have significantly enhanced image super-resolution processing capabilities. However, their integration into video super-resolution workflows remains constrained due to the computational complexity of temporal fusion modules, demanding more computational resources compared to their image counterparts. To address this challenge, we propose a novel approach: a Frames-Shift Diffusion Model based on the image diffusion models. Compared to directly training diffusion-based video super-resolution models, redesigning the diffusion process of image models without introducing complex temporal modules requires minimal training consumption. We incorporate temporal information into the image super-resolution diffusion model by using optical flow and perform multi-frame fusion. This model adapts the diffusion process to smoothly transition from image super-resolution to video super-resolution diffusion without additional weight parameters. As a result, the Frames-Shift Diffusion Model efficiently processes videos frame by frame while maintaining computational efficiency and achieving superior performance. It enhances perceptual quality and achieves comparable performance to other state-of-the-art diffusion-based VSR methods in PSNR and SSIM. This approach optimizes video super-resolution by simplifying the integration of temporal data, thus addressing key challenges in the field.
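
The abstract says temporal information enters the image diffusion model through optical flow and multi-frame fusion, but it does not say how the flow is estimated or how the fusion is wired into the diffusion process. The sketch below is therefore only a toy illustration of the general idea (align a neighbouring frame to the current one with a flow field, then blend the two), not the authors' method. The names `warp_backward` and `fuse`, the nearest-neighbour sampling, the border clamping, and the fixed blend `weight` are all assumptions made for this example.

```python
from typing import List, Tuple

Frame = List[List[float]]             # grayscale frame: rows of pixel values
Flow = List[List[Tuple[float, float]]]  # per-pixel (dx, dy) displacement


def warp_backward(prev: Frame, flow: Flow) -> Frame:
    """Align `prev` to the current frame (hypothetical helper).

    For each output pixel (x, y) we sample prev at (x + dx, y + dy),
    using nearest-neighbour rounding and clamping to the frame border.
    """
    h, w = len(prev), len(prev[0])
    out: Frame = []
    for y in range(h):
        row = []
        for x in range(w):
            dx, dy = flow[y][x]
            sx = min(max(int(round(x + dx)), 0), w - 1)
            sy = min(max(int(round(y + dy)), 0), h - 1)
            row.append(prev[sy][sx])
        out.append(row)
    return out


def fuse(current: Frame, aligned_prev: Frame, weight: float = 0.5) -> Frame:
    """Blend the current frame with the aligned neighbour (assumed fixed weight)."""
    return [
        [(1.0 - weight) * c + weight * p for c, p in zip(cur_row, prev_row)]
        for cur_row, prev_row in zip(current, aligned_prev)
    ]


# Toy usage: every pixel is displaced one column to the right.
prev_frame = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
flow_field = [[(1.0, 0.0)] * 3 for _ in range(2)]
current_frame = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]

aligned = warp_backward(prev_frame, flow_field)
fused = fuse(current_frame, aligned, weight=0.5)
```

In a real pipeline the flow would come from an optical-flow estimator and the warp would use sub-pixel (for example bilinear) sampling. The point here is only that alignment and fusion of this kind add no trainable weights, which is consistent with the abstract's claim of adapting an image model "without additional weight parameters".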

Authors

Shijie Yang
School of Mechanical Engineering, Liaoning Technical University, Fuxin, China.

Chao Chen
Department of Neonatology, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China.

Jie Liu
School of Bioscience and Bioengineering, South China University of Technology, Guangzhou, China.

Jie Tang
Department of Computer Science and Technology, Tsinghua University, Beijing, China. Electronic address: jietang@tsinghua.edu.cn.

Gangshan Wu
State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210023, Jiangsu, China; Department of Computer Science and Technology, Nanjing University, Nanjing, 210023, Jiangsu, China. Electronic address: gswu@nju.edu.cn.

Keywords

Algorithms; Diffusion; Humans; Image Processing, Computer-Assisted; Neural Networks, Computer; Video Recording

External Resources

PubMed ID (PMID): 40187080
