Hybrid knowledge distillation from intermediate layers for efficient Single Image Super-Resolution

  • Jiao Xie
  • , Linrui Gong
  • , Shitong Shao
  • , Shaohui Lin
  • , Linkai Luo*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

Convolutional and Transformer models have achieved remarkable results for Single Image Super-Resolution (SISR). However, the tremendous memory and computation consumption of these models restricts their usage in resource-limited scenarios. Knowledge distillation, as an effective model compression technique, has received great research focus on the SISR task. In this paper, we propose a novel efficient SISR method via hybrid knowledge distillation from intermediate layers, termed HKDSR, which leverages the knowledge from frequency information into that RGB information. To accomplish this, we first pre-train the teacher with multiple intermediate upsampling layers to generate the intermediate SR outputs. We then construct two kinds of intermediate knowledge from the Frequency Similarity Matrix (FSM) and Adaptive Channel Fusion (ACF). FSM aims to mine the relationship of frequency similarity between the Ground-truth (GT) HR image, and the intermediate SR outputs of teacher and student by Discrete Wavelet Transformation. ACF merges the intermediate SR output of the teacher and GT HR image in a channel dimension to adaptively align the intermediate SR output of the student. Finally, we leverage the knowledge from FSM and ACF into reconstruction loss to effectively improve student performance. Extensive experiments demonstrate the effectiveness of HKDSR on different benchmark datasets and network architectures.

Original languageEnglish
Article number126592
JournalNeurocomputing
Volume554
DOIs
StatePublished - 14 Oct 2023

Keywords

  • Discrete wavelet transformation
  • Image super-resolution
  • Knowledge distillation
  • Model compression

Fingerprint

Dive into the research topics of 'Hybrid knowledge distillation from intermediate layers for efficient Single Image Super-Resolution'. Together they form a unique fingerprint.

Cite this