TY - GEN
T1 - When Handcrafted Filter Meets CNN
T2 - 14th Annual ACM International Conference on Multimedia Retrieval, ICMR 2024
AU - Wu, Zhijian
AU - Liu, Wenhui
AU - Huang, Dingjiang
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s).
PY - 2024/6/7
Y1 - 2024/6/7
N2 - Due to their powerful representational ability, convolutional neural networks (CNN) have achieved great success in image super-resolution (SR). In the trained SR models such as EDSR, we observe that partial convolutions exhibit analogous characteristics compared to handcrafted filters which avoid parameters with much less computational cost. This inspires us to substitute the handcrafted filters for the learnable convolutions in the SR models, such that the network complexity and the computational overhead are significantly reduced. In this study, we propose a novel lightweight SR network dubbed as Conv-Filter Mixer (CFM). Specifically, our CFM encapsulates three kinds of computations: learnable convolution, integrated filter unit (IFU), and identity mapping. Among them, IFU consists of diverse handcrafted filters to efficiently extract primitive representations in a non-parametric manner, making the limited parameterized components of lightweight networks focus on learning abstract and intricate features. To further improve efficiency, we introduce channel splitting and shuffling structures to mix the features produced by heterogeneous components efficiently. Extensive experiments demonstrate that our CFM achieves state-of-the-art performance with fewer parameters and computational costs.
AB - Due to their powerful representational ability, convolutional neural networks (CNN) have achieved great success in image super-resolution (SR). In the trained SR models such as EDSR, we observe that partial convolutions exhibit analogous characteristics compared to handcrafted filters which avoid parameters with much less computational cost. This inspires us to substitute the handcrafted filters for the learnable convolutions in the SR models, such that the network complexity and the computational overhead are significantly reduced. In this study, we propose a novel lightweight SR network dubbed as Conv-Filter Mixer (CFM). Specifically, our CFM encapsulates three kinds of computations: learnable convolution, integrated filter unit (IFU), and identity mapping. Among them, IFU consists of diverse handcrafted filters to efficiently extract primitive representations in a non-parametric manner, making the limited parameterized components of lightweight networks focus on learning abstract and intricate features. To further improve efficiency, we introduce channel splitting and shuffling structures to mix the features produced by heterogeneous components efficiently. Extensive experiments demonstrate that our CFM achieves state-of-the-art performance with fewer parameters and computational costs.
KW - Deep learning
KW - Handcrafted filter
KW - Image super-resolution
KW - Lightweight network
UR - https://www.scopus.com/pages/publications/85199137723
U2 - 10.1145/3652583.3658003
DO - 10.1145/3652583.3658003
M3 - 会议稿件
AN - SCOPUS:85199137723
T3 - ICMR 2024 - Proceedings of the 2024 International Conference on Multimedia Retrieval
SP - 722
EP - 730
BT - ICMR 2024-Proceedings of the 14th Annual ACM International Conference on Multimedia Retrieval
PB - Association for Computing Machinery, Inc
Y2 - 10 June 2024 through 14 June 2024
ER -