Scale-pyramid dynamic atrous convolution for pixel-level labeling

  • Zhiqiang Li
  • Jie Jiang
  • Xi Chen*
  • Min Zhang
  • Yong Wang
  • Qingli Li
  • Honggang Qi
  • Min Liu
  • Robert Laganière
  • *Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

To improve performance, most deep convolutional neural networks increase model capacity by adding convolutional layers or enlarging the filters. As a result, the computational cost grows in proportion to the model capacity. Dynamic convolution can alleviate this problem. For pixel-level labeling, however, existing pixel-level dynamic convolution methods have a smaller scanning area than ordinary convolution or image-level dynamic convolution and therefore cannot exploit fine contextual information. Consequently, pixel-level dynamic convolution is more sensitive to objects with large scale variation and to easily confused categories. In this paper, we propose scale-pyramid dynamic atrous convolution (SDAConv), which exploits multi-scale pixel-level features at finer granularity in order to efficiently increase model capacity, explore contextual information, capture detail information and alleviate the large-scale variation problem at the same time. Through kernel engineering (instead of network engineering), SDAConv dynamically arranges atrous filters in the individual convolutional kernels over different semantic areas at dense scales in the spatial dimension. Extensive experiments on three public benchmarks, Cityscapes, PASCAL VOC 2012 and ADE20K, in which the regular convolution in SOTA architectures is simply replaced with SDAConv, demonstrate the superior performance of SDAConv on pixel-level labeling tasks.
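
The abstract does not give the SDAConv formulation, so the following is only an illustrative sketch of the general idea it describes: several atrous (dilated) filters with different rates are evaluated, and their responses are blended with weights that differ from pixel to pixel. It is not the authors' implementation. The function names, the 3x3 kernel size, the softmax blending and the `logits` input (which in a real network would presumably be predicted from the features) are all assumptions made for illustration.

```python
import math


def atrous_conv2d(image, kernel, rate):
    """Zero-padded, same-size 3x3 atrous filtering of a 2D list.

    Uses the cross-correlation convention common in deep learning.
    `rate` is the dilation: taps are spaced `rate` pixels apart.
    """
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for ky in range(3):
                for kx in range(3):
                    yy = y + (ky - 1) * rate
                    xx = x + (kx - 1) * rate
                    # Taps that fall outside the image contribute zero.
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += kernel[ky][kx] * image[yy][xx]
            out[y][x] = acc
    return out


def softmax(scores):
    """Numerically stable softmax over a short list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pixelwise_multirate_conv(image, kernels, rates, logits):
    """Blend atrous responses per pixel (hypothetical helper).

    logits[y][x] holds one score per dilation rate. These scores are an
    assumed input here; the abstract does not say how they are produced.
    """
    responses = [atrous_conv2d(image, k, r) for k, r in zip(kernels, rates)]
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            weights = softmax(logits[y][x])
            out[y][x] = sum(wt * resp[y][x]
                            for wt, resp in zip(weights, responses))
    return out
```

In this sketch a pixel whose scores favor a large rate effectively sees a wider context, while a pixel whose scores favor a small rate keeps local detail, which is one plausible reading of how a per-pixel choice among dilation rates could address scale variation.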

Original language: English
Article number: 122695
Journal: Expert Systems with Applications
Volume: 241
State: Published - 1 May 2024

Keywords

  • DCNN
  • Deep learning
  • Dynamic convolution
  • Kernel engineering
  • Pixel-level labeling
