跳到主要导航 跳到搜索 跳到主要内容

See more than once: Kernel-sharing atrous convolution for semantic segmentation

  • Ye Huang
  • , Qingqing Wang
  • , Wenjing Jia
  • , Yue Lu
  • , Yuxin Li
  • , Xiangjian He*
  • *此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

The state-of-the-art semantic segmentation solutions usually leverage different receptive fields via multiple parallel branches to handle objects of different sizes. However, employing separate kernels for individual branches may degrade the generalization of the network to objects with different scales, and the computational cost increases with the increase of the number of branches. To tackle this problem, we propose a novel network structure, namely Kernel-Sharing Atrous Convolution (KSAC), where branches with different receptive fields share the same kernel, i.e., let a single kernel ‘see’ the input feature maps more than once with different receptive fields. Experiments conducted on the benchmark PASCAL VOC 2012 dataset show that our proposed sharing strategy can not only boost the network's generalization and representation abilities but also reduce the computational cost significantly. Specifically, on the validation set, when compared with DeepLabv3+, about 2.7G FLOPs and 12.7G FLOPs are saved for output stride = 16 and 8 respectively. In addition, different from the widely used ASPP structure, our proposed KSAC is able to further improve the mIOU by taking benefit of wider context with larger atrous rates. Finally, our KSAC achieves mIOUs of 88.1%, 45.47% and 80.7% on the PASCAL VOC 2012 test set (Everingham et al., 2009), ADE20K dataset (Zhou et al., 2017) and Cityscapes datasets (Marius et al., 2016), respectively. Our full code will be released on Github: https://github.com/edwardyehuang/iSeg.

源语言英语
页(从-至)26-34
页数9
期刊Neurocomputing
443
DOI
出版状态已出版 - 5 7月 2021
已对外发布

指纹

探究 'See more than once: Kernel-sharing atrous convolution for semantic segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此