跳到主要导航 跳到搜索 跳到主要内容

Counting crowds with perspective distortion correction via adaptive learning

  • Yixuan Sun
  • , Jian Jin*
  • , Xingjiao Wu
  • , Tianlong Ma
  • , Jing Yang
  • *此作品的通讯作者
  • East China Normal University

科研成果: 期刊稿件文章同行评审

摘要

The goal of crowd counting is to estimate the number of people in the image. Presently, use regression to count people number became a mainstream method. It is worth noting that, with the development of convolutional neural networks (CNN), methods that are based on CNN have become a research hotspot. It is a more interesting topic that how to locate the site of the person in the image than simply predicting the number of people in the image. The perspective transformation present is still a challenge, because perspective distortion will cause differences in the size of the crowd in the image. To devote perspective distortion and locate the site of the person more accuracy, we design a novel framework named Adaptive Learning Network (CAL). We use the VGG as the backbone. After each pooling layer is output, we collect the 1/2, 1/4, 1/8, and 1/16 features of the original image and combine them with the weights learned by an adaptive learning branch. The object of our adaptive learning branch is each image in the datasets. By combining the output features of different sizes of each image, the challenge of drastic changes in the size of the image crowd due to perspective transformation is reduced. We conducted experiments on four population counting data sets (i.e., ShanghaiTech Part A, ShanghaiTech Part B, UCF_CC_50 and UCF-QNRF), and the results show that our model has a good performance.

源语言英语
文章编号3781
页(从-至)1-17
页数17
期刊Sensors
20
13
DOI
出版状态已出版 - 1 7月 2020

指纹

探究 'Counting crowds with perspective distortion correction via adaptive learning' 的科研主题。它们共同构成独一无二的指纹。

引用此