
Scale-aware Progressive Optimization Network

Authors: Ying Chen, Lifeng Huang, Chengying Gao, Ning Liu

MM '20: Proceedings of the 28th ACM International Conference on Multimedia

Pages 2211 - 2219

https://doi.org/10.1145/3394171.3413698

Published: 12 October 2020

Abstract

Crowd counting has attracted increasing attention due to its wide range of applications. One of the most essential challenges in this domain is large scale variation, which degrades the accuracy of density estimation. To this end, we propose a scale-aware progressive optimization network (SPO-Net) for crowd counting, which trains a scale-adaptive network to achieve high-quality density map estimation and overcome the variable-scale dilemma in highly congested scenes. Concretely, the first phase of SPO-Net, the band-pass stage, mainly concentrates on preprocessing the input image and fusing both high-level semantic information and low-level spatial information from separated multi-layer features. The second phase of SPO-Net, the rolling guidance stage, aims to learn a scale-adapted network using multi-scale features and a rolling training manner. To better learn the local correlation of multi-size regions and reduce redundant calculations, we introduce a progressive optimization strategy. Extensive experiments on three challenging crowd counting datasets not only demonstrate the efficacy of each part of SPO-Net, but also suggest the superiority of our proposed method over state-of-the-art approaches.
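The abstract treats crowd counting as density map estimation. The following is a minimal sketch of that general formulation, not of SPO-Net itself: each annotated head is stamped onto the map as a unit-mass Gaussian, so the sum of the map equals the crowd count. The function names, the fixed kernel size, and the fixed `sigma` are assumptions made for illustration; handling scale variation would typically mean letting the kernel width vary with head size.

```python
import math

def gaussian_kernel(size, sigma):
    """Return a size x size Gaussian kernel normalized to sum to 1."""
    half = size // 2
    kernel = [[math.exp(-((x - half) ** 2 + (y - half) ** 2) / (2 * sigma ** 2))
               for x in range(size)] for y in range(size)]
    total = sum(sum(row) for row in kernel)
    return [[v / total for v in row] for row in kernel]

def density_map(height, width, heads, size=5, sigma=1.0):
    """Build a density map by stamping one unit-mass kernel per head.

    Kernel mass that would fall outside the image is renormalized,
    so each annotated head still contributes exactly 1 to the sum.
    Head positions are (row, col) and must lie inside the image.
    """
    kernel = gaussian_kernel(size, sigma)
    half = size // 2
    dmap = [[0.0] * width for _ in range(height)]
    for (row, col) in heads:
        cells = []
        for ky in range(size):
            for kx in range(size):
                y, x = row + ky - half, col + kx - half
                if 0 <= y < height and 0 <= x < width:
                    cells.append((y, x, kernel[ky][kx]))
        mass = sum(w for _, _, w in cells)
        for y, x, w in cells:
            dmap[y][x] += w / mass
    return dmap

def count(dmap):
    """The estimated crowd count is the integral (sum) of the density map."""
    return sum(sum(row) for row in dmap)

# Hypothetical annotations: three heads in a 10 x 12 image.
heads = [(0, 0), (4, 7), (9, 9)]
dmap = density_map(10, 12, heads)
estimated = count(dmap)
```

In this sketch `estimated` matches the number of annotated heads up to floating-point error, which is the property a density-regression network is trained to reproduce.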

Supplementary Material

ZIP File (mmfp1625aux.zip), 1.31 MB

In this supplementary document, we provide more details and comparisons about ablation experiments.

MP4 File (3394171.3413698.mp4), 27.24 MB

We propose a scale-aware progressive optimization network (SPO-Net), which focuses on achieving high-quality density map regression for crowd counting. In response to scale variation, the biggest challenge in crowd counting, we propose a rolling structure and a progressive optimization strategy. The rolling structure extracts rich multi-scale features to learn a scale-adapted network, and the progressive optimization strategy helps the network learn efficiently. Our rolling structure is applied only in the training phase, so no additional parameters or computation are added in the testing phase, unlike previously proposed crowd counting methods. Our method has been tested on three challenging crowd counting datasets (ShanghaiTech, UCF_CC_50 and UCF-QNRF), and extensive experiments demonstrate the efficacy of each part of SPO-Net.
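The text above does not spell out how the progressive optimization strategy works, so the following is only an illustrative sketch of one way to supervise regions of several sizes. It is an assumption, not the authors' loss: predicted and ground-truth density maps are compared by their region counts on grids that go from coarse to fine. The function names and the grid sizes `(1, 2, 4)` are hypothetical.

```python
def region_counts(dmap, cells):
    """Sum a density map over a cells x cells grid of regions."""
    h, w = len(dmap), len(dmap[0])
    counts = [[0.0] * cells for _ in range(cells)]
    for y in range(h):
        for x in range(w):
            counts[y * cells // h][x * cells // w] += dmap[y][x]
    return counts

def region_loss(pred, target, cells):
    """Mean absolute error between region counts at one grid size."""
    p = region_counts(pred, cells)
    t = region_counts(target, cells)
    total = sum(abs(p[i][j] - t[i][j])
                for i in range(cells) for j in range(cells))
    return total / (cells * cells)

def progressive_losses(pred, target, grid_sizes=(1, 2, 4)):
    """Evaluate the region loss from the coarsest grid to the finest."""
    return [region_loss(pred, target, c) for c in grid_sizes]

# Toy 4 x 4 maps: one person, placed in opposite corners.
target = [[0.0] * 4 for _ in range(4)]
pred = [[0.0] * 4 for _ in range(4)]
target[0][0] = 1.0
pred[3][3] = 1.0

losses = progressive_losses(pred, target)
```

In this toy case the whole-image count is correct, so the coarsest loss is zero, while the finer grids expose that the person was placed in the wrong region. That is the kind of local information a multi-size region objective can add on top of a global count.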
