Generating high quality crowd density map based on perceptual loss

Zheyi Fan¹, Yixuan Zhu¹, Yu Song¹ & Zhiwen Liu¹


Abstract

High-quality crowd density maps preserve a large amount of spatial information about crowd distribution, which provides significant prior information for crowd behavior analysis and anomaly detection. Recent work on crowd density estimation has paid more attention to the accuracy of crowd counting and has largely ignored the quality of the estimated density map. In this paper, we therefore propose an end-to-end crowd density estimation network that generates high-quality crowd density maps. The original pixel-level Euclidean distance loss function in the Multi-column Convolutional Neural Network (MCNN) is replaced by a perceptual loss network. By optimizing a perceptual loss function, defined as the difference between high-level semantic features generated by a pre-trained network, high-quality density maps can be obtained. At the same time, the accuracy of crowd counting and the sensitivity to the external environment can be improved. Extensive experiments on challenging datasets show that the proposed method outperforms state-of-the-art methods in both crowd counting accuracy and density estimation quality.
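The contrast drawn in the abstract, between a pixel-level Euclidean loss and a perceptual loss computed on features from a fixed network, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The abstract does not name the pre-trained network or the layer used, so a small bank of frozen random filters followed by a ReLU stands in for it here. All function names, the `KERNELS` bank, and the toy 16x16 maps are hypothetical.

```python
import numpy as np


def conv2d_valid(image, kernel):
    """'Valid' 2-D cross-correlation of a single-channel image with one kernel."""
    kh, kw = kernel.shape
    out_h = image.shape[0] - kh + 1
    out_w = image.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(kh):
        for j in range(kw):
            out += kernel[i, j] * image[i:i + out_h, j:j + out_w]
    return out


def features(density_map, kernels):
    """Stand-in for the frozen pre-trained network: fixed filters plus ReLU."""
    return np.stack(
        [np.maximum(conv2d_valid(density_map, k), 0.0) for k in kernels]
    )


def pixel_loss(pred, target):
    """Pixel-level Euclidean (mean squared error) loss between two maps."""
    return float(np.mean((pred - target) ** 2))


def perceptual_loss(pred, target, kernels):
    """Mean squared difference between the feature maps of the two inputs."""
    diff = features(pred, kernels) - features(target, kernels)
    return float(np.mean(diff ** 2))


def crowd_count(density_map):
    """The estimated count is the sum over the density map."""
    return float(density_map.sum())


# Assumption: frozen random weights replace real pre-trained filters.
rng = np.random.default_rng(0)
KERNELS = rng.standard_normal((4, 3, 3))

# Toy ground truth with two heads, each contributing unit mass.
# Real density maps spread each head over a Gaussian blob instead.
target = np.zeros((16, 16))
target[4, 5] = 1.0
target[10, 11] = 1.0

# Toy prediction: same count, but one head is shifted by one pixel.
pred = np.zeros((16, 16))
pred[4, 6] = 1.0
pred[10, 11] = 1.0

print("pixel loss:     ", pixel_loss(pred, target))
print("perceptual loss:", perceptual_loss(pred, target, KERNELS))
print("counts:         ", crowd_count(pred), crowd_count(target))
```

In a training setting the feature extractor would be kept fixed and only the density estimation network would be updated, so the gradient of the perceptual loss flows through the features back to the predicted map.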

Funding

This study was funded by the National Natural Science Foundation of China (grant number: 61701029) and the Basic Research Foundation of Beijing Institute of Technology (grant number: 20170542008).

Author information

Authors and Affiliations

School of Information and Electronics, Beijing Institute of Technology, Beijing, China
Zheyi Fan, Yixuan Zhu, Yu Song & Zhiwen Liu

Corresponding author

Correspondence to Zheyi Fan.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Fan, Z., Zhu, Y., Song, Y. et al. Generating high quality crowd density map based on perceptual loss. Appl Intell 50, 1073–1085 (2020). https://doi.org/10.1007/s10489-019-01573-7

Published: 17 December 2019
Issue Date: April 2020
DOI: https://doi.org/10.1007/s10489-019-01573-7
