Abstract
Promoted by Smart City, pedestrian detection under wide-angle surveillance has attracted much attention. Aiming to the small-size pedestrians have poor resolution and different degrees of distortion in visual picture from wide-angle field of view, a robust pedestrian detection algorithm based on parallel channel cascade network is proposed. The algorithm, an improved Faster R-CNN (Faster Region Convolutional Neural Networks), first obtains the differential graph and original graph to construct parallel input, and then introduces a new feature extraction network, which called the Channel Cascade Network (CCN), further designs parallel CCN for fusing more abundant image features. Finally, in Region Proposal Network, the size distribution of pedestrians in the picture is counted by clustering to best fit the pedestrian date sets. Compared with the standard Faster-RCNN and the FPN, the proposed algorithm is more conducive to the small-size pedestrian detection in the case of wide angle field distortion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zhang, Q.: Research on pedestrian detection methods on still images. University of Science and Technology of China (2015). (in Chinese)
Lin, T.Y., Dollar, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 936–944 (2016)
Wang, B.: Pedestrian detection based on deep learning. Beijing Jiaotong University (2015). (in Chinese)
Zhang, J., Xiao, J., Zhou, C., et.al: A multi-class pedestrian detection network for distorted pedestrians. In: 2018 13th IEEE Conference on Industrial Electronics and Applications (ICIEA), May, pp. 1079–1083 (2018)
He, J., Liu, K., Zhang, Y., et al: A channel-cascading pedestrian detection network for small-size pedestrians, pp. 325–338. Springer (2018)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004). IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Dalal, N., Triggs, B., et al.: Histograms of oriented gradients for human detection. In: Computer Vision and Pattern Recognition, pp. 886–893 (2005)
Zhu, Q., Yeh, M.C., Cheng, K.T., et al.: Fast human detection using a cascade of histograms of oriented gradients. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1491–1498 (2006)
Burges, C.J.: A tutorial on support vector machines for pattern recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 121–167 (1998)
Freund, Y., Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. In: European Conference on Computational Learning Theory, pp. 23–37. Springer, Berlin (1995)
Felzenszwalb, P.F., et al.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. 32, 1627–1645 (2010)
Wang, X.: An HOG-LBP human detector with partial occlusion handling. In: Proceedings of the IEEE International Conference on Computer Vision, Kyoto, Japan, September, pp. 32–39 (2009)
Kuo, W., Hariharan, B., Malik, J..: DeepBox: learning objectness with convolutional networks. In IEEE International Conference on Computer Vision, pp. 2479–2487 (2015)
Girshick, R., et al.: Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1904–1916 (2014)
He, K., Zhang, X., Ren, S., et al.: Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1904–1916 (2014)
Girshick, R.: Fast R-CNN. In: Computer Science, pp. 1440–1448 (2015)
Ren, S., Girshick, R., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. In: International Conference on Neural Information Processing Systems, pp. 1137–1149 (2015)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger, pp. 6517–6525 (2016)
Redmon, J., Farhadi, A.: YOLOv3: An Incremental Improvement. In: IEEE Conference on Computer Vision and Pattern Recognition. IEEE (2018). arXiv:1804.02767
Liu, W., Anguelov, D., Erhan, D., et al.: SSD: single shot MultiBox detector. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 21–37 (2015)
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2015)
Kong, T., Yao, A., Chen, Y., et al: HyperNet: towards accurate region proposal generation and joint object detection. In: Computer Vision and Pattern Recognition, pp. 845–853 (2016)
Cai, Z., et al.: A unified multi-scale deep convolutional neural network for fast object detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 354–370 (2016)
Hariharan, B., Arbelaez, P., Girshick, R., et al: Hypercolumns for object segmentation and fine-grained localization. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 447–456 (2014)
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39, 640–651 (2014)
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. In: Computer Science, pp. 730–734 (2014)
Acknowledgments
This work is supported in part by National Natural Science Foundation of China (N0. 61771270), in part by Natural Science Foundation of Zhejiang Province (No. LY9F0001, No. LQ15F020004, No. LY19F010006), and by Key research and development plan of Zhejiang province (2018C01086).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
He, J., Zhang, Y., Yao, T. (2020). Robust Pedestrian Detection Based on Parallel Channel Cascade Network. In: Arai, K., Kapoor, S. (eds) Advances in Computer Vision. CVC 2019. Advances in Intelligent Systems and Computing, vol 943. Springer, Cham. https://doi.org/10.1007/978-3-030-17795-9_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-17795-9_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-17794-2
Online ISBN: 978-3-030-17795-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)