Abstract
The content-based analysis of visual multimedia like images and videos are urgently needed to empower human society for the automation of difficult tasks. Pedestrian detection serves as a backbone for a multitude of image processing and machine learning algorithms and secures quite a lot of real-world applications. Keeping this fact in mind, here, we deal with the fabrication of suitable features to identify human/pedestrian instances from images with near accuracy. Accordingly, we introduce second-order aggregate channel features (SOACF) to enhance the performance of much-celebrated pedestrian detection algorithm which was mainly based on the first-order information in an image—aggregate channel features detector (ACF detector). We experimentally proved the complementary nature of ACF and SOACF. Designed to garner both these features together, instead of simple concatenation, or direct merging of the two detectors, we employed a weighted non-maximum suppression merging algorithm. The prospective detector not only performed well on INRIA, Caltech and KITTI pedestrian data set but also, mitigate the miss rate by \(\sim 4\%\) in Caltech data set and \(\sim 2\%\) in KITTI data set in comparison with ACF detector. Despite the fact that our in-house generated detector uses only a few channels, it surpasses many state-of-the-art methods based on baseline ACF detector. Moreover, the detection speed is 100 times faster than the topmost pedestrian detector based on ACF.
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Kaur A, Dhir R, Lehal GS (2017) A survey on camera-captured scene text detection and extraction: towards gurmukhi script. Int J Multimed Inf Retr 6(2):115–142
Shirahama K, Grzegorzek M, Uehara K (2015) Weakly supervised detection of video events using hidden conditional random fields. Int J Multimed Inf Retr 4(1):17–32
Saadna Y, Behloul A (2017) An overview of traffic sign detection and classification methods. Int J Multimed Inf Retr 6(3):193–210
Sathish PK, Balaji S (2018) A complete person re-identification model using Kernel-pca-based Gabor-filtered hybrid descriptors. Int J Multimed Inf Retr 7(4):221–229
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR’05), vol 1, pp 886–893. IEEE
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Benenson R, Mathias M, Timofte R, Van Gool L (2012) Pedestrian detection at 100 frames per second. In: CVPR
Viola P, Jones MJ, Snow D (2005) Detecting pedestrian using patterns of motion and appearance. Int J Comput Vis 63(2):153–161
Zhang S, Benenson R, Omran M, Hosang J, Schiele B (2016) How far are we from solving pedestrian detection? In: CVPR
Dollár P, Appel R, Belongie S, Perona P (2014) Fast feature pyramids for object detection. IEEE Trans Pattern Anal Mach Intell 36(8):1532–1545
Nam W, Dollár P, Han JH (2014) Local decorrelation for improved pedestrian detection. In: Advances in neural information processing systems, pp 424–432
Yang B, Yan J, Lei Z, Li SZ (2015) Convolutional channel features. In: Proceedings of the IEEE international conference on computer vision, pp 82–90
Paisitkriangkrai S, Shen C, van den Hengel A (2014) Strengthening the effectiveness of pedestrian detection with spatially pooled features. In: European conference on computer vision, pp 546–561. Springer
Zhang S, Benenson R, Schiele B (2015) Filtered channel features for pedestrian detection. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR), pp 1751–1760. IEEE
Dollár P, Wojek C, Schiele B, Perona P (2012) Pedestrian detection: an evaluation of the state of the art. PAMI 34:743–761
Geiger A, Lenz P, Urtasun R (2012) Are we ready for autonomous driving? The kitti vision benchmark suite. In: Conference on computer vision and pattern recognition (CVPR)
Zhang S, Bauckhage C, Cremers AB (2014) Informed haar-like features improve pedestrian detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 947–954
Lim JJ, Zitnick CL, Dollr, P (2013) Sketch tokens: a learned mid-level representation for contour and object detection. In: 2013 IEEE conference on computer vision and pattern recognition, pp 3158–3165
Cao H, Yamaguchi K, Naito T, Ninomiya Y (2009) Pedestrian recognition using second-order hog feature. In: Asian conference on computer vision, pp 628–634. Springer
Jiang Y, Ma J (2015) Combination features and models for human detection. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition 07 June 2012, pp 240–248. 10.1109/CVPR.2015.7298620
Huang D, Zhu C, Wang Y, Chen L (2014) Hsog: a novel local image descriptor based on histograms of the second-order gradients. IEEE Trans Image Process 23(11):4680–4695
Benenson R, Mathias M, Tuytelaars T, Van Gool L (2013) Seeking the strongest rigid detector. In: CVPR
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Bastian, B.T., C.V., J. Pedestrian detection using first- and second-order aggregate channel features. Int J Multimed Info Retr 8, 127–133 (2019). https://doi.org/10.1007/s13735-019-00171-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13735-019-00171-0