Abstract
This paper introduces a system capable of real-time video surveillance in low-end edge computing environment by combining object detection tracking algorithm. Recently, the accuracy of object detection has been improved due to the performance of approaches based on deep learning algorithm such as region-based convolutional network, which has two stages for inferencing. One-stage detection algorithms such as single shot detector and you only look once (YOLO) have been developed at the expense of some accuracy and can be used for real-time systems. However, high-performance hardware such as general-purpose graphics processing unit is required to achieve excellent object detection performance and speed. In this study, we propose an approach called N-YOLO which is instead of resizing image step in YOLO algorithm, it divides into fixed size images used in YOLO and merges detection results of each divided sub-image with inference results at different times using correlation-based tracking algorithm the amount of computation for object detection and tracking can be significantly reduced. In addition, we propose a system that can guarantee real-time performance in various edge computing environments by adaptively controlling the cycle of object detection and tracking.
Similar content being viewed by others
References
Al-masni MA, Al-antari MA, Park J-M et al (2018) Simultaneous detection and classification of breast masses in digital mammograms via a deep learning YOLO-based CAD system. Comput Methods Prog Biomed 157:85–94. https://doi.org/10.1016/j.cmpb.2018.01.017
Cao X, Guo S, Lin J, Zhang W, Liao M (2020) Online tracking of ants based on deep association metrics: method, dataset and evaluation. Pattern Recogn 103:107233. https://doi.org/10.1016/j.patcog.2020.107233
Danelljan M, Häger G, Khan FS, Felsberg M (2014) Accurate scale estimation for robust visual tracking. In: BMVC 2014 - proceedings of the British machine vision conference 2014. British Machine Vision Association, BMVA, pp. 1–11
Ding Z, Wong E (2019) Confidence trigger detection: an approach to build real-time tracking-by-detection system. ArXiv 1902(00615):1–9
Durand T Tracking things in object detection videos. In: Move Lab https://www.move-lab.com/blog/tracking-things-in-object-detection-videos. Accessed 27 Jun 2020
Fu H, Wu L, JianM YY, Wang X (2019) MF-SORT: simple online and Realtime tracking with motion features. In: Zhao Y, Barnes N, Chen B et al (eds) Image and graphics. Springer International Publishing, Cham, pp 157–168
Geng Y, Liang R-Z, Li W et al (2017) Learning convolutional neural network to maximize Pos@top performance measure. Bruges, Belgium, pp 589–594
Girshick R (2015) Fast R-CNN. In: The IEEE international conference on computer vision (ICCV). IEEE Computer Society, Boston, pp. 1440–1448
Huang C-H, Boyer E, Do B et al (2015) Toward user-specific tracking by detection of human shapes in multi-cameras. In: The IEEE conference on computer vision and pattern recognition (CVPR). IEEE Computer Society, Boston, pp 4027–4035
Le N, Heili A, Odobez JM (2016) Long-term time-sensitive costs for CRF-based tracking by detection. In: lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). Springer Verlag, pp 43–51
Lin TY, Maire M, Belongie S et al (2014) Microsoft COCO: common objects in context. In: lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). Springer Verlag, pp 740–755
Nikouei SY, Chen Y, Song S, et al (2018) Real-time human detection as an edge service enabled by a lightweight CNN. In: proceedings - 2018 IEEE international conference on EDGE computing, EDGE 2018 - part of the 2018 IEEE world congress on services. Institute of Electrical and Electronics Engineers Inc., pp 125–129
Pinho RR, Tavares JMR, Correia MV (2007) An improved management model for tracking missing features in computer vision long image sequences. WSEAS Trans Inf Sci Appl 1:196–203
Pinho RR, Tavares JMRS (2009) Tracking features in image sequences with Kalman filtering, global optimization, mahalanobis distance and a management model. Computer Modeling in Engineering & Sciences 46:51–75
Pinho RR, Tavares JMR (2005) Correia MV (2005) a movement tracking management model with Kalman filtering, global optimization techniques and mahalanobis distance. Advances in Computational Methods in Sciences and Engineering 1:1–4
Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: The IEEE conference on computer vision and pattern recognition (CVPR). IEEE Computer Society, Hawaii, pp 7263–7271
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: The IEEE conference on computer vision and pattern recognition (CVPR). IEEE Computer Society, Las Vegas, pp. 779–788
Tavares J, Padilha A (1995)Matching lines in image sequences with geometric constraints. Portugal, pp 1–3
Wang S, Niu L, Li N (2018) Research on image recognition of insulators based on YOLO algorithm. In: 2018 international conference on power system technology (POWERCON). Pp 3871–3874
Yuan Y, Xiong Z, Wang Q (2017) An incremental framework for video-based traffic sign detection, tracking, and recognition. IEEE Trans Intell Transp Syst 18:1918–1929. https://doi.org/10.1109/TITS.2016.2614548
Zhang G, Liang G, Su F, Qu F, Wang JY (2018) Cross-domain attribute representation based on convolutional neural network. In: Huang D-S, GromihaMM, Han K, Hussain A (eds) Intelligent computing methodologies. Springer International Publishing, Cham, pp 134–142
Zhang B, Huang Z, Rahi BH, Wang Q, Li M (2019) Online semi-supervised multi-person tracking with gaussian process regression. MATEC Web Conf 277:01003. https://doi.org/10.1051/matecconf/201927701003
Zhang G, Liang G, LiW FJ, Wang J, Geng Y, Wang JY (2017) Learning convolutional ranking-score function by query preference regularization. In: Yin H, Gao Y, Chen S et al (eds) Intelligent data engineering and automated learning – IDEAL 2017. Springer International Publishing, Cham, pp 1–8
Zhu J, Zhang S, Yang J (2019) Online multi-object tracking using single object tracker and Markov clustering. In: Zhao Y, Barnes N, Chen B et al (eds) Image and graphics. Springer International Publishing, Cham, pp 555–567
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Jha, S., Seo, C., Yang, E. et al. Real time object detection and trackingsystem for video surveillance system. Multimed Tools Appl 80, 3981–3996 (2021). https://doi.org/10.1007/s11042-020-09749-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-09749-x