Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

WiseCam: A Systematic Approach to Intelligent Pan-Tilt Cameras for Moving Object Tracking

Published: 01 December 2024 Publication History

Abstract

With the desired functionality of moving object tracking, wireless pan-tilt cameras are able to play critical roles in a growing diversity of surveillance environments. However, today's pan-tilt cameras oftentimes underperform when tracking frequently moving objects like humans – they are prone to lose sight of objects and bring about excessive mechanical rotations that are especially detrimental to those energy-constrained outdoor scenarios. The ineffectiveness and high cost of all state-of-the-art tracking approaches are rooted in their adherence to the industry's simplicity principle, which leads to their stateless nature, performing gimbal rotations based only on the latest object detection. To address the issues, we design and implement WiseCam that wisely tunes the pan-tilt cameras to minimize mechanical rotation costs while maintaining long-term object tracking. This systematic tracking approach also tackles issues of motion-rotation speed gap and scattered moving objects, which is universally applicable to complex tracking scenarios. We examine the performance of WiseCam by experiments on two types of pan-tilt cameras with different motors. Results show that it significantly outperforms the state-of-the-art tracking approaches on both tracking duration and power consumption.

References

[1]
QY Research, “Global PTZ camera market research report 2024,” 2024. [Online]. Available: https://www.qyresearch.com/reports/2113600/ptz-camera
[2]
Wikipedia, “Stepper motor,” 2023. [Online]. Available: https://en.wikipedia.org/wiki/Stepper_motor
[3]
ISL Products, “Stepper motor fundamentals,” 2023. [Online]. Available: https://islproducts.com/design-note/servo-motor-fundamentals
[4]
K. Okumura, H. Oku, and M. Ishikawa, “High-speed gaze controller for millisecond-order pan/tilt camera,” in Proc. IEEE Int. Conf. Robot. Autom., 2011, pp. 6186–6191.
[5]
Z. Wu and R. J. Radke, “Keeping a pan-tilt-zoom camera calibrated,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, no. 8, pp. 1994–2007, Aug. 2013.
[6]
M. Rovai, “Automatic vision object tracking,” 2018. [Online]. Available: https://towardsdatascience.com/automatic-vision-object-tracking-347af1cc8a3b?gi=661a553688
[7]
Wikipedia, “PID controller,” 2023. [Online]. Available: https://en.wikipedia.org/wiki/PID_controller
[8]
ChangingMinds, “Simplicity principle,” 2023. [Online]. Available: http://changingminds.org/principles/simplicity.htm
[9]
J. E, L. He, Z. Li, and Y. Liu, “WiseCam: Wisely tuning wireless pan-tilt cameras for cost-effective moving object tracking,” in Proc. IEEE Conf. Comput. Commun., 2023, pp. 1660–1669.
[10]
ReoLink, “ReoLink go: Wire-free security goes anywhere with 4G LTE,” 2023. [Online]. Available: https://reolink.com/product/reolink-go
[11]
Eufy, “Pre-order SoloCam series,” 2022. [Online]. Available: https://us.eufylife.com/pages/solocam-preorder
[12]
A. J. Lipton, H. Fujiyoshi, and R. S. Patil, “Moving target classification and tracking from real-time video,” in Proc. IEEE 4th Workshop Appl. Comput. Vis., 1998, pp. 8–14.
[13]
M. Piccardi, “Background subtraction techniques: A review,” in Proc. IEEE Int. Conf. Syst. Man Cybern., 2004, pp. 3099–3104.
[14]
M. Xu, X. Zhang, Y. Liu, G. Huang, X. Liu, and F. X. Lin, “Approximate query service on autonomous IoT camera,” in Proc. 18th Int. Conf. Mobile Syst. Appl. Serv., 2020, pp. 191–205.
[15]
B. D. Lucas and T. Kanade, “An iterative image registration technique with an application to stereo vision,” in Proc. Imag. Understanding Workshop, 1981, pp. 121–130.
[16]
D. S. Bolme, J. R. Beveridge, B. A. Draper, and Y. M. Lui, “Visual object tracking using adaptive correlation filters,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., 2010, pp. 2544–2550.
[17]
N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., 2005, pp. 886–893.
[18]
S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Comput., vol. 9, no. 8, pp. 1735–1780, 1997.
[19]
Y. L. Cun, I. Kanter, and S. A. Solla, “Eigenvalues of covariance matrices: Application to neural-network learning,” Phys. Rev. Lett., vol. 66, no. 18, pp. 2396–2399, 1991.
[20]
C. M. Bishop, Pattern Recognition and Machine Learning. Berlin, Germany: Springer, 2006, p. 236.
[21]
M. Pirotta, M. Restelli, and L. Bascetta, “Adaptive step-size for policy gradient methods,” in Proc. 26th Int. Conf. Neural Inf. Process. Syst., 2013, pp. 1394–1402.
[22]
J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” 2017, arXiv: 1707.06347.
[23]
H. B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas, “Communication-efficient learning of deep networks from decentralized data,” in Proc. 20th Int. Conf. Artif. Intell. Statist., PMLR, pp. 1273–1282, 2017.
[24]
A. Bedagkar-Gala and S. K. Shah, “A survey of approaches and trends in person re-identification,” Image Vis. Comput., vol. 32, no. 4, pp. 270–286, 2014.
[25]
ISO/TC 12 Quantities and units, “ISO 80000-2:2019 Quantities and units–Part 2: Mathematics,” pp. 19–21, 2019. [Online]. Available: https://www.iso.org/standard/64973.html
[26]
ONVIF, “ONVIF Profile S. Specification v1.3,” 2019. [Online]. Available: https://www.onvif.org/wp-content/uploads/2019/12/ONVIF_Profile_-S_Specification_v1-3.pdf
[27]
Adafruit, “Adafruit 16-Channel 12-bit PWM/Servo driver - I2C interface - PCA9685,” 2021. [Online]. Available: https://www.adafruit.com/product/815
[28]
Raspberry Pi, “Raspberry pi 4: Your tiny, dual-display, desktop computer,” 2021. [Online]. Available: https://www.raspberrypi.org/products/raspberry-pi-4-model-b/
[29]
OpenCV, “OpenCV 4.5.0,” 2020. [Online]. Available: https://opencv.org/opencv-4–5-0/
[30]
Google, “TensorFlow: An end-to-end open source machine learning platform,” 2021. [Online]. Available: https://tensorflow.google.com/
[31]
S. M. LaValle, M. S. Branicky, and S. R. Lindemann, “On the relationship between classical grid search and probabilistic roadmaps,” Int. J. Robot. Res., vol. 23, no. 7/8, pp. 673–692, 2004.
[32]
D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” 2014, arXiv:1412.6980.
[33]
The Apache Software Foundation, “Apache MINA,” 2023. [Online]. Available: http://mina.apache.org/
[34]
J. Redmon and A. Farhadi, “YOLOv3: An incremental improvement,” 2018, arXiv: 1804.02767.
[35]
J. Jiang, G. Ananthanarayanan, P. Bodik, S. Sen, and I. Stoica, “Chameleon: Scalable adaptation of video analytics,” in Proc. Conf. ACM Special Int. Group Data Commun., 2018, pp. 253–266.
[36]
K. Hsieh et al., “Focus: Querying large video datasets with low latency and low cost,” in Proc. 13th USENIX Conf. Operating Syst. Des. Implementation, 2018, pp. 269–286.
[37]
Y. Li, A. Padmanabhan, P. Zhao, Y. Wang, G. H. Xu, and R. Netravali, “Reducto: On-camera filtering for resource-efficient real-time video analytics,” in Proc. Annu. Conf. ACM Special Int. Group Data Commun. Appl. Technol. Architectures Protoc. Comput. Commun., 2020, pp. 359–376.
[38]
L. M. Ni, Y. Liu, Y. C. Lau, and A. P. Patil, “LANDMARC: Indoor location sensing using active RFID,” in Proc. IEEE 1st Int. Conf. Pervasive Comput. Commun., 2003, pp. 1–9.
[39]
J. Redmon and A. Farhadi, “YOLO9000: Better, faster, stronger,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2017, pp. 6517–6525.
[40]
R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2014, pp. 580–587.
[41]
D. D. Doyle, A. L. Jennings, and J. T. Black, “Optical flow background estimation for real-time pan/tilt camera object tracking,” Measurement, vol. 48, pp. 195–207, 2014.
[42]
X. Chen, X. Wang, and J. Xuan, “Tracking multiple moving objects using unscented kalman filtering techniques,” 2018, arXiv: 1802.01235.
[43]
O. Cetintas, G. Brasó, and L. Leal-Taixé, “Unifying short and long-term tracking with graph hierarchies,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2023, pp. 22877–22887.
[44]
M. Huang, X. Li, J. Hu, H. Peng, and S. Lyu, “Tracking multiple deformable objects in egocentric videos,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2023, pp. 1461–1471.
[45]
Z. Cao, J. Li, D. Zhang, M. Zhou, and A. Abusorrah, “A multi-object tracking algorithm with center-based feature extraction and occlusion handling,” IEEE Trans. Intell. Transp. Syst., vol. 24, no. 4, pp. 4464–4473, Apr. 2023.
[46]
L. Zhang, H. Han, M. Zhou, Y. Al-Turki, and A. Abusorrah, “An improved discriminative model prediction approach to real-time tracking of objects with camera as sensors,” IEEE Sensors J., vol. 21, no. 15, pp. 17308–17317, Aug. 2021.
[47]
I. Ahmed, S. Din, G. Jeon, F. Piccialli, and G. Fortino, “Towards collaborative robotics in top view surveillance: A framework for multiple object tracking by detection using deep learning,” IEEE/CAA J. Automatica Sinica, vol. 8, no. 7, pp. 1253–1270, Jul. 2021.
[48]
Z. Pang, J. Li, P. Tokmakov, D. Chen, S. Zagoruyko, and Y.-X. Wang, “Standing between past and future: Spatio-temporal modeling for multi-camera 3D multi-object tracking,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2023, pp. 17928–17938.
[49]
E. Ristani and C. Tomasi, “Features for multi-target multi-camera tracking and re-identification,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2018, pp. 6036–6046.

Index Terms

  1. WiseCam: A Systematic Approach to Intelligent Pan-Tilt Cameras for Moving Object Tracking
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Please enable JavaScript to view thecomments powered by Disqus.

            Information & Contributors

            Information

            Published In

            cover image IEEE Transactions on Mobile Computing
            IEEE Transactions on Mobile Computing  Volume 23, Issue 12
            Dec. 2024
            4601 pages

            Publisher

            IEEE Educational Activities Department

            United States

            Publication History

            Published: 01 December 2024

            Qualifiers

            • Research-article

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • 0
              Total Citations
            • 0
              Total Downloads
            • Downloads (Last 12 months)0
            • Downloads (Last 6 weeks)0
            Reflects downloads up to 03 Mar 2025

            Other Metrics

            Citations

            View Options

            View options

            Figures

            Tables

            Media

            Share

            Share

            Share this Publication link

            Share on social media