research-article

WiseCam: A Systematic Approach to Intelligent Pan-Tilt Cameras for Moving Object Tracking

Authors:

Yunhao LiuAuthors Info & Claims

IEEE Transactions on Mobile Computing, Volume 23, Issue 12

Pages 12330 - 12344

https://doi.org/10.1109/TMC.2024.3410645

Published: 01 December 2024 Publication History

Abstract

With the desired functionality of moving object tracking, wireless pan-tilt cameras are able to play critical roles in a growing diversity of surveillance environments. However, today's pan-tilt cameras oftentimes underperform when tracking frequently moving objects like humans – they are prone to lose sight of objects and bring about excessive mechanical rotations that are especially detrimental to those energy-constrained outdoor scenarios. The ineffectiveness and high cost of all state-of-the-art tracking approaches are rooted in their adherence to the industry's simplicity principle, which leads to their stateless nature, performing gimbal rotations based only on the latest object detection. To address the issues, we design and implement WiseCam that wisely tunes the pan-tilt cameras to minimize mechanical rotation costs while maintaining long-term object tracking. This systematic tracking approach also tackles issues of motion-rotation speed gap and scattered moving objects, which is universally applicable to complex tracking scenarios. We examine the performance of WiseCam by experiments on two types of pan-tilt cameras with different motors. Results show that it significantly outperforms the state-of-the-art tracking approaches on both tracking duration and power consumption.

References

[1]

QY Research, “Global PTZ camera market research report 2024,” 2024. [Online]. Available: https://www.qyresearch.com/reports/2113600/ptz-camera

[2]

Wikipedia, “Stepper motor,” 2023. [Online]. Available: https://en.wikipedia.org/wiki/Stepper_motor

[3]

ISL Products, “Stepper motor fundamentals,” 2023. [Online]. Available: https://islproducts.com/design-note/servo-motor-fundamentals

[4]

K. Okumura, H. Oku, and M. Ishikawa, “High-speed gaze controller for millisecond-order pan/tilt camera,” in Proc. IEEE Int. Conf. Robot. Autom., 2011, pp. 6186–6191.

[5]

Z. Wu and R. J. Radke, “Keeping a pan-tilt-zoom camera calibrated,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 35, no. 8, pp. 1994–2007, Aug. 2013.

Digital Library

[6]

M. Rovai, “Automatic vision object tracking,” 2018. [Online]. Available: https://towardsdatascience.com/automatic-vision-object-tracking-347af1cc8a3b?gi=661a553688

[7]

Wikipedia, “PID controller,” 2023. [Online]. Available: https://en.wikipedia.org/wiki/PID_controller

[8]

ChangingMinds, “Simplicity principle,” 2023. [Online]. Available: http://changingminds.org/principles/simplicity.htm

[9]

J. E, L. He, Z. Li, and Y. Liu, “WiseCam: Wisely tuning wireless pan-tilt cameras for cost-effective moving object tracking,” in Proc. IEEE Conf. Comput. Commun., 2023, pp. 1660–1669.

[10]

ReoLink, “ReoLink go: Wire-free security goes anywhere with 4G LTE,” 2023. [Online]. Available: https://reolink.com/product/reolink-go

[11]

Eufy, “Pre-order SoloCam series,” 2022. [Online]. Available: https://us.eufylife.com/pages/solocam-preorder

[12]

A. J. Lipton, H. Fujiyoshi, and R. S. Patil, “Moving target classification and tracking from real-time video,” in Proc. IEEE 4th Workshop Appl. Comput. Vis., 1998, pp. 8–14.

[13]

M. Piccardi, “Background subtraction techniques: A review,” in Proc. IEEE Int. Conf. Syst. Man Cybern., 2004, pp. 3099–3104.

[14]

M. Xu, X. Zhang, Y. Liu, G. Huang, X. Liu, and F. X. Lin, “Approximate query service on autonomous IoT camera,” in Proc. 18th Int. Conf. Mobile Syst. Appl. Serv., 2020, pp. 191–205.

Digital Library

[15]

B. D. Lucas and T. Kanade, “An iterative image registration technique with an application to stereo vision,” in Proc. Imag. Understanding Workshop, 1981, pp. 121–130.

[16]

D. S. Bolme, J. R. Beveridge, B. A. Draper, and Y. M. Lui, “Visual object tracking using adaptive correlation filters,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., 2010, pp. 2544–2550.

[17]

N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” in Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., 2005, pp. 886–893.

[18]

S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Comput., vol. 9, no. 8, pp. 1735–1780, 1997.

Digital Library

[19]

Y. L. Cun, I. Kanter, and S. A. Solla, “Eigenvalues of covariance matrices: Application to neural-network learning,” Phys. Rev. Lett., vol. 66, no. 18, pp. 2396–2399, 1991.

[20]

C. M. Bishop, Pattern Recognition and Machine Learning. Berlin, Germany: Springer, 2006, p. 236.

[21]

M. Pirotta, M. Restelli, and L. Bascetta, “Adaptive step-size for policy gradient methods,” in Proc. 26th Int. Conf. Neural Inf. Process. Syst., 2013, pp. 1394–1402.

[22]

J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” 2017, arXiv: 1707.06347.

[23]

H. B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas, “Communication-efficient learning of deep networks from decentralized data,” in Proc. 20th Int. Conf. Artif. Intell. Statist., PMLR, pp. 1273–1282, 2017.

[24]

A. Bedagkar-Gala and S. K. Shah, “A survey of approaches and trends in person re-identification,” Image Vis. Comput., vol. 32, no. 4, pp. 270–286, 2014.

Digital Library

[25]

ISO/TC 12 Quantities and units, “ISO 80000-2:2019 Quantities and units–Part 2: Mathematics,” pp. 19–21, 2019. [Online]. Available: https://www.iso.org/standard/64973.html

[26]

ONVIF, “ONVIF Profile S. Specification v1.3,” 2019. [Online]. Available: https://www.onvif.org/wp-content/uploads/2019/12/ONVIF_Profile_-S_Specification_v1-3.pdf

[27]

Adafruit, “Adafruit 16-Channel 12-bit PWM/Servo driver - I2C interface - PCA9685,” 2021. [Online]. Available: https://www.adafruit.com/product/815

[28]

Raspberry Pi, “Raspberry pi 4: Your tiny, dual-display, desktop computer,” 2021. [Online]. Available: https://www.raspberrypi.org/products/raspberry-pi-4-model-b/

[29]

OpenCV, “OpenCV 4.5.0,” 2020. [Online]. Available: https://opencv.org/opencv-4–5-0/

[30]

Google, “TensorFlow: An end-to-end open source machine learning platform,” 2021. [Online]. Available: https://tensorflow.google.com/

[31]

S. M. LaValle, M. S. Branicky, and S. R. Lindemann, “On the relationship between classical grid search and probabilistic roadmaps,” Int. J. Robot. Res., vol. 23, no. 7/8, pp. 673–692, 2004.

[32]

D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” 2014, arXiv:1412.6980.

[33]

The Apache Software Foundation, “Apache MINA,” 2023. [Online]. Available: http://mina.apache.org/

[34]

J. Redmon and A. Farhadi, “YOLOv3: An incremental improvement,” 2018, arXiv: 1804.02767.

[35]

J. Jiang, G. Ananthanarayanan, P. Bodik, S. Sen, and I. Stoica, “Chameleon: Scalable adaptation of video analytics,” in Proc. Conf. ACM Special Int. Group Data Commun., 2018, pp. 253–266.

[36]

K. Hsieh et al., “Focus: Querying large video datasets with low latency and low cost,” in Proc. 13th USENIX Conf. Operating Syst. Des. Implementation, 2018, pp. 269–286.

[37]

Y. Li, A. Padmanabhan, P. Zhao, Y. Wang, G. H. Xu, and R. Netravali, “Reducto: On-camera filtering for resource-efficient real-time video analytics,” in Proc. Annu. Conf. ACM Special Int. Group Data Commun. Appl. Technol. Architectures Protoc. Comput. Commun., 2020, pp. 359–376.

[38]

L. M. Ni, Y. Liu, Y. C. Lau, and A. P. Patil, “LANDMARC: Indoor location sensing using active RFID,” in Proc. IEEE 1st Int. Conf. Pervasive Comput. Commun., 2003, pp. 1–9.

[39]

J. Redmon and A. Farhadi, “YOLO9000: Better, faster, stronger,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2017, pp. 6517–6525.

[40]

R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit., 2014, pp. 580–587.

[41]

D. D. Doyle, A. L. Jennings, and J. T. Black, “Optical flow background estimation for real-time pan/tilt camera object tracking,” Measurement, vol. 48, pp. 195–207, 2014.

[42]

X. Chen, X. Wang, and J. Xuan, “Tracking multiple moving objects using unscented kalman filtering techniques,” 2018, arXiv: 1802.01235.

[43]

O. Cetintas, G. Brasó, and L. Leal-Taixé, “Unifying short and long-term tracking with graph hierarchies,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2023, pp. 22877–22887.

[44]

M. Huang, X. Li, J. Hu, H. Peng, and S. Lyu, “Tracking multiple deformable objects in egocentric videos,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2023, pp. 1461–1471.

[45]

Z. Cao, J. Li, D. Zhang, M. Zhou, and A. Abusorrah, “A multi-object tracking algorithm with center-based feature extraction and occlusion handling,” IEEE Trans. Intell. Transp. Syst., vol. 24, no. 4, pp. 4464–4473, Apr. 2023.

Digital Library

[46]

L. Zhang, H. Han, M. Zhou, Y. Al-Turki, and A. Abusorrah, “An improved discriminative model prediction approach to real-time tracking of objects with camera as sensors,” IEEE Sensors J., vol. 21, no. 15, pp. 17308–17317, Aug. 2021.

[47]

I. Ahmed, S. Din, G. Jeon, F. Piccialli, and G. Fortino, “Towards collaborative robotics in top view surveillance: A framework for multiple object tracking by detection using deep learning,” IEEE/CAA J. Automatica Sinica, vol. 8, no. 7, pp. 1253–1270, Jul. 2021.

[48]

Z. Pang, J. Li, P. Tokmakov, D. Chen, S. Zagoruyko, and Y.-X. Wang, “Standing between past and future: Spatio-temporal modeling for multi-camera 3D multi-object tracking,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2023, pp. 17928–17938.

[49]

E. Ristani and C. Tomasi, “Features for multi-target multi-camera tracking and re-identification,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit., 2018, pp. 6036–6046.

Index Terms

WiseCam: A Systematic Approach to Intelligent Pan-Tilt Cameras for Moving Object Tracking
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision

Index terms have been assigned to the content through auto-classification.

Recommendations

Calibrating Pan-Tilt Cameras with Telephoto Lenses
Computer Vision – ACCV 2007
Abstract
Pan-tilt cameras are widely used in surveillance networks. These cameras are often equipped with telephoto lenses to capture objects at a distance. Such a camera makes full-metric calibration more difficult since the projection with a telephoto ...
Calibrating pan-tilt cameras with telephoto lenses
ACCV'07: Proceedings of the 8th Asian conference on Computer vision - Volume Part I

Pan-tilt cameras are widely used in surveillance networks. These cameras are often equipped with telephoto lenses to capture objects at a distance. Such a camera makes full-metric calibration more difficult since the projection with a telephoto lens is ...
Cooperative object tracking using dual‐pan–tilt–zoom cameras based on planar ground assumption

Pan–tilt–zoom (PTZ) cameras play an important role in visual surveillance system. Dual‐PTZ camera system is the simplest and most typical one. The superiority of this system lies in that it can obtain both large‐view information and high‐resolution local‐...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image IEEE Transactions on Mobile Computing

IEEE Transactions on Mobile Computing Volume 23, Issue 12

Dec. 2024

4601 pages

Issue’s Table of Contents

1536-1233 © 2024 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information.

Publisher

IEEE Educational Activities Department

United States

Publication History

Published: 01 December 2024

Qualifiers

Research-article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 03 Mar 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

Figures

Tables

Media

View Issue’s Table of Contents