Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2425836.2425912acmotherconferencesArticle/Chapter ViewAbstractPublication PagesivcnzConference Proceedingsconference-collections
poster

3D object tracking with a high-resolution GPU based real-time stereo

Published: 26 November 2012 Publication History

Abstract

Stereo correspondence algorithms, which are fast enough for real-time use, require hardware assistance and inevitably trade some matching accuracy for speed. A cloud of 3D points thus produced by our previously reported GPU accelerated implementation of a dynamic programming correspondence algorithm is noisy and contains artifacts, which hinder tracking accuracy. We have augmented this implementation with modules for re-projection and filtering. A fast clustering procedure based upon a set of simple volume rules identifies candidate objects. An opportunistic tagging system tracks objects through occlusions. Kalman filtering predicts positions in the next frame. These steps reduce the effects of dynamic programming streaks in the depth maps. Experiments with synthetic and real-world video sequences confirmed the accuracy in tracking multiple objects (e.g. humans) in various environments.

References

[1]
S. Asano, T. Maruyama, and Y. Yamaguchi. Performance comparison of FPGA, GPU and CPU in image processing. In Proc. Int. Conf. on Field Programmable Logic and Applications, pages 126--131, 2009.
[2]
L. Cai, L. He, Y. Xu, Y. Zhao, and X. Yang. Multi-object detection and tracking by stereo vision. Pattern Recognition, 43(12): 4028--4041, 2010.
[3]
T. Darrell, G. Gordon, M. Harville, and J. Woodfill. Integrated person tracking using stereo, color, and pattern detection. International Journal of Computer Vision, 37(2): 175--185, 2000.
[4]
G. Gimel'farb. Probabilistic regularisation and symmetry in binocular dynamic programming stereo. Pattern Recognition Letters, 23(4): 431--442, 2002.
[5]
M. L. Gong and Y. H. Yang. Real-time stereo matching using orthogonal reliability-based dynamic programming. IEEE Trans. on Image Processing, 16(3): 879--884, 2007.
[6]
M. Harville. Stereo person tracking with adaptive plan-view templates of height and occupancy statistics. Image and Vision Computing, 22(2): 127--142, 2004.
[7]
M. Himmelsbach, A. Müller, T. Lüttel, and H. Wünsche. Lidar-based 3d object perception. In Proceedings of 1st International Workshop on Cognition for Technical Systems, 2008.
[8]
R. Kalarot and J. Morris. Comparison of FPGA and GPU implementations of real-time stereo vision. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 IEEE Computer Society Conference on, pages 9--15. IEEE, 2010.
[9]
R. Kalarot and J. Morris. Implementation of symmetric dynamic programming stereo matching algorithm using CUDA. In Proc. 16th Korea-Japan Joint Workshop on Frontiers of Computer Vision. FCV, 2010.
[10]
R. Kalarot, J. Morris, D. Berry, and J. Dunning. Analysis of real-time stereo vision algorithms on GPU. In International Conference Image and Vision Computing New Zealand (IVCNZ), pages 179--184, 2011.
[11]
R. Kalarot, J. Morris, and G. Gimel'farb. Performance analysis of multi-resolution symmetric dynamic programming stereo on GPU. In 25th International Conference Image and Vision Computing New Zealand (IVCNZ), pages 1--7. IEEE, 2010.
[12]
K. Khoshelham and S. Elberink. Accuracy and resolution of kinect depth data for indoor mapping applications. Sensors, 12(2): 1437--1454, 2012.
[13]
K. Lee, B. Kalyan, S. Wijesoma, M. Adams, F. Hover, and N. Patrikalakis. Tracking random finite objects using 3d-lidar in marine environments. In Proceedings of the 2010 ACM Symposium on Applied Computing, pages 1282--1287. ACM, 2010.
[14]
V. Lepetit and P. Fua. Monocular model-based 3D tracking of rigid objects, volume 1. Now Publishers Inc., Hanover, MA, USA, 2005.
[15]
R. Muñoz-Salinas, M. García-Silvente, and R. Medina Carnicer. Adaptive multi-modal stereo people tracking without background modelling. Journal of Visual Communication and Image Representation, 19(2): 75--91, 2008.
[16]
R. Muñoz-Salinas, R. Medina-Carnicer, F. Madrid-Cuevas, and A. Carmona-Poyato. People detection and tracking with multiple stereo cameras using particle filters. Journal of Visual Communication and Image Representation, 20(5): 339--350, 2009.
[17]
S. Obdržálek, G. Kurillo, J. Han, T. Abresch, R. Bajcsy, et al. Real-time human pose detection and tracking for tele-rehabilitation in virtual reality. Studies in Health Technology and Informatics, 173: 320, 2012.
[18]
S. Park and H. Jeong. Real-time stereo vision fpga chip with low error rate. In Proc. Int. Conf. on Multimedia and Ubiquitous Engineering, pages 751--756, Washington, DC, USA, 2007. IEEE Computer Society.
[19]
L. Spinello, M. Luber, and K. Arras. Tracking people in 3d using a bottom-up top-down detector. In 2011 IEEE International Conference on Robotics and Automation (ICRA), pages 1304--1310. IEEE, 2011.
[20]
A. Yilmaz, O. Javed, and M. Shah. Object tracking: A survey. ACM Computing Surveys (CSUR), 38(4), 2006.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
IVCNZ '12: Proceedings of the 27th Conference on Image and Vision Computing New Zealand
November 2012
547 pages
ISBN:9781450314732
DOI:10.1145/2425836
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • HRS: Hoare Research Software Ltd.
  • Google Inc.
  • Dept. of Information Science, Univ.of Otago: Department of Information Science, University of Otago, Dunedin, New Zealand

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 November 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. GPU
  2. object tracking
  3. real-time stereo
  4. tracking accuracy

Qualifiers

  • Poster

Conference

IVCNZ '12
Sponsor:
  • HRS
  • Dept. of Information Science, Univ.of Otago
IVCNZ '12: Image and Vision Computing New Zealand
November 26 - 28, 2012
Dunedin, New Zealand

Acceptance Rates

Overall Acceptance Rate 55 of 74 submissions, 74%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 201
    Total Downloads
  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)1
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media