Video analysis based on volumetric event detection

Jing Wang¹ &
Zhi-Jie Xu¹

88 Accesses
Explore all metrics

Abstract

During the past decade, feature extraction and knowledge acquisition based on video analysis have been extensively researched and tested on many applications such as closed-circuit television (CCTV) data analysis, large-scale public event control, and other daily security monitoring and surveillance operations with various degrees of success. However, since the actual video process is a multi-phased one and encompasses extensive theories and techniques ranging from fundamental image processing, computational geometry and graphics, and machine vision, to advanced artificial intelligence, pattern analysis, and even cognitive science, there are still many important problems to resolve before it can be widely applied. Among them, video event identification and detection are two prominent ones. Comparing with the most popular frame-to-frame processing mode of most of today’s approaches and systems, this project reorganizes video data as a 3D volume structure that provides the hybrid spatial and temporal information in a unified space. This paper reports an innovative technique to transform original video frames to 3D volume structures denoted by spatial and temporal features. It then highlights the volume array structure in a so-called “pre-suspicion” mechanism for a later process. The focus of this report is the development of an effective and efficient voxel-based segmentation technique suitable to the volumetric nature of video events and ready for deployment in 3D clustering operations. The paper is concluded with a performance evaluation of the devised technique and discussion on the future work for accelerating the pre-processing of the original video data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Motion-Driven Approach for Fine-Grained Temporal Segmentation of User-Generated Videos

Abnormal Event Detection Based on Multi-scale Markov Random Field

Automatic Event Detection in User-Generated Video Content: A Survey

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

S. A. Velastin, P. Remagnino. Intelligent Distributed Video Surveillance System, New York, USA: The Institution of Electrical Engineers, pp. 1–2, 2006.
Google Scholar
E. H. Aldelson, J. R. Bergen. Spatiotemporal energy models for the perception of motion. Journal of the Optical Society of America A, vol. 2, no. 2, pp. 284–299, 1985.
Article Google Scholar
H. H. Baker, R. C. Bolles. Generalizing epipolar plane image analysis on the spatiotemporal surface. International Journal of Computer Vision, vol. 3, no. 1, pp. 33–49, 1989.
Article Google Scholar
Y. Li, C. K. Tang, H. Y. Shum. Efficient dense depth estimation from dense multiperspective panoramas. In Proceedings of the 8th IEEE International Conference on Computer Vision, IEEE, Canada, vol. 1, no. 1, pp. 119–126, 2001.
Google Scholar
G. Kuehne, S. Richter, M. Beier. Motion-based segmentation and contour based classification of video objects. In Proceedings of the 9th ACM International Conference on Multimedia, ACM, USA, vol. 9, no. 1, pp. 41–50, 2001.
Google Scholar
C. W. Ngo, T. C. Pong, H. J. Zhang. Motion analysis and segmentation through spatio-temporal slices processing. IEEE Transactions on Image Processing, vol. 12, no. 3, pp. 341–355, 2003.
Article Google Scholar
K. Hirahara, K. Ikeuchi. Detection of street-parking vehicles from panoramic street image. In Proceedings of IEEE International Conference on Intelligent Transportation Systems, IEEE, vol. 2, pp. 993–998, 2003.
Article Google Scholar
R.Mandelbaum, G. Salgian, H. Sawhney. Correlation-based estimation of ego-motion and structure from motion and stereo. In Proceedings of the 7th IEEE International Conference on Computer Vision, IEEE, Greece, vol. 1, pp. 544–551, 1999.
Chapter Google Scholar
H. Kawasaki, M. Murao, K. Ikeuchi, M. Sakauchi. Enhanced navigation systems with real images and real-time information. In Proceedings of the 8th World Congress on Intelligent Transport Systems, Sweden, pp. 221–228, 2001.
A. Rav-Acha, P. Peleg. A unified approach for motion analysis and view synthesis. In Proceedings of the 2nd International Symposium on 3D Data Processing, Visualization, and Transmission, Greece, vol. 1, no. 1, pp. 717–724, 2004.
Article Google Scholar
M. blank, L. Gorelick, E. Shechtman, M. Irani, R. Basri. Actions as space-time shapes. In Proceedings of the 10th IEEE International Conference on Computer Vision, IEEE, PRC, vol. 2, pp. 1395–1402, 2005.
Google Scholar
A. Yilmaz, M. Shah. Actions as objects: A novel action representation. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, USA, vol. 1, pp.984–989, 2005.
Google Scholar
V. Moolani, R. Balasubramanian, L. Shen, A. Tandon. Shape analysis and spatio-temporal tracking of mesoscale eddies in miami isopycnic coordinate ocean model. In Proceedings of International Symposium on 3D Data Processing Visualization and Transmission, USA, vol. 1, pp. 663–670, 2006.
Article Google Scholar
L. Gorelick, M. Galun, E. Sharon, R. Basri, A. Brandt. Shape representation and classification using the Poisson equation. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 12, pp. 1991–2005, 2006.
Article Google Scholar
A. F. Bobick, J. W. Davis. The recognition of human movement using temporal templates. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 3, pp. 257–267, 2001.
Article Google Scholar
D. Weinland, R. Ronfard, E. Boyer. Free viewpoint action recognition using motion history volumes. Computer Vision and Image Understanding, vol. 104, no. 2–3, pp. 249–257, 2006.
Article Google Scholar
T. Ogata, J. K. Tan, S. Ishikawa. High-speed human motion recognition based on a motion history image and an eigenspace. IEICE Transactions on Information and Systems, vol. E89-D, no. 1, pp. 281–289, 2006.
Article Google Scholar
C. Ware, G. Franck. Evaluating stereo and motion cues for visualizing information nets in three dimensions. ACM Transactions on Graphics, vol. 15, no. 2, pp. 121–140, 1996.
Article Google Scholar
M. Xiao, C. Z. Han, L. Zhang. Moving shadow detection and removal for traffic sequences. International Journal of Automation and Computing. vol. 4, no. 1, pp. 38–46, 2007.
Article Google Scholar
K. Yamamoto, R. Oi. Color correction for multi-view video using energy minimization of view network. International Journal of Automation and Computing. vol. 5, no. 3, pp. 234–245, 2008.
Article Google Scholar
B. K. P. Horn, B. G. Schunck. “Determining optical flow”: A retrospective. Artificial Intelligence, vol. 59, no. 1–2, pp. 81–87, 1993.
Article Google Scholar
M. Kass, A. Witkin, D. Terzopoulos. Snakes: Active contour models. International Journal of Computer Vision, vol. 1, no. 4, pp. 321–331, 1988.
Article Google Scholar
K. Fukunaga, L. Hostetler. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Transactions on Information Theory, vol. 21, no. 1, pp. 32–40, 1975.
Article MATH MathSciNet Google Scholar
R. O. Duda, P. E. Hart. Pattern Classification and Scene Analysis, New York, USA: Wiley-Interscience, pp.135–137, 2000.
Google Scholar
D. Comaniciu, P. Meer. Mean shift: A robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 5, pp. 603–619, 2002.
Article Google Scholar
D. Comaniciu. Nonparametric Robust Methods for Computer Vision, Ph. D. dissertation, Department of Electrical and Computer Engineering, Rutgers University, USA, 2001.
Google Scholar
D. A. Forsyth, J. Ponce. Computer vision: A modern approach, Prentice Hall, pp. 309–313, 2003.
J. Canny. A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 8, no. 6, pp. 679–698, 1986.
Article Google Scholar
F. Porikli. Constant time O(1) bilateral filtering. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, USA, pp. 1–8, 2008.
Google Scholar
O. Tuzel, F. Porikli, P. Meer. Learning on lie group for invariant detection and tracking. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, USA, pp. 1–8, 2008.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Informatics, School of Computing and Engineering, University of Huddersfield, Huddersfield, HD1 3DH, UK
Jing Wang & Zhi-Jie Xu

Authors

Jing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhi-Jie Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhi-Jie Xu.

Additional information

Jing Wang received the B. Sc. degree from Xidian University, Xi’an, PRC in 2006. He then joined in Beijing Zhong Ke Fan Hua Measurement & Control Technology Co., Ltd. (also known as Pansino Ltd.), PRC as one of development engineers working on machine vision system for 2 years. He is currently a research student at the Informatics Research Group of University of Huddersfield, UK.

His research interests include image processing, machine vision, and intelligent computer vision system.

Zhi-Jie Xu received the Ph.D. degree in virtual manufacturing in 2000 at the University of Derby, UK. He is a reader and the head of the Computer Graphics and Image Processing Research Group within the School of Computing and Engineering at the University of Huddersfield, UK. He is a charted electronic engineer and a member of the IEEE, IET/IEE, British Computer Society (BCS), and UK Higher Education Academy (HEA).

His research interests include real-time graphics and vision systems, virtual reality (VR), manufacturing simulations, and web-based e-technologies.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, J., Xu, ZJ. Video analysis based on volumetric event detection. Int. J. Autom. Comput. 7, 365–371 (2010). https://doi.org/10.1007/s11633-010-0516-6

Download citation

Received: 23 August 2009
Revised: 19 November 2009
Published: 18 August 2010
Issue Date: August 2010
DOI: https://doi.org/10.1007/s11633-010-0516-6

Video analysis based on volumetric event detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Motion-Driven Approach for Fine-Grained Temporal Segmentation of User-Generated Videos

Abnormal Event Detection Based on Multi-scale Markov Random Field

Automatic Event Detection in User-Generated Video Content: A Survey

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Video analysis based on volumetric event detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

A Motion-Driven Approach for Fine-Grained Temporal Segmentation of User-Generated Videos

Abnormal Event Detection Based on Multi-scale Markov Random Field

Automatic Event Detection in User-Generated Video Content: A Survey

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation