Abstract
During the past decade, feature extraction and knowledge acquisition based on video analysis have been extensively researched and tested on many applications such as closed-circuit television (CCTV) data analysis, large-scale public event control, and other daily security monitoring and surveillance operations with various degrees of success. However, since the actual video process is a multi-phased one and encompasses extensive theories and techniques ranging from fundamental image processing, computational geometry and graphics, and machine vision, to advanced artificial intelligence, pattern analysis, and even cognitive science, there are still many important problems to resolve before it can be widely applied. Among them, video event identification and detection are two prominent ones. Comparing with the most popular frame-to-frame processing mode of most of today’s approaches and systems, this project reorganizes video data as a 3D volume structure that provides the hybrid spatial and temporal information in a unified space. This paper reports an innovative technique to transform original video frames to 3D volume structures denoted by spatial and temporal features. It then highlights the volume array structure in a so-called “pre-suspicion” mechanism for a later process. The focus of this report is the development of an effective and efficient voxel-based segmentation technique suitable to the volumetric nature of video events and ready for deployment in 3D clustering operations. The paper is concluded with a performance evaluation of the devised technique and discussion on the future work for accelerating the pre-processing of the original video data.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
S. A. Velastin, P. Remagnino. Intelligent Distributed Video Surveillance System, New York, USA: The Institution of Electrical Engineers, pp. 1–2, 2006.
E. H. Aldelson, J. R. Bergen. Spatiotemporal energy models for the perception of motion. Journal of the Optical Society of America A, vol. 2, no. 2, pp. 284–299, 1985.
H. H. Baker, R. C. Bolles. Generalizing epipolar plane image analysis on the spatiotemporal surface. International Journal of Computer Vision, vol. 3, no. 1, pp. 33–49, 1989.
Y. Li, C. K. Tang, H. Y. Shum. Efficient dense depth estimation from dense multiperspective panoramas. In Proceedings of the 8th IEEE International Conference on Computer Vision, IEEE, Canada, vol. 1, no. 1, pp. 119–126, 2001.
G. Kuehne, S. Richter, M. Beier. Motion-based segmentation and contour based classification of video objects. In Proceedings of the 9th ACM International Conference on Multimedia, ACM, USA, vol. 9, no. 1, pp. 41–50, 2001.
C. W. Ngo, T. C. Pong, H. J. Zhang. Motion analysis and segmentation through spatio-temporal slices processing. IEEE Transactions on Image Processing, vol. 12, no. 3, pp. 341–355, 2003.
K. Hirahara, K. Ikeuchi. Detection of street-parking vehicles from panoramic street image. In Proceedings of IEEE International Conference on Intelligent Transportation Systems, IEEE, vol. 2, pp. 993–998, 2003.
R.Mandelbaum, G. Salgian, H. Sawhney. Correlation-based estimation of ego-motion and structure from motion and stereo. In Proceedings of the 7th IEEE International Conference on Computer Vision, IEEE, Greece, vol. 1, pp. 544–551, 1999.
H. Kawasaki, M. Murao, K. Ikeuchi, M. Sakauchi. Enhanced navigation systems with real images and real-time information. In Proceedings of the 8th World Congress on Intelligent Transport Systems, Sweden, pp. 221–228, 2001.
A. Rav-Acha, P. Peleg. A unified approach for motion analysis and view synthesis. In Proceedings of the 2nd International Symposium on 3D Data Processing, Visualization, and Transmission, Greece, vol. 1, no. 1, pp. 717–724, 2004.
M. blank, L. Gorelick, E. Shechtman, M. Irani, R. Basri. Actions as space-time shapes. In Proceedings of the 10th IEEE International Conference on Computer Vision, IEEE, PRC, vol. 2, pp. 1395–1402, 2005.
A. Yilmaz, M. Shah. Actions as objects: A novel action representation. In Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition, USA, vol. 1, pp.984–989, 2005.
V. Moolani, R. Balasubramanian, L. Shen, A. Tandon. Shape analysis and spatio-temporal tracking of mesoscale eddies in miami isopycnic coordinate ocean model. In Proceedings of International Symposium on 3D Data Processing Visualization and Transmission, USA, vol. 1, pp. 663–670, 2006.
L. Gorelick, M. Galun, E. Sharon, R. Basri, A. Brandt. Shape representation and classification using the Poisson equation. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 12, pp. 1991–2005, 2006.
A. F. Bobick, J. W. Davis. The recognition of human movement using temporal templates. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 3, pp. 257–267, 2001.
D. Weinland, R. Ronfard, E. Boyer. Free viewpoint action recognition using motion history volumes. Computer Vision and Image Understanding, vol. 104, no. 2–3, pp. 249–257, 2006.
T. Ogata, J. K. Tan, S. Ishikawa. High-speed human motion recognition based on a motion history image and an eigenspace. IEICE Transactions on Information and Systems, vol. E89-D, no. 1, pp. 281–289, 2006.
C. Ware, G. Franck. Evaluating stereo and motion cues for visualizing information nets in three dimensions. ACM Transactions on Graphics, vol. 15, no. 2, pp. 121–140, 1996.
M. Xiao, C. Z. Han, L. Zhang. Moving shadow detection and removal for traffic sequences. International Journal of Automation and Computing. vol. 4, no. 1, pp. 38–46, 2007.
K. Yamamoto, R. Oi. Color correction for multi-view video using energy minimization of view network. International Journal of Automation and Computing. vol. 5, no. 3, pp. 234–245, 2008.
B. K. P. Horn, B. G. Schunck. “Determining optical flow”: A retrospective. Artificial Intelligence, vol. 59, no. 1–2, pp. 81–87, 1993.
M. Kass, A. Witkin, D. Terzopoulos. Snakes: Active contour models. International Journal of Computer Vision, vol. 1, no. 4, pp. 321–331, 1988.
K. Fukunaga, L. Hostetler. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Transactions on Information Theory, vol. 21, no. 1, pp. 32–40, 1975.
R. O. Duda, P. E. Hart. Pattern Classification and Scene Analysis, New York, USA: Wiley-Interscience, pp.135–137, 2000.
D. Comaniciu, P. Meer. Mean shift: A robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 5, pp. 603–619, 2002.
D. Comaniciu. Nonparametric Robust Methods for Computer Vision, Ph. D. dissertation, Department of Electrical and Computer Engineering, Rutgers University, USA, 2001.
D. A. Forsyth, J. Ponce. Computer vision: A modern approach, Prentice Hall, pp. 309–313, 2003.
J. Canny. A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 8, no. 6, pp. 679–698, 1986.
F. Porikli. Constant time O(1) bilateral filtering. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, USA, pp. 1–8, 2008.
O. Tuzel, F. Porikli, P. Meer. Learning on lie group for invariant detection and tracking. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, IEEE, USA, pp. 1–8, 2008.
Author information
Authors and Affiliations
Corresponding author
Additional information
Jing Wang received the B. Sc. degree from Xidian University, Xi’an, PRC in 2006. He then joined in Beijing Zhong Ke Fan Hua Measurement & Control Technology Co., Ltd. (also known as Pansino Ltd.), PRC as one of development engineers working on machine vision system for 2 years. He is currently a research student at the Informatics Research Group of University of Huddersfield, UK.
His research interests include image processing, machine vision, and intelligent computer vision system.
Zhi-Jie Xu received the Ph.D. degree in virtual manufacturing in 2000 at the University of Derby, UK. He is a reader and the head of the Computer Graphics and Image Processing Research Group within the School of Computing and Engineering at the University of Huddersfield, UK. He is a charted electronic engineer and a member of the IEEE, IET/IEE, British Computer Society (BCS), and UK Higher Education Academy (HEA).
His research interests include real-time graphics and vision systems, virtual reality (VR), manufacturing simulations, and web-based e-technologies.
Rights and permissions
About this article
Cite this article
Wang, J., Xu, ZJ. Video analysis based on volumetric event detection. Int. J. Autom. Comput. 7, 365–371 (2010). https://doi.org/10.1007/s11633-010-0516-6
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11633-010-0516-6