Abstract
Different from many gesture-based human-robot interaction applications, which focused on the recognition of the interactional or the pointing gestures, this paper proposes a vision-based method for manipulative gesture recognition aiming to achieve natural, proactive, and non-intrusive interaction between humans and robots. The main contributions of the paper are an object-centered scheme for the segmentation and characterization of hand trajectory information, the use of particle filtering methods for an action primitive spotting, and the tight coupling of bottom-up and top-down processing that realizes a task-driven attention filter for low-level recognition steps. In contrast to purely trajectory based techniques, the presented approach is called object-oriented w.r.t. two different aspects: it is object-centered in terms of trajectory features that are defined relative to an object, and it uses object-specific models for action primitives. The system has a two-layer structure recognizing both the HMM-modeled manipulative primitives and the underlying task characterized by the manipulative primitive sequence. The proposed top-down and bottom-up mechanism between the two layers decreases the image processing load and improves the recognition rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Black, M.J., Jepson, A.D.: A Probabilistic Framework for Matching Temporal Trajectories: CONDENSATION-Based Recognition of Gestures and Expressions. In: Burkhardt, H.-J., Neumann, B. (eds.) ECCV 1998. LNCS, vol. 1406, pp. 909–924. Springer, Heidelberg (1998)
Bobick, A.: Movement, activity, and action: The role of knowledge in the perception of motion. In: Royal Society Workshop on Knowledge-based Vision in Man and Machine (1998)
Chan, M.T., Hoogs, A., Schmiederer, J., Petersen, M.: Detecting rare events in video using semantic primitives with hmm. In: ICPR 2004, vol. IV, pp. 150–154 (2004)
Fritsch, J.: Vision-based Recognition of Gestures with Context. Dissertation, Bielefeld University, Technical Faculty (2003)
Fritsch, J., Hofemann, N., Sagerer, G.: Combining Sensory and Symbolic Data for Manipulative Gesture Recognition. In: Proc. IEEE ICPR, Cambridge, UK, pp. 930–933 (2004)
Fukuda, T., Nakauchi, Y., Noguchi, K., Matsubara, T.: Time series action support by mobile robot in intelligent environment. In: Proc. IEEE Int’l. Conf. Robotics and Automation, Barcelona, Spain, pp. 2908–2913 (2005)
Isard, M., Blake, A.: Condensation – conditional density propagation for visual tracking. Int. J. Computer Vision, 5–28 (1998)
Jo, K.H., Kuno, Y., Shirai, Y.: Manipulative hand gesture recognition using task knowledge for human computer interaction. In: Proc. Int’l. Conf. on Automatic Face and Gesture Recognition, pp. 468–473 (1998)
Li, Z., Hofemann, N., Fritsch, J., Sagerer, G.: Hierarchical modeling and recognition of manipulative gesture. In: Proc. ICCV, Workshop on Modeling People and Human Interaction, Beijing, China. IEEE, Los Alamitos (2005)
Moore, D.J., Essa, I.A., Hayes III, M.H.: Exploiting human actions and object context for recognition tasks. In: Proc. IEEE Int’l. Conf. Computer Vision, pp. 20–27 (1999)
Nehaniv, C.P.: Classifying types of gesture and inferring intent. In: Proceedings of the Symposium on Robot Companions: Hard problems and Open Challenges in Robot-Human Interaction AISB 2005, Hatfield, UK, pp. 74–81 (2005)
Pinhanez, C.S., Bobick, A.F.: Human action detection using pnf propagation of temporal constraints. In: Proc. IEEE CVPR, Washington, DC, USA, pp. 898–907 (1998)
Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. In: Readings in speech recognition, pp. 267–296. Morgan Kaufmann Publishers Inc., San Francisco (1990)
Viola, P., Jones, M.: Robust real-time object detection. In: Proc. IEEE Int. Workshop on Statistical and Computational Theories of Vision, Vancouver, Canada (2001)
Yu, C., Ballard, D.H.: Learning to Recognize Human Action Sequences. In: 2nd International Conference on Development and Learning (ICDL 2002), pp. 28–34 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, Z., Fritsch, J., Wachsmuth, S., Sagerer, G. (2006). An Object-Oriented Approach Using a Top-Down and Bottom-Up Process for Manipulative Action Recognition. In: Franke, K., Müller, KR., Nickolay, B., Schäfer, R. (eds) Pattern Recognition. DAGM 2006. Lecture Notes in Computer Science, vol 4174. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11861898_22
Download citation
DOI: https://doi.org/10.1007/11861898_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44412-1
Online ISBN: 978-3-540-44414-5
eBook Packages: Computer ScienceComputer Science (R0)