Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Emulating human observers with bayesian binning: Segmentation of action streams

Published: 29 August 2011 Publication History

Abstract

Natural body movements arise in the form of temporal sequences of individual actions. During visual action analysis, the human visual system must accomplish a temporal segmentation of the action stream into individual actions. Such temporal segmentation is also essential to build hierarchical models for action synthesis in computer animation. Ideally, such segmentations should be computed automatically in an unsupervised manner. We present an unsupervised segmentation algorithm that is based on Bayesian Binning (BB) and compare it to human segmentations derived from psychophysical data. BB has the advantage that the observation model can be easily exchanged. Moreover, being an exact Bayesian method, BB allows for the automatic determination of the number and positions of segmentation points. We applied this method to motion capture sequences from martial arts and compared the results to segmentations provided by humans from movies that showed characters that were animated with the motion capture data. Human segmentation was then assessed by an interactive adjustment paradigm, where participants had to indicate segmentation points by selection of the relevant frames. Results show a good agreement between automatically generated segmentations and human performance when the trajectory segments between the transition points were modeled by polynomials of at least third order. This result is consistent with theories about differential invariants of human movements.

Supplementary Material

Endres (endres.zip)
Supplemental movie, appendix, image and software files for, Emulating human observers with bayesian binning: Segmentation of action streams

References

[1]
Agam, Y. and Sekuler, R. 2008. Geometric structure and chunking in reproduction of motion sequences. J. Vis. 8, 1, 1--12.
[2]
Arikan, O. and Forsyth, D. A. 2002. Interactive motion generation from examples. ACM Trans. Graph. 21, 483--490.
[3]
Barbič, J., Safonova, A., Pan, J.-Y., Faloutsos, C., Hodgins, J. K., and Pollard, N. S. 2004. Segmenting motion capture data into distinct behaviors. In Proceedings of Graphics Interface 2004 (GI'04). 185--194.
[4]
Bishop, C. M. 2007. Pattern Recognition and Machine Learning. Springer.
[5]
Dickman, H. R. 1963. The perception of behavioral units. In The Stream of Behavior, R. G. Barker Ed., Appleton-Century-Crofts, New York, 23--41.
[6]
Endres, D. and Földiák, P. 2005. Bayesian bin distribution inference and mutual information. IEEE Trans. Inf. Theory 51, 11, 3766--3779.
[7]
Endres, D. and Oram, M. 2010. Feature extraction from spike trains with Bayesian binning: Latency is where the signal starts. J. Comput. Neurosci. 29, 149--169.
[8]
Fearnhead, P. 2006. Exact and efficient Bayesian inference for multiple changepoint problems. Statist. Comput. 16, 2, 203--213.
[9]
Flash, T. and Hogan, N. 1985. The coordination of arm movements: An experimentally confirmed mathematical model. J. Neurosci. 5, 1688--1703.
[10]
Green, R. D. 2003. Spatial and temporal segmentation of continuous human motion from monocular video images. In Proceedings of Image and Vision Computing. 163--169.
[11]
Hutter, M. 2007. Exact Bayesian regression of piecewise constant functions. J. Bayesian Anal. 2, 4, 635--664.
[12]
Ilg, W., Bakir, G., Mezger, J., and Giese, M. 2004. On the representation, learning and transfer of spatio-temporal movement characteristics. Int. J. Hum. Robot. 1, 4, 613--636.
[13]
Kovar, L., Gleicher, M., and Pighin, F. 2002. Motion graphs. ACM Trans. Graph. 21, 473--482.
[14]
Kschischang, F., Frey, B., and Loeliger, H.-A. 2001. Factor graphs and the sum-product algorithm. IEEE Trans. Inf. Theory 47, 2, 498--519.
[15]
Marida, K. V. and Jupp, P. E. 2000. Directional Statistics. Wiley.
[16]
Newtson, D. and Engquist, G. 1976. The perceptual organization of ongoing behavior. J. Exper. Social Psychol. 12, 5, 436--450.
[17]
Petersen, K. B. and Pedersen, M. S. 2008. The matrix cookbook. Tech. rep. citeseerx.ist.psu.edu.
[18]
Polyakov, F., Stark, E., Drori, R., Abeles, M., and Flash, T. 2009. Parabolic movement primitives and cortical states: Merging optimality with geometric invariance. Biol. Cybern. 100, 2, 159--184.
[19]
Roether, C. L., Omlor, L., Christensen, A., and Giese, M. A. 2009. Critical features for the perception of emotion from gait. J. Vis. 9, 6, 1--32.
[20]
Rose, C., Bodenheimer, B., and Cohen, M. F. 1998. Verbs and adverbs: Multidimensional motion interpolation using radial basis functions. IEEE Comput. Graph. Appl. 18, 32--40.
[21]
Safonova, A., Hodgins, J. K., and Pollard, N. S. 2004. Synthesizing physically realistic human motion in low-dimensional, behavior-specific spaces. ACM Trans. Graph. 23, 514--521.
[22]
Segouat, J. and Braffort, A. 2009. Toward the study of sign language coarticulation: Methodology proposal. In Proceedings of the 2nd International Conferences on Advances in Computer-Human Interactions (ACHI'09). IEEE Computer Society, Los Alamitos, CA, 369--374.
[23]
Shipley, T. F., Maguire, M. J., and Brumberg, J. 2004. Segmentation of event paths. J. Vis. 4, 8.
[24]
Zacks, J. M., Kumar, S., Abrams, R. A., and Mehta, R. 2009. Using movement and intentions to understand human activity. Cogn. 112, 2, 201--216.

Cited By

View all
  • (2023)A primitive-based representation of dance: modulations by experience and perceptual validityJournal of Neurophysiology10.1152/jn.00161.2023130:5(1214-1225)Online publication date: 1-Nov-2023
  • (2022)Segmentation boundaries in accelerometer data of arm motion induced by music: Online computation and perceptual assessmentHuman Technology10.14254/1795-6889.2022.18-3.418:3(250-266)Online publication date: 28-Dec-2022
  • (2021)ARMA-Based Segmentation of Human Limb Motion SequencesSensors10.3390/s2116557721:16(5577)Online publication date: 19-Aug-2021
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Applied Perception
ACM Transactions on Applied Perception  Volume 8, Issue 3
August 2011
79 pages
ISSN:1544-3558
EISSN:1544-3965
DOI:10.1145/2010325
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 29 August 2011
Accepted: 01 July 2011
Received: 01 April 2011
Published in TAP Volume 8, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Bayesian methods
  2. Motion capture
  3. action segmentation
  4. unsupervised learning

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)2
Reflects downloads up to 29 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)A primitive-based representation of dance: modulations by experience and perceptual validityJournal of Neurophysiology10.1152/jn.00161.2023130:5(1214-1225)Online publication date: 1-Nov-2023
  • (2022)Segmentation boundaries in accelerometer data of arm motion induced by music: Online computation and perceptual assessmentHuman Technology10.14254/1795-6889.2022.18-3.418:3(250-266)Online publication date: 28-Dec-2022
  • (2021)ARMA-Based Segmentation of Human Limb Motion SequencesSensors10.3390/s2116557721:16(5577)Online publication date: 19-Aug-2021
  • (2021)Human Limb Motion Segmentation by PCA-ARMA Methods2021 IEEE 21st International Conference on Communication Technology (ICCT)10.1109/ICCT52962.2021.9658092(1153-1157)Online publication date: 13-Oct-2021
  • (2020)Learning Sequential Force Interaction SkillsRobotics10.3390/robotics90200459:2(45)Online publication date: 17-Jun-2020
  • (2019)Taking it out of context: The role of contextual coherence during social event segmentationAttention, Perception, & Psychophysics10.3758/s13414-019-01752-1Online publication date: 28-May-2019
  • (2019)Predicting Perceived Naturalness of Human Animations Based on Generative Movement Primitive ModelsACM Transactions on Applied Perception10.1145/335540116:3(1-18)Online publication date: 6-Sep-2019
  • (2019)A systematic survey of martial art using motion capture technologiesMultimedia Tools and Applications10.1007/s11042-018-6624-y78:8(10113-10140)Online publication date: 1-Apr-2019
  • (2015)VideoHandlesComputers and Graphics10.1016/j.cag.2015.01.00448:C(99-106)Online publication date: 1-May-2015
  • (2015)Bayesian Approaches for Learning of Primitive-Based Compact Representations of Complex Human ActivitiesDance Notations and Robot Motion10.1007/978-3-319-25739-6_6(117-137)Online publication date: 25-Nov-2015
  • Show More Cited By

View Options

Get Access

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media