Abstract
This paper presents a robust automated noninvasive video monitoring approach to recover the human pose in conditions with persistent heavy obscuration. The proposed methods are compared with Ramanan’s stylized pose detection method and Wang’s sequential pose model. The experimental results show that the proposed method performs significantly better than Ramanan’s approach, is able to estimate the obscured body pose with various postures and obscuration levels in different environments, and is not sensitive to illumination changes. The system is evaluated in two domains: sleeping human subjects obscured by a bed cover, and pedestrians with a cluttered background scene, low feature contrast and baggy clothing. The body part detectors are trained in the sleep monitoring domain but are still able to estimate the pose in the pedestrian domain, demonstrating the robustness of the proposed technique.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Agarwal, A., & Triggs, B. (2004). 3D Human pose from silhouettes by relevance vector regression. In Proceedings of conference on computer vision and pattern recognition (Vol. 2, pp. 882–888).
Andriluka, M., Roth, S., & Schiele, B. (2009). Pictorial structures revisited: people detection and articulated pose estimation. In Proceedings of conference on computer vision and pattern recognition.
Balan, A. O., & Black, M. J. (2008). The naked truth: estimating body shape under clothing. In Proceedings of European conference on computer vision (pp. 15–29).
Barrow, H. G., Tenenbaum, J. M., Bolles, R. C., & Wolf, H. C. (1977). Parametric correspondence and chamfer matching: two new techniques for image matching. In Proceedings of international joint conference artificial intelligence (pp. 659–663).
Bordefors, G. (1988). Hierarchical chamfer matching: a parametric edge matching algorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence, 10(6), 849–865.
Boult, T. E., Micheals, R., Gao, X., Lewis, P., Power, C., Yin, W., & Erkan, A. (1999). Frame-rate omnidirectional surveillance and tracking of camouflaged and occluded targets. In Proceedings of the second IEEE workshop on visual surveillance (pp. 48–55).
Chalmond, B., Francesconi, B., & Herbin, S. (2004). Using hidden scale for salient object detection. IEEE Transactions on Image Processing, 2644–2656.
Collins, R. T., Liu, Y., & Leordeanu, M. (2004). Online selection of discriminative tracking features. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27, 1631–1643.
Cremers, D. (2008). Nonlinear dynamical shape priors for level set segmentation. Journal of Scientific Computing, 35, 132–143.
Dietterich, T. G. (1998). Approximate statistical tests for comparing supervised classification learning algorithms. Neural Computation, 10, 1895–1923.
Deutscher, J., & Reid, I. (2005). Articulated body motion capture by stochastic search. International Journal of Computer Vision, 2, 185–205.
Elgammal, A., & Lee, C. S. (2004). Inferring 3D body pose from silhouettes using activity manifold learning. In Proceedings of the conference on computer vision and pattern recognition (Vol. 2, pp. 681–688).
Eng, H. L., Wang, J., Kam, A. H., & Yau, W. Y. (2004). A Bayesian framework for robust human detection and occlusion handling using human shape model. In Proceedings of the 17th international conference on pattern recognition (pp. 257–260).
Enzweiler, M., & Gavrila, D. M. (2009). Monocular pedestrian detection: survey and experiments. IEEE Transactions on Pattern Analysis and Machine Intelligence, 31, 2179–2195.
Felzenszwalb, P. F., & Huttenlocher, D. P. (2005). Pictorial structures for object recognition. International Journal of Computer Vision, 61(1), 55–79.
Ferrari, V., Marin-Jimenez, M., & Zisserman, A. (2008). Progressive search space reduction for human pose estimation. In Proceedings of the conference on computer vision and pattern recognition (pp. 1–8).
Flemons, W. W., Littner, M. R., Rowley, J. A., & Gay, P. et al. (2003). Home diagnosis of sleep apnea: a systematic review of the literature. An evidence review. The American Thoracic Society. CHEST, 124(4), 1543–1579.
Fleuret, F., & Geman, D. (2001). Coarse-to-fine face detection. International Journal of Computer Vision, 41(1), 85–107.
Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Computer and System Sciences, 55, 119–139.
Gastaut, H. Tassinari, C. A., & Duron, B. (1966). Polygraphic study of the episodic diurnal and nocturnal (hypnic and respiratory) manifestations of the Pickwick syndrome. Brain Research, 1, 167–186.
Gavrila, D. (2000) Pedestrian detection from a moving vehicle. In Proceedings of the 6th European conference on computer vision (Vol. 2, pp. 37–49).
Ghafoor, A., Iqbal, R. N., & Khan, S. (2003). Robust image matching algorithm. In Proceedings of video image processing and multimedia communications 4th EURASIP conference (Vol. 1, pp. 155–160).
Gibson, G. J. (2004). Obstructive sleep apnoea syndrome: underestimated and undertreated. British Medical Bulletin, 72, 49–64.
Guo, F., & Qian, G. (2006). Learning and inference of 3D human poses from Gaussian mixture modeled silhouettes. In Proceedings of the international conference on pattern recognition (Vol. 2, pp. 43–47).
Haba-Rubio, J., Stane, L., Krieger, J., & Macher, J. P. (2005). Periodic limb movements and sleepiness in obstructive sleep apnea patients. Sleep Medicine, 6, 225–229.
Hasler, N., Stoll, C., Rosenhahn, B., Thormahlen, T., & Seidel, H. (2009). Estimating body shape of dressed humans. Computers and Graphics, 33, 211–216.
Hoey, J. (2006). Tracking using flocks of features, with application to assisted handwashing. In Proceedings of British machine vision conference (Vol. 1, pp. 367–376).
Huang, Z. Q., & Jiang, Z. (2005). Tracking camouflaged objects with weighted region consolidation. In Proceedings of the IEEE digital image computing techniques and applications (pp. 161–168).
Jaeggli, T., Caenen, G., Fransens, R., & Gool, L. V. (2005). Analysis of human locomotion based on partial measurements. In Proceedings of IEEE motion (pp. 248–253).
Javaheri, S., Abraham, W. T., & Brown, C. et al. (2004). Prevalence of obstructive sleep apnoea and periodic limb movement in 45 subjects with heart transplantation. European Heart Journal, 25, 260–266.
Lan, X., & Huttenlocher, D. (2004). A unified spatio-temporal articulated model for tracking. In: Proceedings of computer vision and pattern recognition (Vol. 1, pp. 722–729).
Lee, M. W., & Cohen, I. (2006). A model-based approach for estimating human 3D poses in static images. IEEE Transations on Pattern Analysis and Machine Intelligence, 6, 905–916.
Lee, M. W., & Nevatia, R. (2007). Body part detection for human pose estimation and tracking. In Proceedings of the IEEE workshop on motion and video computing (pp. 23–30).
Li, B., Meng, Q., & Holstein, H. (2008). Articulated motion reconstruction from feature points. Pattern Recognition, 41, 418–431.
Matusiewicz, S., & Gravill, N. (2006). Personal interview. Consultant Physicians in the Medical Physics Department of Lincoln County Hospital in United Kingdom.
Mori, G., Ren, X., Efros, A. A., & Malik, J. (2004). Recovering human body configurations: combining segmentation and recognition. In: Proceedings of computer vision and pattern recognition (pp. 326–333).
Neven, A. K., Middelkoop, H. A. M., Kemp, B., Kamphuisen, H. A. C., & Springer, M. P. (1998). The prevalence of clinically significant sleep apnoea syndrome in the Netherlands. Thorax, 53, 638–642.
Nguyen, H. T., & Smeulders, A. W. M. (2004). Fast occluded object tracking by a robust appearance filter. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26, 1099–1104.
Quinlan, J. R. (1996). Bagging, boosting and c4.5. In Proceedings of the thirteenth national conference on artificial intelligence (pp. 725–730).
Ramanan, D. (2007). Learning to parse images of articulated bodies. Advances in Neural Information Processing Systems, 19, 1129–1136.
Ramanan, D. (2009). Web homepage of Deva Ramanan. http:/www.ics.uci.edu/dramanan.
Ramanan, D., & Forsyth, D. A. (2003). Finding and tracking people from the bottom up. In Proceedings of computer vision and pattern recognition (Vol. 2, pp. 467–474).
Ramanan, D., Forsyth, D. A., & Zisserman, A. (2007). Tracking people by learning their appearance. IEEE Transactions on Pattern Analysis and Machine Intelligence, 29, 65–81.
Ramanan, D., Forsyth, D. A., & Zisserman, A. (2006). Strike a pose: tracking people by finding stylized poses. In Proceedings of computer vision and pattern recognition.
Ren, X., Berg, A. C., & Malik, J. (2003). Recovering human body configurations using pairwise constraints between parts. In Proceedings of the international conference on computer vision (Vol. 1, pp. 824–831).
Rosenhahn, B., Kersting, U., Powell, K., Klette, R., Klette, G., & Seidel, H. (2007). A system for articulated tracking incorporating a clothing model. Machine Vision and Applications, 18, 25–40.
Sigal, L., & Black, M. (2006). Measure locally, reason globally: occlusion-sensitive articulated pose estimation. In Proceedings of the computer vision and pattern recognition (pp. 2041–2048).
Sigal, L., Bhatia, S., Roth, S., Black, M., & Isard, M. (2004). Tracking loose-limbed people. In Proceedings of the computer vision and pattern recognition (Vol. 1, pp. 421–428).
Sminchisescu, C., Kanaujia, A., Li, Z., & Metaxas, D. (2005). Discriminative density propagation for 3D human motion estimation. In Proceedings of computer vision and pattern recognition (Vol. 1, pp. 390–397).
Thayananthan, A., Stenger, B., Torr, P., & Cipolla, R. (2003). Shape context and chamfer matching in cluttered scenes. Computer Vision and Pattern Recognition, 1, 1.
Tobias, J., Geert, C., Rik, F., & Van, G. L. (2005). Analysis of human locomotion based on partial measurements. In Proceedings of IEEE workshop on motion and video computing (Vol. 2, pp. 248–253).
Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. In Proceedings of IEEE CVPR conference (pp. 511–518).
Visi (2008). Visi-3 digital video system. http:/www.stowood.co.uk/page26.html.
Winn, J., & Shotton, J. (2006). The layout consistent random field for recognizing and segmenting partially occluded objects. In Proceedings of IEEE computer vision and pattern recognition (pp. 37–44).
Wang, C.-W. (2006). Real time Sobel square edge detector for night vision analysis. In Proceedings of international conference on image analysis and recognition, Lecture Notes in Computer Science (pp. 404–413).
Wang, C.-W., & Hunter, A. (2008a). A robust pose matching algorithm for covered body analysis for sleep apnoea. In Proceedings of the 8th international conference of IEEE BioInformatics and BioEngineering.
Wang, C.-W., & Hunter, A. (2008b). A simple sequential pose recognition model for sleep apnoea. In Proceedings of the 8th international conference of IEEE BioInformatics and BioEngineering.
Wang, C.-W., & Hunter, A. (2009). A low variance error boosting algorithm. International Journal of Applied Intelligence. Published online: 21 February 2009.
Wang, Y., & Mori, G. (2008). Multiple tree models for occlusion and spatial constraints in human pose estimation. In Proceedings of the European conference on computer vision (pp. 710–724).
Wu, B., & Nevatia, R. (2007). Detection and tracking of multiple, partially occluded humans by Bayesian combination of edgelet based part detectors. International Journal of Computer Vision, 2, 247–266.
Author information
Authors and Affiliations
Corresponding author
Additional information
An erratum to this article can be found at http://dx.doi.org/10.1007/s11263-011-0440-4
Rights and permissions
About this article
Cite this article
Wang, CW., Hunter, A. Robust Pose Recognition of the Obscured Human Body. Int J Comput Vis 90, 313–330 (2010). https://doi.org/10.1007/s11263-010-0365-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11263-010-0365-3