Depth Extraction from Video Using Non-parametric Sampling

Kevin Karsch²¹,
Ce Liu²² &
Sing Bing Kang²³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7576))

Included in the following conference series:

European Conference on Computer Vision

10k Accesses
4 Altmetric

Abstract

We describe a technique that automatically generates plausible depth maps from videos using non-parametric depth sampling. We demonstrate our technique in cases where past methods fail (non-translating cameras and dynamic scenes). Our technique is applicable to single images as well as videos. For videos, we use local motion cues to improve the inferred depth maps, while optical flow is used to ensure temporal depth consistency. For training and evaluation, we use a Kinect-based system to collect a large dataset containing stereoscopic videos with known depths. We show that our depth estimation technique outperforms the state-of-the-art on benchmark databases. Our technique can be used to automatically convert a monoscopic video into stereo for 3D visualization, and we demonstrate this through a variety of visually pleasing results for indoor and outdoor scenes, including results from the feature film Charade.

Download to read the full chapter text

Chapter PDF

Depth Transfer: Depth Extraction from Videos Using Nonparametric Sampling

User Directed Multi-view-stereo

IVS3D: An Open Source Framework for Intelligent Video Sampling and Preprocessing to Facilitate 3D Reconstruction

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Zhang, G., Jia, J., Hua, W., Bao, H.: Robust bilayer segmentation and motion/depth estimation with a handheld camera. IEEE TPAMI, 603–617 (2011)
Google Scholar
Hoiem, D., Efros, A., Hebert, M.: Automatic photo pop-up. In: ACM SIGGRAPH (2005)
Google Scholar
Delage, E., Lee, H., Ng, A.: A dynamic Bayesian network model for autonomous 3D reconstruction from a single indoor image. In: CVPR (2006)
Google Scholar
Saxena, A., Chung, S., Ng, A.: Learning depth from single monocular images. In: NIPS (2005)
Google Scholar
Saxena, A., Sun, M., Ng, A.: Make3D: Learning 3D scene structure from a single still image. IEEE TPAMI 31, 824–840 (2009)
Article Google Scholar
Batra, D., Saxena, A.: Learning the right model: Efficient max-margin learning in laplacian crfs. In: CVPR (2012)
Google Scholar
Liu, B., Gould, S., Koller, D.: Single image depth estimation from predicted semantic labels. In: CVPR (2010)
Google Scholar
Li, C., Kowdle, A., Saxena, A., Chen, T.: Towards holistic scene understanding: Feedback enabled cascaded classification models. In: NIPS (2010)
Google Scholar
Wu, C., Frahm, J.M., Pollefeys, M.: Repetition-based dense single-view reconstruction. In: CVPR (2011)
Google Scholar
Han, F., Zhu, S.C.: Bayesian reconstruction of 3D shapes and scenes from a single image. In: IEEE HLK (2003)
Google Scholar
Hassner, T., Basri, R.: Example based 3D reconstruction from single 2D images. In: CVPR Workshop on Beyond Patches (2006)
Google Scholar
Guttmann, M., Wolf, L., Cohen-Or, D.: Semi-automatic stereo extraction from video footage. In: ICCV 2009., pp. 136–142 (2009)
Google Scholar
Ward, B., Kang, S.B., Bennett, E.P.: Depth Director: A system for adding depth to movies. IEEE Comput. Graph. Appl. 31, 36–48 (2011)
Article Google Scholar
Liao, M., Gao, J., Yang, R., Gong, M.: Video stereolization: Combining motion analysis with user interaction. IEEE Transactions on Visualization and Computer Graphics 18, 1079–1088 (2012)
Article Google Scholar
Konrad, J., Wang, M., Ishwar, P.: 2d-to-3d image conversion by learning depth from examples. In: 3DCINE (2012)
Google Scholar
Liu, C., Yuen, J., Torralba, A.: Nonparametric scene parsing: Label transfer via dense scene alignment. In: CVPR (2009)
Google Scholar
Liu, C., Yuen, J., Torralba, A.: SIFT Flow: Dense correspondence across scenes and its applications. IEEE TPAMI 33, 978–994 (2011)
Article Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV 42, 145–175 (2001)
Article MATH Google Scholar
Liu, C.: Beyond pixels: Exploring new representations and applications for motion analysis. PhD thesis. MIT (2009)
Google Scholar
Wang, O., Lang, M., Frei, M., Hornung, A., Smolic, A., Gross, M.: StereoBrush: Interactive 2D to 3D conversion using discontinuous warps. In: SBIM (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Illinois at Urbana-Champaign, USA
Kevin Karsch
Microsoft Research, New England, USA
Ce Liu
Microsoft Research, USA
Sing Bing Kang

Authors

Kevin Karsch
View author publications
You can also search for this author in PubMed Google Scholar
Ce Liu
View author publications
You can also search for this author in PubMed Google Scholar
Sing Bing Kang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd., CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Karsch, K., Liu, C., Kang, S.B. (2012). Depth Extraction from Video Using Non-parametric Sampling. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7576. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33715-4_56

Download citation

DOI: https://doi.org/10.1007/978-3-642-33715-4_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33714-7
Online ISBN: 978-3-642-33715-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics