Eigen et al., 2014 - Google Patents

Depth map prediction from a single image using a multi-scale deep network

Eigen et al., 2014

Document ID: 2789414965913271365
Author: Eigen D; Puhrsch C; Fergus R
Publication year: 2014
Publication venue: Advances in neural information processing systems

External Links

Cited by

Snippet

Predicting depth is an essential component in understanding the 3D geometry of a scene. While for stereo images local correspondence suffices for estimation, finding depth relations from a single image is less straightforward, requiring integration of both global and local …

Continue reading at proceedings.neurips.cc (PDF) (other versions)

238000011176 pooling 0 description 4

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10032—Satellite or aerial image; Remote sensing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/20—Image acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T3/00—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image
- G06T3/0068—Geometric image transformation in the plane of the image, e.g. from bit-mapped to bit-mapped creating a different image for image registration, e.g. elastic snapping

Similar Documents

Publication	Publication Date	Title
Eigen et al.	2014	Depth map prediction from a single image using a multi-scale deep network
Luo et al.	2020	Consistent video depth estimation
Im et al.	2019	Dpsnet: End-to-end deep plane sweep stereo
Sevilla-Lara et al.	2016	Optical flow with semantic segmentation and localized layers
Guerry et al.	2017	Snapnet-r: Consistent 3d multi-view semantic labeling for robotics
Flynn et al.	2016	Deepstereo: Learning to predict new views from the world's imagery
Bar-Haim et al.	2020	Scopeflow: Dynamic scene scoping for optical flow
Yin et al.	2017	Scale recovery for monocular visual odometry using depth estimated with deep convolutional neural fields
Erdogan et al.	2012	Planar segmentation of rgbd images using fast linear fitting and markov chain monte carlo
Muratov et al.	2016	3DCapture: 3D Reconstruction for a Smartphone
Evangelidis et al.	2015	Fusion of range and stereo data for high-resolution scene-modeling
Song et al.	2020	Deep novel view synthesis from colored 3d point clouds
dos Santos Rosa et al.	2019	Sparse-to-continuous: Enhancing monocular depth estimation using occupancy maps
Hirner et al.	2021	FC-DCNN: A densely connected neural network for stereo estimation
Arampatzakis et al.	2023	Monocular depth estimation: A thorough review
Alfarano et al.	2024	Estimating optical flow: A comprehensive review of the state of the art
Fang et al.	2022	Self-supervised learning of depth and ego-motion from videos by alternative training and geometric constraints from 3-d to 2-d
Shi et al.	2019	Self-supervised learning of depth and ego-motion with differentiable bundle adjustment
Wei et al.	2023	Deepsfm: Robust deep iterative refinement for structure from motion
Feng et al.	2021	Mesh reconstruction from aerial images for outdoor terrain mapping using joint 2d-3d learning
Lin et al.	2021	Dense 3D surface reconstruction of large-scale streetscape from vehicle-borne imagery and LiDAR
Anisimov et al.	2019	Rapid light field depth estimation with semi-global matching
Choi et al.	2023	Tmo: Textured mesh acquisition of objects with a mobile device by using differentiable rendering
Harisankar et al.	2020	Unsupervised depth estimation from monocular images for autonomous vehicles
Mazur et al.	2024	SuperPrimitive: Scene Reconstruction at a Primitive Level