Yin et al., 2018 - Google Patents
Geonet: Unsupervised learning of dense depth, optical flow and camera poseYin et al., 2018
View PDF- Document ID
- 1957429302279516892
- Author
- Yin Z
- Shi J
- Publication year
- Publication venue
- Proceedings of the IEEE conference on computer vision and pattern recognition
External Links
Snippet
We propose GeoNet, a jointly unsupervised learning framework for monocular depth, optical flow and ego-motion estimation from videos. The three components are coupled by the nature of 3D scene geometry, jointly learned by our framework in an end-to-end manner …
- 230000003287 optical 0 title abstract description 18
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yin et al. | Geonet: Unsupervised learning of dense depth, optical flow and camera pose | |
Wang et al. | Multi-view stereo in the deep learning era: A comprehensive review | |
Yang et al. | Deep virtual stereo odometry: Leveraging deep depth prediction for monocular direct sparse odometry | |
Zou et al. | Df-net: Unsupervised joint learning of depth and flow using cross-task consistency | |
Shu et al. | Feature-metric loss for self-supervised learning of depth and egomotion | |
Yang et al. | Lego: Learning edge with geometry all at once by watching videos | |
Cheng et al. | Learning depth with convolutional spatial propagation network | |
Jiang et al. | Sense: A shared encoder network for scene-flow estimation | |
Gan et al. | Monocular depth estimation with affinity, vertical pooling, and label enhancement | |
Aleotti et al. | Learning end-to-end scene flow by distilling single tasks knowledge | |
Zhou et al. | Self-distilled feature aggregation for self-supervised monocular depth estimation | |
Zhou et al. | Unsupervised learning of monocular depth estimation with bundle adjustment, super-resolution and clip loss | |
Meng et al. | CORNet: Context-based ordinal regression network for monocular depth estimation | |
Poggi et al. | Continual adaptation for deep stereo | |
Gurram et al. | Monocular depth estimation by learning from heterogeneous datasets | |
Han et al. | Transdssl: Transformer based depth estimation via self-supervised learning | |
Pilzer et al. | Progressive fusion for unsupervised binocular depth estimation using cycled networks | |
Sun et al. | Munet: Motion uncertainty-aware semi-supervised video object segmentation | |
Song et al. | Depth estimation from a single image using guided deep network | |
Liu et al. | A survey on deep learning methods for scene flow estimation | |
Du et al. | Srh-net: Stacked recurrent hourglass network for stereo matching | |
Zheng et al. | Self-supervised monocular depth estimation based on combining convolution and multilayer perceptron | |
Jia et al. | Bidirectional stereo matching network with double cost volumes | |
Wang et al. | NVDS $^{\mathbf {+}} $: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation | |
Fan et al. | Learning Bilateral Cost Volume for Rolling Shutter Temporal Super-Resolution |