Yin et al., 2018 - Google Patents

GeoNet: Unsupervised learning of dense depth, optical flow and camera pose

Document ID
1957429302279516892
Author
Yin Z
Shi J
Publication year
2018
Publication venue
Proceedings of the IEEE conference on computer vision and pattern recognition

Snippet

We propose GeoNet, a jointly unsupervised learning framework for monocular depth, optical flow and ego-motion estimation from videos. The three components are coupled by the nature of 3D scene geometry, jointly learned by our framework in an end-to-end manner …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10024 Color image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30781 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F17/30784 Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
    • G06F17/30799 Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10016 Video; Image sequence
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20112 Image segmentation details
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30 Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30244 Information retrieval; Database structures therefor; File system structures therefor in image databases
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62 Methods or arrangements for recognition using electronic means
    • G06K9/6217 Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/20 Analysis of motion
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00624 Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36 Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46 Extraction of features or characteristics of the image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06K RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00 Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221 Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00 Animation

Similar Documents

Publication    Title
Yin et al. GeoNet: Unsupervised learning of dense depth, optical flow and camera pose
Wang et al. Multi-view stereo in the deep learning era: A comprehensive review
Yang et al. Deep virtual stereo odometry: Leveraging deep depth prediction for monocular direct sparse odometry
Zou et al. DF-Net: Unsupervised joint learning of depth and flow using cross-task consistency
Shu et al. Feature-metric loss for self-supervised learning of depth and egomotion
Yang et al. LEGO: Learning edge with geometry all at once by watching videos
Cheng et al. Learning depth with convolutional spatial propagation network
Jiang et al. SENSE: A shared encoder network for scene-flow estimation
Gan et al. Monocular depth estimation with affinity, vertical pooling, and label enhancement
Aleotti et al. Learning end-to-end scene flow by distilling single tasks knowledge
Zhou et al. Self-distilled feature aggregation for self-supervised monocular depth estimation
Zhou et al. Unsupervised learning of monocular depth estimation with bundle adjustment, super-resolution and clip loss
Meng et al. CORNet: Context-based ordinal regression network for monocular depth estimation
Poggi et al. Continual adaptation for deep stereo
Gurram et al. Monocular depth estimation by learning from heterogeneous datasets
Han et al. TransDSSL: Transformer based depth estimation via self-supervised learning
Pilzer et al. Progressive fusion for unsupervised binocular depth estimation using cycled networks
Sun et al. MUNet: Motion uncertainty-aware semi-supervised video object segmentation
Song et al. Depth estimation from a single image using guided deep network
Liu et al. A survey on deep learning methods for scene flow estimation
Du et al. SRH-Net: Stacked recurrent hourglass network for stereo matching
Zheng et al. Self-supervised monocular depth estimation based on combining convolution and multilayer perceptron
Jia et al. Bidirectional stereo matching network with double cost volumes
Wang et al. NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation
Fan et al. Learning Bilateral Cost Volume for Rolling Shutter Temporal Super-Resolution