Nothing Special   »   [go: up one dir, main page]

Yang et al., 2021 - Google Patents

Self-supervised video object segmentation by motion grouping

Yang et al., 2021

View PDF
Document ID
8537369841434303242
Author
Yang C
Lamdouar H
Lu E
Zisserman A
Xie W
Publication year
Publication venue
Proceedings of the IEEE/CVF International Conference on Computer Vision

External Links

Snippet

Animals have evolved highly functional visual systems to understand motion, assisting perception even under complex environments. In this paper, we work towards developing a computer vision system able to segment objects by exploiting motion cues, ie motion …
Continue reading at openaccess.thecvf.com (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6267Classification techniques
    • G06K9/6268Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06K9/6232Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
    • G06K9/6247Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computer systems utilising knowledge based models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation

Similar Documents

Publication Publication Date Title
Yang et al. Self-supervised video object segmentation by motion grouping
Yang et al. Banmo: Building animatable 3d neural models from many casual videos
Petrovich et al. Action-conditioned 3d human motion synthesis with transformer vae
Sarvakar et al. Facial emotion recognition using convolutional neural networks
Hou et al. Revealnet: Seeing behind objects in rgb-d scans
Tewari et al. Diffusion with forward models: Solving stochastic inverse problems without direct supervision
Si et al. Skeleton-based action recognition with spatial reasoning and temporal stack learning
Srinivas et al. A taxonomy of deep convolutional neural nets for computer vision
Sun et al. Lattice long short-term memory for human action recognition
Tatarchenko et al. Multi-view 3d models from single images with a convolutional network
Yang et al. Weakly-supervised disentangling with recurrent transformations for 3d view synthesis
Wang et al. 3D human activity recognition with reconfigurable convolutional neural networks
Zhan et al. Self-supervised learning via conditional motion propagation
Singh et al. Progress of human action recognition research in the last ten years: a comprehensive survey
Liu et al. Egocentric activity recognition and localization on a 3d map
Guo et al. Partially-sparse restricted boltzmann machine for background modeling and subtraction
Duan et al. A unified framework for real time motion completion
Kong et al. Multigrid predictive filter flow for unsupervised learning on videos
Huang et al. Layered controllable video generation
Ardino et al. Click to move: Controlling video generation with sparse motion
Wang et al. Spatio-temporal branching for motion prediction using motion increments
Duffhauss et al. FusionVAE: A deep hierarchical variational autoencoder for RGB image fusion
Ma et al. Multi-View Time-Series Hypergraph Neural Network for Action Recognition
Daniel et al. Unsupervised image representation learning with deep latent particles
Kuai et al. Camm: Building category-agnostic and animatable 3d models from monocular videos