Yang et al., 2021 - Google Patents

Self-supervised video object segmentation by motion grouping

Document ID: 8537369841434303242
Author: Yang C; Lamdouar H; Lu E; Zisserman A; Xie W
Publication year: 2021
Publication venue: Proceedings of the IEEE/CVF International Conference on Computer Vision

Snippet

Animals have evolved highly functional visual systems to understand motion, assisting perception even under complex environments. In this paper, we work towards developing a computer vision system able to segment objects by exploiting motion cues, i.e. motion …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation

Similar Documents

Publication	Publication Date	Title
Yang et al.	2021	Self-supervised video object segmentation by motion grouping
Yang et al.	2022	BANMo: Building animatable 3D neural models from many casual videos
Petrovich et al.	2021	Action-conditioned 3D human motion synthesis with transformer VAE
Sarvakar et al.	2023	Facial emotion recognition using convolutional neural networks
Hou et al.	2020	RevealNet: Seeing behind objects in RGB-D scans
Tewari et al.	2023	Diffusion with forward models: Solving stochastic inverse problems without direct supervision
Si et al.	2018	Skeleton-based action recognition with spatial reasoning and temporal stack learning
Srinivas et al.	2016	A taxonomy of deep convolutional neural nets for computer vision
Sun et al.	2017	Lattice long short-term memory for human action recognition
Tatarchenko et al.	2016	Multi-view 3D models from single images with a convolutional network
Yang et al.	2015	Weakly-supervised disentangling with recurrent transformations for 3D view synthesis
Wang et al.	2014	3D human activity recognition with reconfigurable convolutional neural networks
Zhan et al.	2019	Self-supervised learning via conditional motion propagation
Singh et al.	2021	Progress of human action recognition research in the last ten years: a comprehensive survey
Liu et al.	2022	Egocentric activity recognition and localization on a 3D map
Guo et al.	2013	Partially-sparse restricted Boltzmann machine for background modeling and subtraction
Duan et al.	2022	A unified framework for real time motion completion
Kong et al.	2019	Multigrid predictive filter flow for unsupervised learning on videos
Huang et al.	2022	Layered controllable video generation
Ardino et al.	2021	Click to move: Controlling video generation with sparse motion
Wang et al.	2023	Spatio-temporal branching for motion prediction using motion increments
Duffhauss et al.	2022	FusionVAE: A deep hierarchical variational autoencoder for RGB image fusion
Ma et al.	2024	Multi-View Time-Series Hypergraph Neural Network for Action Recognition
Daniel et al.	2022	Unsupervised image representation learning with deep latent particles
Kuai et al.	2023	CAMM: Building category-agnostic and animatable 3D models from monocular videos