Yang et al., 2021 - Google Patents
Self-supervised video object segmentation by motion groupingYang et al., 2021
View PDF- Document ID
- 8537369841434303242
- Author
- Yang C
- Lamdouar H
- Lu E
- Zisserman A
- Xie W
- Publication year
- Publication venue
- Proceedings of the IEEE/CVF International Conference on Computer Vision
External Links
Snippet
Animals have evolved highly functional visual systems to understand motion, assisting perception even under complex environments. In this paper, we work towards developing a computer vision system able to segment objects by exploiting motion cues, ie motion …
- 230000011218 segmentation 0 title abstract description 48
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yang et al. | Self-supervised video object segmentation by motion grouping | |
Yang et al. | Banmo: Building animatable 3d neural models from many casual videos | |
Petrovich et al. | Action-conditioned 3d human motion synthesis with transformer vae | |
Sarvakar et al. | Facial emotion recognition using convolutional neural networks | |
Hou et al. | Revealnet: Seeing behind objects in rgb-d scans | |
Tewari et al. | Diffusion with forward models: Solving stochastic inverse problems without direct supervision | |
Si et al. | Skeleton-based action recognition with spatial reasoning and temporal stack learning | |
Srinivas et al. | A taxonomy of deep convolutional neural nets for computer vision | |
Sun et al. | Lattice long short-term memory for human action recognition | |
Tatarchenko et al. | Multi-view 3d models from single images with a convolutional network | |
Yang et al. | Weakly-supervised disentangling with recurrent transformations for 3d view synthesis | |
Wang et al. | 3D human activity recognition with reconfigurable convolutional neural networks | |
Zhan et al. | Self-supervised learning via conditional motion propagation | |
Singh et al. | Progress of human action recognition research in the last ten years: a comprehensive survey | |
Liu et al. | Egocentric activity recognition and localization on a 3d map | |
Guo et al. | Partially-sparse restricted boltzmann machine for background modeling and subtraction | |
Duan et al. | A unified framework for real time motion completion | |
Kong et al. | Multigrid predictive filter flow for unsupervised learning on videos | |
Huang et al. | Layered controllable video generation | |
Ardino et al. | Click to move: Controlling video generation with sparse motion | |
Wang et al. | Spatio-temporal branching for motion prediction using motion increments | |
Duffhauss et al. | FusionVAE: A deep hierarchical variational autoencoder for RGB image fusion | |
Ma et al. | Multi-View Time-Series Hypergraph Neural Network for Action Recognition | |
Daniel et al. | Unsupervised image representation learning with deep latent particles | |
Kuai et al. | Camm: Building category-agnostic and animatable 3d models from monocular videos |