Playable environments: Video manipulation in space and time
Menapace et al., 2022
- Document ID
- 1770031922644467021
- Author
- Menapace W
- Lathuilière S
- Siarohin A
- Theobalt C
- Tulyakov S
- Golyanik V
- Ricci E
- Publication year
- 2022
- Publication venue
- Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
Snippet
Abstract: We present Playable Environments, a new representation for interactive video generation and manipulation in space and time. With a single image at inference time, our novel framework allows the user to move objects in 3D while generating a video by …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/04—Texture mapping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00362—Recognising human body or animal bodies, e.g. vehicle occupant, pedestrian; Recognising body parts, e.g. hand
- G06K9/00369—Recognition of whole body, e.g. static pedestrian or occupant recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
- G06T9/001—Model-based coding, e.g. wire frame
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2210/00—Indexing scheme for image generation or computer graphics
- G06T2210/08—Bandwidth reduction
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhao et al. | | Humannerf: Efficiently generated human radiance field from sparse inputs |
Park et al. | | Transformation-grounded image generation network for novel 3d view synthesis |
Chiang et al. | | Stylizing 3d scene via implicit representation and hypernetwork |
Wang et al. | | Video-to-video synthesis |
Liu et al. | | Generative adversarial networks for image and video synthesis: Algorithms and applications |
Balakrishnan et al. | | Synthesizing images of humans in unseen poses |
US10019826B2 (en) | | Real-time high-quality facial performance capture |
Yu et al. | | Monohuman: Animatable human neural field from monocular video |
Li et al. | | Ganimator: Neural motion synthesis from a single sequence |
EP3602494B1 (en) | | Robust mesh tracking and fusion by using part-based key frames and priori model |
Kim et al. | | Neuralfield-ldm: Scene generation with hierarchical latent diffusion models |
Menapace et al. | | Playable environments: Video manipulation in space and time |
Pang et al. | | Ash: Animatable gaussian splats for efficient and photoreal human rendering |
Weng et al. | | Vid2actor: Free-viewpoint animatable person synthesis from video in the wild |
Zakharkin et al. | | Point-based modeling of human clothing |
Bermano et al. | | Facial performance enhancement using dynamic shape space analysis |
Su et al. | | Danbo: Disentangled articulated neural body representations via graph neural networks |
Yang et al. | | Synbody: Synthetic dataset with layered human models for 3d human perception and modeling |
CN115298708A (en) | | Multi-view neural human body rendering |
Moser et al. | | Semi-supervised video-driven facial animation transfer for production |
Wang et al. | | Seal-3d: Interactive pixel-level editing for neural radiance fields |
Chen et al. | | TeSTNeRF: Text-Driven 3D Style Transfer via Cross-Modal Learning |
Sun et al. | | Human 3d avatar modeling with implicit neural representation: A brief survey |
Bao et al. | | 3d gaussian splatting: Survey, technologies, challenges, and opportunities |
Gomes et al. | | Do as I do: Transferring human motion and appearance between monocular videos with spatial and temporal constraints |