Menapace et al., 2022

Playable environments: Video manipulation in space and time

Document ID: 1770031922644467021
Author: Menapace W; Lathuilière S; Siarohin A; Theobalt C; Tulyakov S; Golyanik V; Ricci E
Publication year: 2022
Publication venue: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition

Snippet

Abstract: We present Playable Environments, a new representation for interactive video generation and manipulation in space and time. With a single image at inference time, our novel framework allows the user to move objects in 3D while generating a video by …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/04—Texture mapping
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00362—Recognising human body or animal bodies, e.g. vehicle occupant, pedestrian; Recognising body parts, e.g. hand
- G06K9/00369—Recognition of whole body, e.g. static pedestrian or occupant recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding, e.g. from bit-mapped to non bit-mapped
- G06T9/001—Model-based coding, e.g. wire frame
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2210/00—Indexing scheme for image generation or computer graphics
- G06T2210/08—Bandwidth reduction
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content

Similar Documents

Publication	Publication Date	Title
Zhao et al.	2022	Humannerf: Efficiently generated human radiance field from sparse inputs
Park et al.	2017	Transformation-grounded image generation network for novel 3d view synthesis
Chiang et al.	2022	Stylizing 3d scene via implicit representation and hypernetwork
Wang et al.	2018	Video-to-video synthesis
Liu et al.	2021	Generative adversarial networks for image and video synthesis: Algorithms and applications
Balakrishnan et al.	2018	Synthesizing images of humans in unseen poses
US10019826B2 (en)	2018-07-10	Real-time high-quality facial performance capture
Yu et al.	2023	Monohuman: Animatable human neural field from monocular video
Li et al.	2022	Ganimator: Neural motion synthesis from a single sequence
EP3602494B1 (en)	2022-01-12	Robust mesh tracking and fusion by using part-based key frames and priori model
Kim et al.	2023	Neuralfield-ldm: Scene generation with hierarchical latent diffusion models
Menapace et al.	2022	Playable environments: Video manipulation in space and time
Pang et al.	2024	Ash: Animatable gaussian splats for efficient and photoreal human rendering
Weng et al.	2020	Vid2actor: Free-viewpoint animatable person synthesis from video in the wild
Zakharkin et al.	2021	Point-based modeling of human clothing
Bermano et al.	2014	Facial performance enhancement using dynamic shape space analysis
Su et al.	2022	Danbo: Disentangled articulated neural body representations via graph neural networks
Yang et al.	2023	Synbody: Synthetic dataset with layered human models for 3d human perception and modeling
CN115298708A (en)	2022-11-04	Multi-view neural human body rendering
Moser et al.	2021	Semi-supervised video-driven facial animation transfer for production
Wang et al.	2023	Seal-3d: Interactive pixel-level editing for neural radiance fields
Chen et al.	2023	TeSTNeRF: Text-Driven 3D Style Transfer via Cross-Modal Learning
Sun et al.	2022	Human 3d avatar modeling with implicit neural representation: A brief survey
Bao et al.	2024	3d gaussian splatting: Survey, technologies, challenges, and opportunities
Gomes et al.	2020	Do as I do: Transferring human motion and appearance between monocular videos with spatial and temporal constraints