Petrovich et al., 2022 - Google Patents
TEMOS: Generating diverse human motions from textual descriptionsPetrovich et al., 2022
View PDF- Document ID
- 906697653407689869
- Author
- Petrovich M
- Black M
- Varol G
- Publication year
- Publication venue
- European Conference on Computer Vision
External Links
Snippet
We address the problem of generating diverse 3D human motions from textual descriptions. This challenging task requires joint modeling of both modalities: understanding and extracting useful human-centric information from the text, and then generating plausible and …
- 241000282414 Homo sapiens 0 title abstract description 55
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce, e.g. shopping or e-commerce
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Petrovich et al. | TEMOS: Generating diverse human motions from textual descriptions | |
Huynh-The et al. | Artificial intelligence for the metaverse: A survey | |
Athanasiou et al. | Teach: Temporal action composition for 3d humans | |
Cai et al. | Learning progressive joint propagation for human motion prediction | |
Zhang et al. | Couch: Towards controllable human-chair interactions | |
Raghu et al. | A survey of deep learning for scientific discovery | |
Ferreira et al. | Learning to dance: A graph convolutional adversarial network to generate realistic dance motions from audio | |
Lucas et al. | Posegpt: Quantization-based 3d human motion generation and forecasting | |
Jain et al. | GAN-Poser: an improvised bidirectional GAN model for human motion prediction | |
Lin et al. | Multimodal transformer with variable-length memory for vision-and-language navigation | |
Ribeiro de Oliveira et al. | Virtual reality solutions employing artificial intelligence methods: A systematic literature review | |
Huang et al. | Layered controllable video generation | |
Christen et al. | Diffh2o: Diffusion-based synthesis of hand-object interactions from textual descriptions | |
Fuest et al. | Diffusion models and representation learning: A survey | |
Zou et al. | ParCo: Part-Coordinating Text-to-Motion Synthesis | |
Chi et al. | M2d2m: Multi-motion generation from text with discrete diffusion models | |
Zhang et al. | Adversarial synthesis of human pose from text | |
Zhang et al. | Infinimotion: Mamba boosts memory in transformer for arbitrary long motion generation | |
Buckchash et al. | Variational conditioning of deep recurrent networks for modeling complex motion dynamics | |
Diko et al. | Semantically guided representation learning for action anticipation | |
Sudhakar et al. | Controlling the world by sleight of hand | |
Wu et al. | Disentangling stochastic pde dynamics for unsupervised video prediction | |
Niskanen et al. | Latest trends in artificial intelligence technology: A scoping review | |
Tang et al. | Prompting Future Driven Diffusion Model for Hand Motion Prediction | |
Gupta et al. | Pre-trained text-to-image diffusion models are versatile representation learners for control |