WO2019242454A1 - Object modeling and movement method, apparatus, and device - Google Patents
Object modeling and movement method, apparatus, and device
- Publication number
- WO2019242454A1 (application PCT/CN2019/088480)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- target
- model
- movement mode
- module
- bone
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/04—Texture mapping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G06T17/10—Constructive solid geometry [CSG] using solid primitives, e.g. cylinders, cubes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G06T17/20—Finite element generation, e.g. wire-frame surface description, tesselation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G06T17/20—Finite element generation, e.g. wire-frame surface description, tesselation
- G06T17/205—Re-meshing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T19/00—Manipulating 3D models or images for computer graphics
- G06T19/20—Editing of 3D images, e.g. changing shapes or colours, aligning objects or positioning parts
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
- G06T7/507—Depth or shape recovery from shading
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/50—Depth or shape recovery
- G06T7/55—Depth or shape recovery from multiple images
- G06T7/579—Depth or shape recovery from multiple images from motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/64—Three-dimensional objects
- G06V20/653—Three-dimensional objects by matching three-dimensional models, e.g. conformal mapping of Riemann surfaces
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
- G06V40/165—Detection; Localisation; Normalisation using facial parts and geometric relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G06T2200/24—Indexing scheme for image data processing or generation, in general involving graphical user interfaces [GUIs]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10028—Range image; Depth image; 3D point clouds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20212—Image combination
- G06T2207/20221—Image fusion; Image merging
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2219/00—Indexing scheme for manipulating 3D models or images for computer graphics
- G06T2219/20—Indexing scheme for editing of 3D models
- G06T2219/2016—Rotation, translation, scaling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2219/00—Indexing scheme for manipulating 3D models or images for computer graphics
- G06T2219/20—Indexing scheme for editing of 3D models
- G06T2219/2021—Shape modification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2219/00—Indexing scheme for manipulating 3D models or images for computer graphics
- G06T2219/20—Indexing scheme for editing of 3D models
- G06T2219/2024—Style variation
Definitions
- The present invention relates to the technical field of terminals, and in particular to a method, an apparatus, and a device for object modeling and movement.
- Methods for acquiring images include various still cameras, video cameras, and scanners. These methods usually yield only a planar image of the object, that is, two-dimensional information. In many fields, such as machine vision, face shape detection, physical profiling, automatic processing, product quality control, and biomedicine, the three-dimensional information of an object is essential. 3D scanning technology emerged to meet this need.
- The commonly used equipment is a 3D scanner, a scientific instrument used to detect and analyze the shape (geometric structure) and appearance data (such as color and surface albedo) of objects or environments in the real world.
- The purpose of a 3D scanner is to create a point cloud of the geometric surface of the object. These points can be interpolated into the surface shape of the object; the denser the point cloud, the more accurate the resulting model (this process is also called 3D reconstruction). If the scanner can also capture surface color, a texture map can be pasted onto the reconstructed surface, which is called texture mapping.
- 3D scanners in the prior art are complicated to use and require professional skills to operate, and their application scenarios are relatively limited. How to make 3D scanning technology usable by ordinary users is therefore an urgent problem to be solved.
- Embodiments of the present invention provide a method, an apparatus, and a device for object modeling and movement, which can scan an object of interest anytime and anywhere and give it dynamic effects, enhancing fun and playability and improving user stickiness.
- An embodiment of the present invention provides an object modeling and movement method applied to a mobile terminal. The mobile terminal includes a color camera and a depth camera, and the color camera and the depth camera are located on the same side of the mobile terminal, either front or back.
- The method specifically includes: using the color camera and the depth camera to perform a panoramic scan of the target object to obtain a 3D model of the target object; obtaining a target bone model; fusing the target bone model with the 3D model of the target object; obtaining a target movement mode; and controlling the bone model according to the target movement mode, so that the 3D model of the target object moves according to the target movement mode.
- An embodiment of the present invention provides an object modeling and movement apparatus applied to a mobile terminal. The mobile terminal includes a color camera and a depth camera, and the color camera and the depth camera are located on the same side of the mobile terminal, either front or back.
- The apparatus comprises:
- a scanning module, used to obtain a 3D model of the target object when the color camera and the depth camera perform a panoramic scan of the target object;
- a first acquisition module, used to acquire the target bone model;
- a fusion module, used to fuse the target bone model with the 3D model of the target object;
- a second acquisition module, used to acquire the target movement mode;
- a motion module, used to control the bone model according to the target movement mode so that the 3D model of the target object moves according to the target movement mode.
- The mobile terminal can implement an integrated design covering scanning of the object, 3D reconstruction, bone assembly, and preset animation display. Users do not need professional, cumbersome, and complicated equipment for scanning, and do not need to move to a PC for complex modeling and animation processing. These functions are integrated and provided to the user, so that the whole series of operations can be carried out easily on a mobile terminal.
- In this way, any "static object (or near-static object)" around the user can be made more lively and vivid, which increases the user's interest in using the terminal and improves the user experience.
- the depth camera may use a TOF module.
- the depth camera may use a structured light module.
- the field angle of the depth camera ranges from 40 degrees to 80 degrees.
- The infrared transmitting power of the depth camera can be selected in the range of 50 mW to 400 mW; higher power may be used in special applications, for example under very strong light.
- the scanning distance when scanning the object is between 20 cm and 80 cm, and the scanning distance can be understood as the distance from the depth camera to the target object.
- the shooting frame rate of the depth camera during the scanning process can be selected to be not less than 25 fps.
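The operating ranges listed above (field of view, infrared power, scanning distance, frame rate) can be gathered into a simple pre-scan check. The following is a minimal sketch only; the class `ScanSettings` and the function `check_scan_settings` are hypothetical names introduced for illustration and do not appear in the patent. Because the text allows higher infrared power in special applications, the check reports deviations rather than rejecting the settings.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ScanSettings:
    """Hypothetical container for the depth-camera scan parameters."""
    fov_deg: float         # field angle of the depth camera, in degrees
    ir_power_mw: float     # infrared transmitting power, in milliwatts
    distance_cm: float     # distance from depth camera to target object
    frame_rate_fps: float  # shooting frame rate during scanning


def check_scan_settings(s: ScanSettings) -> List[str]:
    """Return a list of deviations from the ranges stated in the text."""
    problems: List[str] = []
    if not 40 <= s.fov_deg <= 80:
        problems.append("field angle outside 40-80 degrees")
    if not 50 <= s.ir_power_mw <= 400:
        # The text permits higher power in special applications,
        # so this is reported as a deviation, not a hard failure.
        problems.append("infrared power outside 50-400 mW")
    if not 20 <= s.distance_cm <= 80:
        problems.append("scanning distance outside 20-80 cm")
    if s.frame_rate_fps < 25:
        problems.append("frame rate below 25 fps")
    return problems
```

For example, `check_scan_settings(ScanSettings(60, 200, 50, 30))` returns an empty list, since every value lies inside the stated ranges.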
- The skeletal model can be calculated from the 3D model by a series of algorithms.
- A bone model production library may be provided to the user, containing elements such as line segments and points, where the line segments represent bones and the points represent joint nodes.
- The skeletal model is uploaded to the cloud or stored locally.
- The method may be performed by a first obtaining module; in hardware, it may be implemented by a processor calling program instructions in a memory.
- A more open production library can be provided to the user, in which the line segments and points are designed entirely freely by the user, where the line segments represent bones and the points represent joint nodes.
- The skeletal model is uploaded to the cloud or stored locally.
- The method may be performed by a first obtaining module; in hardware, it may be implemented by a processor calling program instructions in a memory.
- The bone model that best matches the shape of the target object may be selected from at least one preset bone model as the target bone model.
- Preset bone models can be stored online, in the cloud, or locally.
- For example, a chicken bone model, a dog bone model, and a fish bone model are stored locally.
- If the target object is chicken-shaped, the system identifies the chicken bone model as the target bone model through shape recognition. Similarity criteria include, but are not limited to, bone shape, bone length, bone thickness, number of bones, and bone composition.
- The method may be performed by a first obtaining module; in hardware, it may be implemented by a processor calling program instructions in a memory.
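Selecting the preset bone model with the highest degree of matching can be sketched as a scoring loop over the stored presets. The patent names the criteria (bone shape, length, thickness, number of bones, composition) but does not give a scoring formula, so the relative-difference score below is an assumption, and `BoneModel`, `match_score`, and `select_bone_model` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class BoneModel:
    """Hypothetical preset: a name plus numeric descriptors."""
    name: str
    features: Dict[str, float]  # e.g. bone count, mean bone length


def match_score(target: Dict[str, float], candidate: Dict[str, float]) -> float:
    """Similarity in [0, 1]; 1.0 means identical on all shared features.

    Assumed scoring rule: mean relative difference over shared features.
    """
    keys = target.keys() & candidate.keys()
    if not keys:
        return 0.0
    total = 0.0
    for k in keys:
        scale = max(abs(target[k]), abs(candidate[k]), 1e-9)
        total += abs(target[k] - candidate[k]) / scale
    return 1.0 / (1.0 + total / len(keys))


def select_bone_model(target: Dict[str, float],
                      presets: List[BoneModel]) -> BoneModel:
    """Return the preset whose features best match the scanned object."""
    if not presets:
        raise ValueError("at least one preset bone model is required")
    return max(presets, key=lambda m: match_score(target, m.features))
```

With presets for a chicken, a dog, and a fish, a scanned object whose descriptors are closest to the chicken preset would be assigned the chicken bone model, as in the example given in the text.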
- A selection instruction from the user may be received, the selection instruction being used to select the target bone model from at least one preset bone model; these preset models are stored locally or obtained from the cloud or the network.
- The method may be performed by a first obtaining module; in hardware, it may be implemented by a processor calling program instructions in a memory.
- The movement mode of a first object may be obtained and used as the target movement mode.
- The first object may be an object that is currently moving in real time; the movement mode may also be that of an object that was photographed and saved in the past, or a preset movement mode of a specific object.
- This method may be completed by a second acquisition module; on hardware, it may be implemented by a processor calling a program instruction in a memory.
- The preset target movement mode can be a complete set of movement modes or a movement mode corresponding to a user operation. For example, if the user beckons to the "resurrected object" displayed on the terminal, the object can move according to a preset animation that responds to the beckoning. More generally, the user can input a preset interactive action to the terminal; the terminal obtains the corresponding response movement mode according to the interactive action and controls the 3D model of the object to move according to that response movement mode.
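The mapping from a preset interactive action to a response movement mode can be pictured as a lookup table. This is an illustrative sketch only; the table contents and the names `RESPONSE_MODES` and `response_movement_mode` are hypothetical and not taken from the patent.

```python
from typing import Dict, Optional

# Hypothetical table: interactive action -> response movement mode.
RESPONSE_MODES: Dict[str, str] = {
    "beckon": "walk_toward_user",
    "wave": "wave_back",
}


def response_movement_mode(action: str,
                           table: Optional[Dict[str, str]] = None) -> Optional[str]:
    """Return the response movement mode for an action, or None if unknown."""
    lookup = RESPONSE_MODES if table is None else table
    # Normalize the action label so "Beckon" and " beckon " both match.
    return lookup.get(action.strip().lower())
```

An unrecognized action yields `None`, leaving the caller free to fall back to a default animation or to ignore the input.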
- The movement mode can be created by the user with animation production software; this software can be a tool set embedded in the system or included in the app used for scanning, reconstruction, and movement.
- This method may be completed by a second acquisition module; on hardware, it may be implemented by a processor calling a program instruction in a memory.
- The target movement mode may be selected from a plurality of pre-stored movement modes as the one whose attributes best match the physical attributes of the target object.
- This method may be completed by a second acquisition module; on hardware, it may be implemented by a processor calling a program instruction in a memory.
- The movement mode may also be based on the skeletal model of the target object (which can be obtained by any method of the previous step): the system or the user independently designs a movement for the skeletal model to obtain the target movement mode.
- This approach yields the animation best suited to the 3D model of the object in subsequent use.
- This method may be completed by a second acquisition module; on hardware, it may be implemented by a processor calling a program instruction in a memory.
- The 3D model can be stored locally or in the cloud, so that it can be called up directly at a later time, and the bone assembly can be freely selected.
- the animation can be automatically played by the mobile terminal, and it can also be controlled by the user inputting operation instructions.
- the skinning operation is used to determine a change in the position of a point on the surface of the 3D model according to the movement of the skeletal model; and cause the 3D model of the target object to follow the skeletal model for movement.
- This method can be completed by a motion module; on hardware, it can be implemented by a processor calling program instructions in a memory.
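The skinning operation described above moves each surface point according to the bones that influence it. The patent does not name a specific algorithm; the sketch below assumes linear blend skinning, a common choice, and works in 2D to stay short. The function names and the bone-transform representation (rotation angle, pivot, translation) are assumptions made for illustration.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float]
# Assumed bone transform: (rotation angle in radians, pivot, translation).
BoneTransform = Tuple[float, Point, Point]


def transform_point(p: Point, angle: float, pivot: Point,
                    translation: Point) -> Point:
    """Rotate p about pivot by angle, then translate."""
    c, s = math.cos(angle), math.sin(angle)
    x, y = p[0] - pivot[0], p[1] - pivot[1]
    return (pivot[0] + c * x - s * y + translation[0],
            pivot[1] + s * x + c * y + translation[1])


def skin_vertex(rest_pos: Point, weights: Sequence[float],
                bones: Sequence[BoneTransform]) -> Point:
    """Linear blend skinning: weighted sum of per-bone transformed positions.

    weights[i] is the influence of bones[i] on this vertex; the weights
    are expected to sum to 1.
    """
    if len(weights) != len(bones):
        raise ValueError("one weight per bone is required")
    x = y = 0.0
    for w, (angle, pivot, translation) in zip(weights, bones):
        px, py = transform_point(rest_pos, angle, pivot, translation)
        x += w * px
        y += w * py
    return (x, y)
```

A vertex bound entirely to one bone follows that bone rigidly, while a vertex near a joint with split weights lands between the positions dictated by the two bones, which is what lets the surface of the 3D model bend smoothly as the skeleton moves.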
- The degree of human-computer interaction is enhanced, giving users more room for free play, allowing them to take part deeply in the process of bringing objects to life, and letting them use their imagination for added fun.
- The processor can call the programs and instructions in the memory for the corresponding processing, such as enabling the camera, collecting images, generating 3D models, obtaining bone models or animations, storing bone models or animations, adding special effects, and interacting with users.
- An embodiment of the present invention provides a terminal device that includes a memory, a processor, a bus, a depth camera, and a color camera. The color camera and the depth camera are located on the same side of the mobile terminal. The memory, the depth camera, the color camera, and the processor are connected via the bus. The depth camera and the color camera are used to perform a panoramic scan of the target object under the control of the processor; the memory is used to store computer programs and instructions; the processor is used to call the computer programs and instructions so that the terminal device executes any one of the possible design methods described above.
- The terminal device further includes an antenna system, which sends and receives wireless communication signals under the control of the processor to realize wireless communication with a mobile communication network.
- The mobile communication network includes one or more of the following: GSM network, CDMA network, 3G network, 4G network, 5G network, FDMA, TDMA, PDC, TACS, AMPS, WCDMA, TD-SCDMA, Wi-Fi, and LTE networks.
- The invention allows scanning of an object, 3D reconstruction, bone assembly, and preset animation display to be completed on a single terminal, brings static objects to life, and increases the user's interest in using mobile terminals.
- FIG. 1 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
- FIG. 2 is a flowchart of an object modeling movement method according to an embodiment of the present invention.
- FIG. 3 shows the main process of scanning an object to implement animation in an embodiment of the present invention.
- FIG. 4 is a schematic diagram of structured light according to an embodiment of the present invention.
- FIG. 5 is a schematic diagram of a TOF according to an embodiment of the present invention.
- FIG. 6 is a flowchart of a method of meshing + texture mapping according to an embodiment of the present invention.
- FIG. 7 is a flowchart of a specific meshing implementation scheme in an embodiment of the present invention.
- FIG. 8 is a flowchart of a specific texture mapping implementation scheme according to an embodiment of the present invention.
- FIG. 9 is a specific example of meshing + texture mapping in an embodiment of the present invention.
- FIG. 10 is a flowchart of a specific skeletal assembly scheme according to an embodiment of the present invention.
- FIG. 11 is a specific animation flowchart in an embodiment of the present invention.
- FIG. 12 is a schematic diagram of an object modeling motion device according to an embodiment of the present invention.
- The mobile terminal may be a device that provides users with photographing and/or data connectivity, a handheld device with a wireless connection function, or another processing device connected to a wireless modem, for example a digital camera, an SLR camera, or a smartphone. It may also be another smart device with camera and display functions, such as a wearable device, a tablet computer, a PDA (Personal Digital Assistant), a drone, or an aerial camera.
- FIG. 1 shows a schematic diagram of an optional hardware structure of the terminal 100.
- The terminal 100 may include components such as a radio frequency unit 110, a memory 120, an input unit 130, a display unit 140, a photographing unit 150, an audio circuit 160, a speaker 161, a microphone 162, a processor 170, an external interface 180, and a power supply 190.
- the radio frequency unit 110 may be used to receive and transmit information or to receive and send signals during a call.
- In particular, downlink information from the base station is received and then processed by the processor 170; in addition, uplink data is transmitted to the base station.
- the RF circuit includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a Low Noise Amplifier (LNA), a duplexer, and the like.
- the radio frequency unit 110 can also communicate with network devices and other devices through wireless communication.
- The wireless communication may use any communication standard or protocol, including but not limited to Global System for Mobile Communications (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), e-mail, and Short Messaging Service (SMS).
- the memory 120 may be configured to store instructions and data.
- The memory 120 may mainly include an instruction storage area and a data storage area. The data storage area may store the relationship between a joint touch gesture and an application function.
- The instruction storage area may store software units such as an operating system, applications, and the instructions required for at least one function, or subsets or extended sets thereof. The memory may also include a non-volatile random access memory.
- The memory provides the processor 170 with management of the hardware, software, and data resources of the computing device, and supports control software and applications. It is also used to store multimedia files as well as running programs and applications.
- the input unit 130 may be configured to receive inputted numeric or character information, and generate key signal inputs related to user settings and function control of the portable multifunction device.
- the input unit 130 may include a touch screen 131 and other input devices 132.
- The touch screen 131 may collect a touch operation performed by the user on or near the touch screen (for example, an operation performed with a finger, a joint, a stylus, or any other suitable object on or near the touch screen), and drive the corresponding connection device according to a preset program.
- The touch screen can detect a user's touch action, convert the touch action into a touch signal and send it to the processor 170, and can receive and execute commands sent by the processor 170; the touch signal includes at least touch point coordinate information.
- the touch screen 131 may provide an input interface and an output interface between the terminal 100 and a user.
- various types such as resistive, capacitive, infrared, and surface acoustic wave can be used to implement the touch screen.
- the input unit 130 may include other input devices.
- The other input devices 132 may include, but are not limited to, one or more of a physical keyboard, function keys (such as a volume control button 132 and a switch button 133), a trackball, a mouse, and a joystick.
- the touch screen 131 may cover the display panel 141.
- When the touch screen 131 detects a touch operation on or near it, the operation is transmitted to the processor 170 to determine the type of the touch event, and the processor 170 then provides the corresponding visual output on the display panel 141 according to the type of the touch event.
- the touch screen and the display unit may be integrated into one component to implement the input, output, and display functions of the terminal 100.
- The embodiment of the present invention uses the term "touch display screen" to denote the combined function set of the touch screen and the display unit. In some embodiments, the touch screen and the display unit may also be two separate components.
- the display unit 140 may be configured to display information input by the user or information provided to the user and various menus of the terminal 100.
- the display unit is further configured to display an image obtained by the device using the camera 150, which may include a preview image in some shooting modes, a captured initial image, and a target image processed by a certain algorithm after shooting.
- the photographing unit 150 is configured to collect an image or a video, and may be triggered to be turned on by an application program instruction to implement a photographing or camera function.
- the shooting unit may include components such as an imaging lens, a filter, and an image sensor. The light emitted or reflected by the object enters the imaging lens, passes through the filter, and finally converges on the image sensor.
- the imaging lens is mainly used for focusing and imaging the light emitted or reflected by the object (also known as the object to be photographed or the target object) in the angle of view of the photograph;
- the filter is mainly used to filter the excess light waves in the light (for example, in addition to visible light) Light waves, such as infrared) are filtered out;
- the image sensor is mainly used to perform photoelectric conversion on the received light signal, convert it into an electrical signal, and input it into the process 170 for subsequent processing.
- the photographing unit 150 may further include a color camera 151 and a depth camera 152; the color camera is used to collect a color image of the target object and may be any color camera commonly used in current terminal products.
- the depth camera is used to obtain the depth information of the target object.
- the depth camera can be implemented by TOF technology and structured light technology.
- TOF is the abbreviation of Time of Flight: the sensor emits modulated near-infrared light, which is reflected after encountering an object.
- the sensor calculates the distance to the photographed object from the time difference or phase difference between light emission and reflection, and thereby generates depth information.
- the three-dimensional contours of objects can be presented in topographic maps with different colors representing different distances.
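The time-difference and phase-difference calculations described for TOF can be sketched as follows. This is a minimal illustration under stated assumptions, not the sensor's actual processing: the function names are hypothetical, and the relations d = c·Δt/2 and d = c·φ/(4π·f) are the standard formulas for pulsed and continuous-wave TOF respectively.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def distance_from_time(delta_t_s: float) -> float:
    """Distance from the round-trip time between emission and reception.

    The light travels to the object and back, hence the division by 2.
    """
    return C * delta_t_s / 2.0


def distance_from_phase(phase_rad: float, mod_freq_hz: float) -> float:
    """Distance from the phase shift of continuous-wave modulated light.

    Assumption: a single modulation frequency, so the result is only
    unambiguous up to C / (2 * mod_freq_hz).
    """
    return C * phase_rad / (4.0 * math.pi * mod_freq_hz)
```

For example, a round-trip time of 4 ns corresponds to roughly 0.6 m, which falls inside the 20 cm to 80 cm scanning range mentioned later in this description.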
- structured light is a system composed of a projection element and a camera.
- the projection element projects specific light information (such as a grating diffraction pattern) onto the surface of the object and the background, and the camera collects it. The position and depth of the object are calculated from the changes in the light signal caused by the object (such as changes in the thickness and displacement of the light pattern), and the entire three-dimensional space is then restored.
- the audio circuit 160, the speaker 161, and the microphone 162 may provide an audio interface between the user and the terminal 100.
- the audio circuit 160 can convert received audio data into an electrical signal and transmit it to the speaker 161, and the speaker 161 converts it into a sound signal for output.
- the microphone 162 is used to collect sound signals and convert them into electrical signals, which are received by the audio circuit 160 and converted into audio data; the audio data is then output to the processor 170 for processing and sent to, for example, another terminal via the radio frequency unit 110, or output to the memory 120 for further processing.
- the audio circuit may also include a headphone jack 163 for providing a connection interface between the audio circuit and the headphones.
- the processor 170 is the control center of the terminal 100 and uses various interfaces and lines to connect the parts of the entire mobile phone. By running or executing instructions stored in the memory 120 and calling data stored in the memory 120, it performs the various functions of the terminal 100 and processes data, thereby monitoring the phone as a whole.
- the processor 170 may include one or more processing units; preferably, the processor 170 may integrate an application processor and a modem processor, wherein the application processor mainly processes an operating system, a user interface, and an application program, etc.
- the modem processor mainly handles wireless communication. It can be understood that the foregoing modem processor may not be integrated into the processor 170.
- the processor and the memory may be implemented on a single chip.
- the processor 170 may also be used to generate corresponding operation control signals and send them to the corresponding components of the computing processing device, and to read and process data in software, especially the data and programs in the memory 120, so that each function module executes its corresponding function, thereby controlling the corresponding component to act as the instruction requires.
- the terminal 100 also includes an external interface 180.
- the external interface can be a standard Micro USB interface or a multi-pin connector; it can be used to connect the terminal 100 to other devices for communication, and can also be used to connect a charger to charge the terminal 100.
- the terminal 100 further includes a power source 190 (such as a battery) for supplying power to various components.
- the power source can be logically connected to the processor 170 through a power management system, so as to implement functions such as management of charging, discharging, and power consumption management through the power management system.
- the terminal 100 may further include a flash, a wireless fidelity (WiFi) module, a Bluetooth module, sensors with different functions, and the like, and details are not described herein again. All the methods described below can be applied to the terminal shown in FIG. 1.
- FIG. 1 is only an example of a portable multifunctional device and does not constitute a limitation on it. The device may include more or fewer components than shown in the figure, combine some components, or use different components.
- an embodiment of the present invention provides an object modeling movement method applied to a mobile terminal. The mobile terminal includes a color camera and a depth camera, and the color camera and the depth camera are located on the same side of the mobile terminal.
- the method includes the following steps:
- Step 21 Use the color camera and the depth camera to perform a panoramic scan of the target object (that is, the scanned object, referred to simply as the object in some paragraphs) to obtain a 3D model of the target object;
- Step 22 Obtain a target bone model;
- Step 23 Fuse the target bone model with the 3D model of the target object;
- Step 24 Obtain a target movement mode;
- Step 25 Control the target bone model according to the target movement mode, so that the 3D model of the target object moves according to the target movement mode.
- the color cameras and depth cameras mentioned above can be located on the front of the terminal device or on the back of the terminal device. Their specific arrangement and quantity can be flexibly determined according to the needs of the designer, which is not limited in this application.
- FIG. 3 shows the main process from scanning the object to implementing the animation. First, the object is scanned: a depth map is obtained by scanning with the depth camera, and a color map is obtained by scanning with the color camera. The depth map and the color map are fused to obtain a textured mesh model, that is, a 3D model of the object. The 3D model is then embedded into the bone model, and the bone model is made to move according to a skeletal animation (the movement of the skeleton is usually not visible, although it can be made visible to the user in some special scenarios), so that the animation effect of the object is presented visually. A detailed description is given below in conjunction with examples.
- Step 21 involves depth camera scanning, color camera scanning, and 3D reconstruction; specific examples are as follows.
- Depth cameras can include a 3D/depth sensor or a 3D/depth sensor module to obtain the depth information of static objects. In theory the scanned object should be static; in practice, slight movement is acceptable to some extent.
- the depth information can be obtained using structured light technology or TOF. As new depth information acquisition methods emerge, the depth module may also include other implementations, which are not limited in the present invention.
- a structured light schematic is shown in Figure 4, where 301 is an invisible infrared light source, 302 is a grating that generates a certain light pattern, 303 is the scanned object, and 304 is an infrared camera. The light pattern reflected by 303 is captured and compared with the expected light pattern, and the depth information of the scanned part of the target object is obtained through calculation.
- a TOF depth camera is shown in Figure 5, where 311 is the target object, 312 is the infrared transmitting end in the TOF camera, and 313 is the infrared receiving end.
- 312 emits infrared light (for example, but not limited to, 850 nm to 1200 nm) toward the target object, and the target object reflects the infrared light.
- the reflected infrared light is received by 313.
- the sensor of 313 (such as, but not limited to, a CMOS array or CCD array with a resolution of more than 240 * 180) generates a series of voltage difference signals from the reflected infrared light.
- the voltage calculation unit 314 performs calculation based on the series of voltage difference signals, and finally obtains the depth information 315 of the scanned part of the target object.
- the depth camera and the color camera are synchronously called, and a certain correction algorithm is adopted so that the images corresponding to the scanned target object are consistent.
- the way the color image is acquired during scanning is basically the same as the way that ordinary cameras take pictures. This section does not repeat them.
- the distance between the object and the depth camera (or mobile terminal) is between 20cm and 80cm.
- a specific scanning method can be that the terminal is kept still while the user holds the target object 30 cm to 70 cm in front of the depth camera and slowly rotates it in all directions, so that the union of all scans can build a complete object. Note that the hand holding the object should cover the surface of the object as little as possible.
- a specific scanning method may be that the object is kept still while the user holds the terminal so that the object is 30 cm to 70 cm in front of the depth camera and scans all the way around the object, so that the union of all the scanned images can build a complete object. Note that the hand holding the terminal should block the surface of the object as little as possible.
- a specific scanning method may be that the object is kept still while the user holds the terminal so that the object is 30 cm to 70 cm in front of the depth camera and scans the object at preset angular intervals until the union of all the scanned images can build a complete object. Note that the hand holding the terminal should block the surface of the object as little as possible.
- the captured scene information should include the full picture of the object without blind spots. Therefore, during the panoramic scan, multiple frames of depth maps (a depth map sequence) are produced, each depth map corresponding to the scene within the scanning range during one scan; multiple frames of color maps (a color map sequence) are likewise produced, each color map corresponding to the scene within the scanning range during one scan.
- the shooting frame rate of the depth camera during the scanning process may be greater than or equal to 25 fps, for example, 30 fps, 60 fps, or 120 fps.
- the terminal may present the scanning progress of the target object, so that the user can observe whether the panorama of the target object has been covered, and the user can autonomously choose to continue scanning or terminate scanning.
- the depth camera and the color camera can be front-facing or rear-facing.
- when the depth camera is located at the top of the front of the phone, it can be used with the front color camera, and front scanning can be used for self-portrait scanning.
- when the depth camera is located at the top of the back of the phone, it can be used with the rear color camera for rear scanning.
- the front and rear positions in the traditional sense should not constitute any physical position limitation.
- the depth camera and the color camera can be located on the same side.
- the terminal can also call a third-party shooting device, such as an externally connected shooting stick, scanner, or external camera.
- an external color camera, an external depth camera, or both can be used.
- the above color camera scanning and depth camera scanning can be started when the user triggers the scanning function; triggering operations include a timer, pressing the shutter, a gesture operation, touchless (mid-air) sensing, device control, and the like.
- the system can prompt which objects are suitable for scanning or 3D modeling in the preview image; for example, a box or the like can be used to identify the objects in the preview image to prompt the user.
- the specific device parameters involved in the aforementioned depth camera and color camera are related to the manufacturing process and user requirements and the design constraints of the terminal, and are not specifically limited in the present invention.
- a depth map sequence 321 and a color map sequence 322 are obtained, where each frame obtained by the depth camera is a depth map of the scanned scene (for example, a Depth image) and each frame obtained by the color camera is a color map of the scanned scene (for example, an RGB image). The depth map sequence 321 is meshed to obtain a mesh model of the target object.
- texture mapping is performed on the mesh model according to the color map sequence 322 to obtain the texture-mapped mesh model 323, that is, a 3D model of the object.
- texture mapping may be performed according to all frames or only certain frames of the color map sequence.
- Step 331 Obtain a color image (including but not limited to RGB) and a depth image (Depth) in each scanning scene of the target object.
- A depth map (DepthMap) contains the distances from the depth camera to multiple points on the surface of the target object.
- A DepthMap is similar to a grayscale image, except that each pixel value represents the actual distance from the depth camera to a point on the surface of the target object.
- Color and Depth images are usually registered.
- Step 332 Preprocessing, including but not limited to: bilateral filtering to denoise the depth map, down-sampling the depth map to generate image pyramids of different resolutions, converting the depth map into a point cloud, estimating the normal vector of each vertex, and cutting off points outside the range of the scanned object.
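The conversion of a depth map into a point cloud, together with cutting points outside the scanning range, can be sketched as follows. This is a minimal illustration assuming a pinhole camera model; the intrinsic parameters `fx`, `fy`, `cx`, `cy` and the function name are hypothetical, and the default range of 0.2 m to 0.8 m simply mirrors the 20 cm to 80 cm distance mentioned earlier.

```python
def depth_to_points(depth, fx, fy, cx, cy, z_min=0.2, z_max=0.8):
    """Back-project a depth map into 3D points in the camera frame.

    depth: list of rows, each value a distance in metres (0 = no reading).
    fx, fy, cx, cy: assumed pinhole intrinsics (focal lengths, principal point).
    Pixels whose depth lies outside [z_min, z_max] are cut.
    """
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if not (z_min <= z <= z_max):
                continue  # outside the range of the scanned object
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points
```

Bilateral filtering, pyramid construction, and normal estimation are omitted here to keep the sketch short.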
- Step 333 In 332, a depth map sequence and a color map sequence of the target object at different scanning positions are collected.
- the single-frame 3D point clouds obtained from the acquired image sequence are converted into a unified coordinate system; that is, the pose transformation relationship between the object at different scanning positions is obtained, which is pose estimation.
- Pose estimation estimates the 3D pose of an object based on an image sequence.
- approaches to pose estimation include feature-point-based registration and point-cloud-based registration.
- point-cloud-based fine registration can be used, for example the ICP (iterative closest point) algorithm.
- coarse registration between the two poses can also be performed to provide an initial value for fine registration; this approach supports faster scanning.
- $M$ is the transformation matrix of the camera pose;
- $s_i$ is the 3D point cloud of the current frame whose pose is to be calculated;
- $d_i$ is the model point cloud in the observation coordinate system and $n_i$ is the normal corresponding to the model point cloud; the objective function minimizes the sum of squared distances from the point cloud of the current frame to the planes of the voxel model point cloud.
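The symbol definitions above match the standard point-to-plane ICP objective. Assuming that standard form (the formula itself does not survive in this text), it can be written as:

$$
M_{opt} = \arg\min_{M} \sum_{i} \big( (M \cdot s_i - d_i) \cdot n_i \big)^{2}
$$

Each term is the squared distance from a transformed current-frame point $M \cdot s_i$ to the plane through the model point $d_i$ with normal $n_i$.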
- Step 334 The 2D depth map is converted into 3D information and integrated into a unified 3D voxel model.
- TSDF (Truncated Signed Distance Function)
- the value of the voxel after fusion is SDF (Signed Distance Function) value, Weight (weight) value, and optional color value.
- The TSDF algorithm is currently a mainstream algorithm for 3D point cloud fusion. Weights are computed by averaging: on each fusion the old weight is increased by one and the new value has a weight of 1; the old and new SDF values are multiplied by their respective weights, summed, and divided by the number of fusions (the new weight) to obtain the new normalized SDF value.
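The weighted averaging described above can be sketched for a single voxel as follows. This is a minimal illustration; the function name and the truncation distance `trunc` are assumptions (truncation is implied by the TSDF name but no value is given in the text).

```python
def fuse_voxel(sdf_old, weight_old, sdf_new, trunc=1.0):
    """Fuse one new SDF observation into a voxel by running average.

    The new observation always has weight 1, and the stored weight
    grows by one per fusion, so it equals the number of fusions.
    trunc is an assumed truncation distance for the new observation.
    """
    sdf_new = max(-trunc, min(trunc, sdf_new))  # truncate the new value
    weight_new = weight_old + 1
    sdf = (sdf_old * weight_old + sdf_new * 1) / weight_new
    return sdf, weight_new
```

Starting from an empty voxel `(0.0, 0)`, fusing 0.4 and then 0.2 yields an SDF of about 0.3 with weight 2.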
- Step 335 Determine whether a preset number of key frames have been saved at certain angular intervals (such as, but not limited to, 30, 45, 60, or 90 degrees) in each of the three directions Roll / Yaw / Pitch.
- if the number of saved key frames is less than the preset number (the criterion being whether the panorama of the target object is covered), the terminal instructs the user to scan further.
- if the number of key frames is sufficient to cover the panorama of the target object, the user is prompted that scanning is complete, and the scanning can be ended and the subsequent steps performed.
- Step 336 During real-time fusion, the key frame information required as input for texture mapping is selected and cached, including color images, poses (position differences between different images), and other information.
- a preset number (N) of key frames are selected in each of the Roll / Yaw / Pitch directions, and the 360-degree texture of the object can be completely restored.
- the angle (Yaw / Pitch / Roll) of each frame in the input image stream is calculated, then the sharpness of each frame is calculated; a selection strategy is constructed from the angle and the sharpness, and the key frames are selected.
- the angle strategy divides the 360 degrees of each direction into N regions of 360 / N degrees, and each region must have a sharp color image.
- $\mathrm{det}_x = a(i+1, j) - a(i, j)$
- $\mathrm{det}_y = a(i, j+1) - a(i, j)$
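The angle-and-sharpness selection strategy can be sketched as follows. This is an illustration under stated assumptions: the text gives only the two difference terms, so summing their squares into a sharpness score is an assumed choice, a(i, j) is assumed to be the grayscale pixel value, and the function names are hypothetical. Only one rotation direction is handled.

```python
def sharpness(img):
    """Gradient-energy sharpness of a grayscale image (list of rows).

    Uses the forward differences det_x = a(i+1, j) - a(i, j) and
    det_y = a(i, j+1) - a(i, j); summing their squares is an assumption.
    """
    h, w = len(img), len(img[0])
    total = 0.0
    for i in range(h - 1):
        for j in range(w - 1):
            det_x = img[i + 1][j] - img[i][j]
            det_y = img[i][j + 1] - img[i][j]
            total += det_x * det_x + det_y * det_y
    return total


def select_key_frames(frames, n_regions):
    """Keep the sharpest frame in each of N regions of 360 / N degrees.

    frames: list of (angle_deg, image) for one rotation direction.
    Returns the angles of the selected key frames, ordered by region.
    """
    region_size = 360.0 / n_regions
    best = {}
    for angle, img in frames:
        region = int((angle % 360.0) // region_size)
        score = sharpness(img)
        if region not in best or score > best[region][0]:
            best[region] = (score, angle)
    return [best[r][1] for r in sorted(best)]
```

A region with no entry in the result has no frame yet, which corresponds to the case where the user is asked to scan further.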
- Step 337 Use Marching Cubes algorithm to realize 3D point cloud meshing, and generate triangular patches.
- the main idea of the Marching Cubes algorithm is to take the cell (a cube of voxel points) as the unit, find the boundary between the content part and the background part of the 3D point cloud, and extract triangles within each cell to fit this boundary.
- voxel points belonging to the content are called real points,
- and background voxel points are called imaginary points.
- the three-dimensional point cloud is thus a lattice of real and imaginary points.
- each of the 8 voxel points of a cell may be a real point or an imaginary point, so a cell has 2 to the 8th power, that is, 256, possible cases.
- the core idea of the Marching Cubes algorithm is to use these 256 enumerable cases to extract the isosurface triangle patches within each cell.
- a cell is a cube formed by eight adjacent voxel points in a three-dimensional image.
- the "cube" in the name Marching Cubes refers to this cell. Note the difference between a cell and a voxel:
- a cell is a cube of 8 voxel points, and each voxel point (except those on the boundary) is shared by 8 cells.
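The 256 cases arise because each of the eight corner points of a cube is either a real point or an imaginary point, so a cube's configuration fits in one 8-bit index. A minimal sketch of computing that index is shown below; the function name, the corner ordering, and the convention that a corner is "real" when its SDF value is below the iso level are assumptions. The triangle lookup table that the index would select from is omitted.

```python
def cube_index(corner_values, iso_level=0.0):
    """Return the Marching Cubes case index (0..255) for one cube.

    corner_values: 8 SDF samples at the cube corners, in a fixed order.
    Bit k is set when corner k is a real point (inside the content),
    assumed here to mean value < iso_level.
    """
    index = 0
    for k, value in enumerate(corner_values):
        if value < iso_level:
            index |= 1 << k
    return index
```

Index 0 (all imaginary) and index 255 (all real) contain no boundary, so no triangles are extracted for them.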
- a specific texture mapping implementation scheme is as follows:
- Step 341 According to the mesh model (triangular patch information) and the pose information of the key frames, determine whether all patches are visible under the pose of each key frame. Input all the triangle patches information of the mesh model and the spatial coordinates of key frames, and output the information of whether all triangle patches are visible in the pose of each key frame.
- the collision detection process involves calculating triangle normal vectors in space, judging whether a ray intersects a triangle, judging whether a ray intersects an AABB (axis-aligned bounding box), and constructing a hierarchical binary tree.
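The two intersection tests named above can be sketched as follows. The text does not say which algorithms are used, so this sketch assumes two common choices: the slab method for the ray/AABB test and the Möller-Trumbore method for the ray/triangle test. All function names are hypothetical, and the hierarchical binary tree is omitted.

```python
def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def ray_hits_aabb(origin, direction, box_min, box_max):
    """Slab test: does the ray (t >= 0) intersect the axis-aligned box?"""
    t_near, t_far = 0.0, float("inf")
    for o, d, lo, hi in zip(origin, direction, box_min, box_max):
        if d == 0.0:
            if o < lo or o > hi:
                return False  # parallel to this slab and outside it
            continue
        t0, t1 = (lo - o) / d, (hi - o) / d
        if t0 > t1:
            t0, t1 = t1, t0
        t_near, t_far = max(t_near, t0), min(t_far, t1)
        if t_near > t_far:
            return False
    return True

def ray_hits_triangle(origin, direction, v0, v1, v2, eps=1e-9):
    """Moller-Trumbore test: return the hit distance t, or None on a miss."""
    e1, e2 = sub(v1, v0), sub(v2, v0)
    p = cross(direction, e2)
    det = dot(e1, p)
    if abs(det) < eps:
        return None  # ray is parallel to the triangle plane
    inv = 1.0 / det
    s = sub(origin, v0)
    u = dot(s, p) * inv
    if u < 0.0 or u > 1.0:
        return None
    q = cross(s, e1)
    v = dot(direction, q) * inv
    if v < 0.0 or u + v > 1.0:
        return None
    t = dot(e2, q) * inv
    return t if t > eps else None
```

A face can be treated as visible from a key frame when a ray from the camera position to the face hits no other triangle first; the box test lets whole groups of triangles be skipped before the per-triangle test.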
- Step 342 According to the result of step 341 and the mesh model, region division and graph cuts are used to label each face of the mesh model, determining which key frame image (view) is chosen to generate its texture.
- the results of patch labeling can be used as input to the Affine Mapping (Warping) module to generate preliminary texture maps.
- Step 343 Map the texture of the corresponding area in the key frame image to the texture map, and smooth the patches (patches at the seams) of different key frames.
- each vertex V at a seam can be regarded as two vertices: Vleft belonging to the left patch and Vright belonging to the right patch.
- the color of each vertex V before adjustment is denoted fv, and the color correction value gv of each vertex V is obtained by solving the following minimization problem.
- v is a vertex at a key frame seam, that is, it belongs to both the left patch and the right patch.
- fv is the color value before adjustment.
- gv is the color correction value, that is, the increment (Δ).
- the meaning of this formula is that, to ensure smoothness at the seams, the difference between the corrected colors of a common point in different frames should be as small as possible.
- vi and vj are any two adjacent vertices on the same patch at a texture seam; their increments should differ as little as possible, so that one is not increased too much while the other is decreased too much, which would cause unevenness.
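The minimization formula itself does not survive in this text. A form consistent with the definitions above, written here as an assumption, is the following, where $\lambda$ is an assumed weighting factor between the two terms:

$$
\arg\min_{g} \; \sum_{v} \Big( \big(f_{v_{\mathrm{left}}} + g_{v_{\mathrm{left}}}\big) - \big(f_{v_{\mathrm{right}}} + g_{v_{\mathrm{right}}}\big) \Big)^{2} + \frac{1}{\lambda} \sum_{v_i, v_j} \big( g_{v_i} - g_{v_j} \big)^{2}
$$

The first sum penalizes color differences across a seam after correction; the second penalizes differences between the corrections of adjacent vertices within the same patch.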
- adjacent faces with the same label in the face-labeling result are stored as a patch; boundary smoothing is performed on all vertices of each patch, the pixel value of each vertex is adjusted, and the triangles enclosed by the final vertices undergo an affine transformation based on position and pixels to form the final texture map.
- the texture atlas of the object is drawn onto the surface of the mesh model of the object to obtain a 3D model of the object, which is generally saved in the .obj format. As shown in Figure 9, for example, the texture atlas of a lion is mapped onto the mesh model of the lion to obtain the textured 3D model of the lion.
- a 3D model of the target object after 3D reconstruction is obtained, that is, a mesh model with texture.
- bones need to be added to the textured mesh, which involves how to obtain the bone model, that is, the target bone model.
- a bone model production library may be provided to the user, containing elements such as line segments and points, where the line segments represent bones and the points represent joint nodes.
- alternatively, users can be provided with a more open production library, in which line segments and points are designed completely freely by the user, where line segments represent bones and points represent joint nodes.
- a bone model with the highest degree of matching with the shape of the target object may be selected from at least one preset bone model as the target bone model.
- Preset bone models can be stored online, in the cloud, or locally.
- a chicken bone model, a dog bone model, and a fish bone model are stored locally.
- the system recognizes the chicken bone model as the target bone model through shape recognition. The similarity criteria include, but are not limited to, bone shape, bone length, bone thickness, number of bones, and bone composition.
- a user's selection instruction may be received, and the selection instruction is used to select a target bone model from at least one preset bone model.
- a specific skeletal assembly scheme is as follows:
- Step 351 To approximate the medial surface (axial plane) and to support other calculations, compute an adaptively sampled distance field with trilinear interpolation. The signed distance from any point to the surface of the object can be evaluated by constructing a k-dimensional tree (kd-tree).
- a kd-tree is a data structure that partitions a k-dimensional data space. It is mainly used to search for key data in multidimensional space.
- Step 352 Calculate a set of sample points located approximately on the axial plane of the object, find the points where the bone joints may be located, and filter out those points close to the surface of the object.
- Step 353 To select the vertices of the skeleton graph from the medial surface (central axis plane), the object can be filled with spheres. All points on the medial surface are sorted by their distance from the surface of the 3D model. Starting from the point farthest from the surface, the largest inscribed sphere inside the 3D model (not exceeding its surface) is drawn, which gives the radius of that sphere. Each remaining point is then traversed in order, and an inscribed sphere is constructed at a point only if the point is not contained in any previously added sphere.
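The greedy sphere-filling procedure of step 353 can be sketched as follows. This is a minimal illustration: the function name is hypothetical, and each candidate point is assumed to arrive with its precomputed distance to the model surface, which serves as the radius of its largest inscribed sphere.

```python
import math


def pack_spheres(candidates):
    """Greedily fill the object with inscribed spheres.

    candidates: list of (center, distance_to_surface) for sample points
    on the medial surface. Points are visited from farthest to nearest
    to the surface; a sphere is added only if its center is not inside
    any sphere added earlier. Returns a list of (center, radius).
    """
    spheres = []
    ordered = sorted(candidates, key=lambda c: c[1], reverse=True)
    for center, radius in ordered:
        if all(math.dist(center, c) >= r for c, r in spheres):
            spheres.append((center, radius))
    return spheres
```

The returned sphere centers are the candidate vertices of the skeleton graph built in the next step.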
- Step 354 A skeleton graph can be constructed by connecting some sphere centers, and connecting the sphere centers of any two spheres as an edge.
- the above steps 351 to 354 may be referred to as bone recognition.
- V represents a vertex and E represents an edge
- Step 356 Identify the bone hierarchy relationship and reduce the simple hierarchy to approximate the bone shape.
- a 3D model of the object with the assembled bone model can be obtained.
- a 3D model obtained by embedding the bone model is obtained in step 23.
- the movement mode of the first object may be obtained, and the movement mode of the first object may be used as the target movement mode.
- the first object may be an object that is currently moving in real time (for example, a running person is filmed and the person's skeleton is extracted by a neural network); it may be an object whose movement was photographed and saved in the past (for example, a set of cute moves of a dog was photographed earlier, and the movement mode was extracted by an algorithm and stored locally or in the cloud); or the movement mode may be a preset movement mode of a specific object (for example, only human-related movements are offered for selection).
- the target movement mode may also be one of several preset movement modes (for example, the actions of humans, dogs, cats, horses, and so on are stored locally, and the user can choose a category according to preference or how well it fits the object type).
- the motion mode can be created by the user using animation software.
- this software can be a tool set embedded in the system, a tool set loaded in the APP used for scanning, reconstruction, and movement, or a third-party animation design tool; the movement mode or animation may be one created now or one created in the past.
- the movement mode may be selected as a target movement mode among a plurality of pre-stored movement modes according to physical attributes. For example, a swimming animation of a fish, an animation of a frog jumping, and an animation of a horse running are stored locally in advance; if the target object scanned by the user is a deer, the animation of the horse running is used as the target movement mode of the deer. (A deer is more similar to a horse than to a fish or a frog in appearance, biological species, bone structure, and other attributes.)
- the movement mode may also be based on the skeletal model of the target object (which can be obtained by using any method in the previous step).
- the system or the user may independently design the skeletal model to obtain the target movement mode. This way is to implement the most suitable animation operation on the 3D model of subsequent objects.
- the movement mode may be a preset skeletal animation, which is generally produced by a professional animation designer.
- a skeletal animation describes how the transformation of each node in the skeleton changes over time, and is usually stored and expressed with keyframes. It usually has the concept of FPS (frames per second), that is, how many frames are contained in one second. A skeletal animation cannot exist apart from a skeleton, otherwise it cannot drive the 3D model, so a skeletal animation usually depends on a specific skeleton.
- the skeleton is usually called a Rig; it describes which bones the skeleton has, the connection relationship of the bones, the default transformation (that is, the pose) of each bone, and some additional information.
- the pose describes a transformed static state of each node in a skeleton, such as one frame of standing or running.
- Each skeleton stores a binding pose, which is the default pose when the skeleton was made.
- a pose generally does not store the hierarchical relationship of the skeleton; instead it uses an array to store the transformation of each node in turn. Each node belongs to a specific bone, so a pose cannot be used independently of the skeleton.
- poses are part of the results of skeletal animation sampling. It can be seen that the skeleton, pose, and skeletal animation are related to each other to achieve subsequent animation operations.
- Bone animation essentially records the changes in position, rotation, and scaling of a series of objects stored in a tree structure over time. Each of these objects is a bone.
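Sampling a pose from a keyframed skeletal animation at a given frame can be sketched as follows. This is a simplified illustration: the function names and the track layout are assumptions, only bone positions are interpolated (linearly), and rotation and scaling, which a full implementation would also interpolate, are omitted.

```python
def sample_track(keyframes, t):
    """Linearly interpolate one bone's position at time t (seconds).

    keyframes: list of (time_s, (x, y, z)) sorted by time.
    Times before the first or after the last keyframe are clamped.
    """
    if t <= keyframes[0][0]:
        return keyframes[0][1]
    for (t0, p0), (t1, p1) in zip(keyframes, keyframes[1:]):
        if t0 <= t <= t1:
            a = (t - t0) / (t1 - t0)
            return tuple(x0 + a * (x1 - x0) for x0, x1 in zip(p0, p1))
    return keyframes[-1][1]


def sample_animation(tracks, fps, frame):
    """Return the pose (bone name -> position) at a frame index.

    tracks: {bone_name: keyframes}; fps converts the frame index to time.
    """
    t = frame / fps
    return {bone: sample_track(kf, t) for bone, kf in tracks.items()}
```

With a 30 FPS animation whose root bone moves from x = 0 to x = 2 over one second, frame 15 yields a root position of (1.0, 0.0, 0.0).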
- the animation is realized by mapping the animated bone transformations of a set of animations onto the 3D model with bones assembled in the previous step ("automatic skeletal assembly").
- this action mapping can be implemented by, but is not limited to, game engines, animation engines, and the like; the 3D model performs pose transformations according to the bone transformations,
- and the successive pose transformations form a series of animation actions.
- the user's perception is that the scanned object has been "brought to life" and the static thing has "started moving".
- skinning technology is the basis for ensuring that the 3D model of the object follows the movement of the skeletal model.
- the animation of the 3D model of the object is the action of the 3D model equipped with bones mapped to a set of changing bone models.
- Each frame needs to realize the deformation of the 3D model surface (that is, the 3D model skin of the object) according to the changes of the bones. This process is called skinning.
- a linear blend skinning (LBS) scheme can be adopted.
- the following formula can be used to obtain the position in the next state from the position in the previous state.
- $v_i$ is the previous position;
- $v'_i$ is the next position;
- $w_{i,j}$ is the weight of the j-th bone at point i;
- $T_j$ is the transformation matrix of the j-th bone.
- the next position can be calculated for a sufficient number of points on the surface of the 3D model, which determines the pose of the 3D model in the next state and thereby realizes the animation.
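The formula itself does not survive in this text. The standard linear blend skinning relation, which matches the symbols defined above, is $v'_i = \sum_j w_{i,j} \, T_j \, v_i$. A minimal sketch under that assumption follows; the function names are hypothetical, and each $T_j$ is assumed to be a 4x4 homogeneous matrix stored as row-major nested lists.

```python
def transform(T, v):
    """Apply a 4x4 homogeneous matrix (row-major nested lists) to a 3D point."""
    x, y, z = v
    return tuple(T[r][0] * x + T[r][1] * y + T[r][2] * z + T[r][3]
                 for r in range(3))


def lbs(vertex, weights, bone_transforms):
    """Linear blend skinning for one vertex: v' = sum_j w_j * (T_j v).

    weights: per-bone weights for this vertex, expected to sum to 1.
    bone_transforms: one 4x4 matrix per bone, in the same order.
    """
    out = [0.0, 0.0, 0.0]
    for w, T in zip(weights, bone_transforms):
        p = transform(T, vertex)
        for k in range(3):
            out[k] += w * p[k]
    return tuple(out)
```

A vertex weighted equally between a bone that stays in place and a bone that moves 2 units along x ends up 1 unit along x, which is the smooth blending that lets the skin follow the bones.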
- the core of the skinning technique is to find the weight $w_{i,j}$ of each bone for each vertex.
- a method similar to thermal equilibrium can be used to calculate the weights.
- the 3D model is regarded as an insulated heat conductor; the temperature of the i-th bone is set to 1° and the temperature of the remaining bones is set to 0°. According to the principle of thermal equilibrium, the temperature of each surface vertex after equilibrium is taken as the weight of that bone at the vertex, and the weights lie in the interval [0,1].
- the weight calculation method based on thermal balance makes the calculation result of the weight have smooth characteristics, and the presented motion effect will be more real and natural.
- a specific animation process can be realized as shown in FIG. 11.
- motion mapping is performed on a 3D model equipped with a target skeleton model according to a motion model or an animation model, and the target skeleton model is controlled.
- the target skeletal model moves according to a preset motion model, and the skin data is calculated and updated in real time during the movement, so that the 3D model can follow the target skeletal model to achieve smooth movement, and then realize the animation of the 3D model.
- the skeletal structure of the target object and the skeletal structure of the animation may not be exactly the same, in which case the two can be mapped onto each other; for example, the key nodes must correspond, while the bone lengths can be scaled proportionally. Alternatively, the bone model of the object and the skeletal structure of the animation can be adapted to each other, for example by proportional cropping and extension, at least so that the skeletal structure of the animation does not exceed the outer surface of the 3D model of the object. Further, some physical operations can be performed to trim the animated skeleton so that it fills the 3D model of the object as fully as possible, making the animated bone model and the 3D model of the object more harmonious.
- Steps 21-25 above can be completed consecutively in a single pass or separated by time intervals.
- The 3D model can be stored locally or in the cloud.
- The stored 3D model can be called up directly at a later time; the user can then freely choose the bone assembly or the animation mode, and can also select the animation background, including but not limited to images captured in real time, other images already stored locally, and images from cloud data.
- While the terminal displays the object animation, it can also display the object's shadow or add sound effects, special effects, and so on.
- The animation can be played automatically by the mobile terminal, or its playback can be controlled by operation instructions input by the user.
- The whole series of operations (scanning, 3D reconstruction, bone assembly, and preset animation display) can be carried out on the mobile terminal, so that users can easily enjoy 3D scanning.
- The presentation of two-dimensional images can transition to the presentation of 3D animation, allowing users to give virtual animated actions to real objects that they have scanned and modeled. This greatly increases users' interest in and attachment to mobile terminals and leads photography applications into a new trend.
- An embodiment of the present invention provides an object modeling and movement apparatus 700.
- The apparatus 700 can be applied to various types of photographing devices. As shown in FIG. 12, the apparatus 700 includes a scanning module 701, a first acquisition module 702, a fusion module 703, a second acquisition module 704, and a motion module 705. The apparatus is applied to a mobile terminal that includes a color camera and a depth camera, both located on the same side of the mobile terminal. For related features, refer to the description in the foregoing method embodiment.
- The scanning module 701 is configured to obtain a 3D model of a target object when the color camera and the depth camera perform panoramic scanning on the target object.
- The scanning module 701 can be implemented by a processor invoking program instructions in the memory to enable and control the color camera and the depth camera. Further, pictures acquired during scanning can be selectively stored in the memory.
- The first acquisition module 702 is configured to obtain a target bone model.
- The first acquisition module 702 may be implemented by a processor invoking corresponding program instructions. Further, it may be implemented by invoking data and algorithms in a local memory or on a cloud server and performing the corresponding calculations.
- The fusion module 703 is configured to fuse the target bone model with the 3D model of the target object.
- The fusion module 703 may be implemented by a processor invoking corresponding program instructions. Further, it may be implemented by invoking data and algorithms in a local memory or on a cloud server and performing the corresponding calculations.
- The second acquisition module 704 is configured to obtain a target movement mode.
- The second acquisition module 704 may be implemented by a processor invoking corresponding program instructions. Further, it may be implemented by invoking data and algorithms in a local memory or on a cloud server and performing the corresponding calculations.
- The motion module 705 is configured to control the bone model according to the target movement mode, so that the 3D model of the target object moves according to the target movement mode.
- The motion module 705 may be implemented by a processor invoking corresponding program instructions. Further, it may also be implemented by invoking data and algorithms in a local memory or on a cloud server.
- The scanning module 701 is specifically configured to execute the method described in step 21 and any method that can equivalently replace it.
- The first acquisition module 702 is specifically configured to execute the method described in step 22 and any method that can equivalently replace it.
- The fusion module 703 is specifically configured to execute the method described in step 23 and any method that can equivalently replace it.
- The second acquisition module 704 is specifically configured to execute the method described in step 24 and any method that can equivalently replace it.
- The motion module 705 is specifically configured to execute the method described in step 25 and any method that can equivalently replace it.
- The scanning module 701 may perform the methods of steps 331-337 and 341-343; the fusion module 703 may perform the methods of steps 351-356.
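The division of work among the five modules can be pictured with a small sketch. It is purely illustrative: the class name `ObjectModelingPipeline`, the field names, and the callable signatures are assumptions introduced here, and the real modules would wrap camera control, reconstruction, and rendering code that the source does not spell out.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class ObjectModelingPipeline:
    """Hypothetical wiring of modules 701-705 as interchangeable callables."""

    scan: Callable[[], Any]                # scanning module 701 (step 21)
    get_skeleton: Callable[[Any], Any]     # first acquisition module 702 (step 22)
    fuse: Callable[[Any, Any], Any]        # fusion module 703 (step 23)
    get_motion: Callable[[], Any]          # second acquisition module 704 (step 24)
    animate: Callable[[Any, Any], Any]     # motion module 705 (step 25)

    def run(self) -> Any:
        model = self.scan()                   # 3D model of the target object
        skeleton = self.get_skeleton(model)   # target bone model
        rigged = self.fuse(skeleton, model)   # bone model fused with 3D model
        motion = self.get_motion()            # target movement mode
        return self.animate(rigged, motion)   # animated 3D model
```

Keeping each module behind a plain callable mirrors the statement that each module can be implemented locally or by invoking data and algorithms on a cloud server, since either implementation can be substituted without changing the order of the steps.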
- The apparatus 700 provided by this embodiment of the present invention integrates the whole workflow for an object: scanning, 3D reconstruction, bone assembly, and preset animation display. Users do not need professional, cumbersome, and complicated equipment for scanning, and they do not need to move to a PC for complex modeling and animation processing. These functions are integrated and provided to the user together, enabling the user to carry out the whole series of operations easily on a single mobile terminal, so that any "static object (or nearly static object)" around the user can become more lively and vivid. This increases the user's interest in using the terminal and improves the user experience.
- Each of the above modules can be a separately established processing element, or can be integrated into a chip of the terminal. In addition, a module can be stored in a storage element of the controller in the form of program code, and a processing element of the processor can call the code and execute the functions of the above modules.
- The modules can be integrated together or implemented independently.
- The processing element described herein may be an integrated circuit chip with signal processing capabilities.
- Each step of the above method, or each of the above modules, may be completed by an integrated logic circuit of hardware in the processing element or by instructions in the form of software.
- The processing element may be a general-purpose processor, such as a central processing unit (CPU), or one or more integrated circuits configured to implement the above methods, such as one or more application-specific integrated circuits (ASICs), one or more microprocessors such as digital signal processors (DSPs), or one or more field-programmable gate arrays (FPGAs).
- The embodiments of the present invention may be provided as a method, a system, or a computer program product. Therefore, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.
- These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to work in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture that includes an instruction apparatus; the instruction apparatus implements the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
- These computer program instructions can also be loaded onto a computer or other programmable data processing device, so that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing; the instructions executed on the computer or other programmable device thereby provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computer Graphics (AREA)
- Geometry (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computer Hardware Design (AREA)
- Architecture (AREA)
- Data Mining & Analysis (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- Evolutionary Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Processing Or Creating Images (AREA)
- Image Generation (AREA)
- Auxiliary Devices For And Details Of Packaging Control (AREA)
Claims (20)
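- An object modeling and movement method, wherein the method is applied to a mobile terminal, and the mobile terminal comprises a color camera and a depth sensor module; the method comprises: performing panoramic scanning on a target object by using the color camera and the depth sensor module to obtain a 3D model of the target object; obtaining a target movement mode; and causing the 3D model of the target object to move according to the target movement mode.
- The method according to claim 1, wherein the depth sensor module comprises a TOF module or a structured light module, and the color camera and the depth sensor module are located on the same side of the mobile terminal.
- The method according to claim 1 or 2, wherein after the obtaining of the 3D model of the target object, the method further comprises: obtaining a target bone model; and fusing the target bone model with the 3D model of the target object; and before the causing of the 3D model of the target object to move according to the target movement mode, the method further comprises: controlling the bone model according to the target movement mode.
- The method according to any one of claims 1 to 3, wherein the obtaining of the target bone model comprises: receiving an operation instruction of a user, wherein the operation instruction is used to combine at least two line segments and at least one point into a bone model, a line segment represents a bone in the bone model, and a point represents a joint node in the bone model.
- The method according to any one of claims 1 to 3, wherein the obtaining of the target bone model comprises: selecting, from at least one preset bone model, the bone model that best matches the shape of the target object as the target bone model.
- The method according to any one of claims 1 to 5, wherein the obtaining of the target movement mode comprises: obtaining a movement mode of a first object, and using the movement mode of the first object as the target movement mode.
- The method according to any one of claims 1 to 5, wherein the obtaining of the target movement mode comprises: presenting at least two movement modes to a user, receiving a selection instruction of the user, and determining the target movement mode from the at least two movement modes; or selecting, from a plurality of pre-stored movement modes according to an attribute of the target object, the movement mode with the highest attribute matching degree as the target movement mode.
- The method according to any one of claims 1 to 5, wherein the obtaining of the target movement mode comprises: receiving an animation made by a user for the bone model, and determining the animation as the target movement mode.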
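- An object modeling and movement apparatus, wherein the apparatus is applied to a mobile terminal, and the mobile terminal comprises a color camera and a depth sensor module; the apparatus comprises: a scanning module, configured to obtain a 3D model of a target object when the color camera and the depth sensor perform panoramic scanning on the target object; a second acquisition module, configured to obtain a target movement mode; and a motion module, configured to cause the 3D model of the target object to move according to the target movement mode.
- The apparatus according to claim 9, wherein the depth sensor module comprises a TOF module or a structured light module, and the color camera and the depth sensor module are located on the same side of the mobile terminal.
- The apparatus according to claim 9 or 10, wherein the apparatus further comprises: a first acquisition module, configured to obtain a target bone model; and a fusion module, configured to fuse the target bone model with the 3D model of the target object; wherein the motion module is further specifically configured to control the bone model according to the target movement mode.
- The apparatus according to any one of claims 9 to 11, wherein the first acquisition module is specifically configured to: receive an operation instruction of a user, wherein the operation instruction is used to combine at least two line segments and at least one point into a bone model, a line segment represents a bone in the bone model, and a point represents a joint node in the bone model; or select, from at least one preset bone model, the bone model that best matches the shape of the target object as the target bone model; or obtain a movement mode of a first object, and use the movement mode of the first object as the target movement mode.
- The apparatus according to any one of claims 9 to 12, wherein the second acquisition module is specifically configured to: present at least two movement modes to a user, receive a selection instruction of the user, and determine the target movement mode from the at least two movement modes; or receive an animation made by a user for the bone model, and determine the animation as the target movement mode; or select, from a plurality of pre-stored movement modes according to the physical attribute, the movement mode with the highest attribute matching degree as the target movement mode.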
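- A terminal device, wherein the terminal device comprises a memory, a processor, a bus, a depth sensor module, and a color camera; the color camera and the depth sensor module are located on the same side of the mobile terminal; the memory, the depth sensor module, the color camera, and the processor are connected through the bus; the depth sensor module and the color camera are configured to perform panoramic scanning on a target object under control of the processor; the memory is configured to store a computer program and instructions; and the processor is configured to invoke the computer program and instructions stored in the memory, so that the terminal device performs the method according to any one of claims 1 to 9.
- The terminal device according to claim 14, wherein the terminal device further comprises an antenna system; the antenna system sends and receives wireless communication signals under control of the processor to implement wireless communication with a mobile communication network; and the mobile communication network comprises one or more of the following: a GSM network, a CDMA network, a 3G network, a 4G network, a 5G network, FDMA, TDMA, PDC, TACS, AMPS, WCDMA, TDSCDMA, WIFI, and an LTE network.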
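- An object modeling and movement method, wherein the method is applied to a mobile terminal, and the mobile terminal comprises a color camera and a depth sensor module; the color camera and the depth sensor module are located on the same side of the mobile terminal; the depth sensor module comprises a TOF module or a structured light module; the method comprises: performing panoramic scanning on a target object by using the color camera and the depth sensor module to obtain a 3D model of the target object; obtaining a target bone model; fusing the target bone model with the 3D model of the target object; obtaining a target movement mode; and controlling the bone model according to the target movement mode, so that the 3D model of the target object moves according to the target movement mode.
- The method according to claim 16, wherein the obtaining of the target bone model comprises: receiving an operation instruction of a user, wherein the operation instruction is used to combine at least two line segments and at least one point into a bone model, a line segment represents a bone in the bone model, and a point represents a joint node in the bone model; or selecting, from at least one preset bone model, the bone model that best matches the shape of the target object as the target bone model.
- The method according to claim 16 or 17, wherein the obtaining of the target movement mode comprises: obtaining a movement mode of a first object, and using the movement mode of the first object as the target movement mode; or presenting at least two movement modes to a user, receiving a selection instruction of the user, and determining the target movement mode from the at least two movement modes; or receiving an animation made by a user for the bone model, and determining the animation as the target movement mode; or selecting, from a plurality of pre-stored movement modes according to an attribute of the target object, the movement mode with the highest attribute matching degree as the target movement mode.
- An object modeling and movement apparatus, wherein the apparatus is applied to a mobile terminal, the mobile terminal comprises a color camera and a depth sensor module, and the color camera and the depth sensor module are located on the same side of the mobile terminal; the depth sensor module comprises a TOF module or a structured light module; the apparatus comprises: a scanning module, configured to obtain a 3D model of a target object when the color camera and the depth sensor perform panoramic scanning on the target object; a first acquisition module, configured to obtain a target bone model; a fusion module, configured to fuse the target bone model with the 3D model of the target object; a second acquisition module, configured to obtain a target movement mode; and a motion module, configured to control the bone model according to the target movement mode, so that the 3D model of the target object moves according to the target movement mode.
- The apparatus according to claim 19, wherein the first acquisition module is specifically configured to: receive an operation instruction of a user, wherein the operation instruction is used to combine at least two line segments and at least one point into a bone model, a line segment represents a bone in the bone model, and a point represents a joint node in the bone model; or select, from at least one preset bone model, the bone model that best matches the shape of the target object as the target bone model; or obtain a movement mode of a first object, and use the movement mode of the first object as the target movement mode. The second acquisition module is specifically configured to: present at least two movement modes to a user, receive a selection instruction of the user, and determine the target movement mode from the at least two movement modes; or receive an animation made by a user for the bone model, and determine the animation as the target movement mode; or select, from a plurality of pre-stored movement modes according to the physical attribute, the movement mode with the highest attribute matching degree as the target movement mode.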
Priority Applications (9)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020217001341A KR102524422B1 (ko) | 2018-06-21 | 2019-05-27 | 객체 모델링 및 움직임 방법 및 장치, 그리고 기기 |
BR112020025903-9A BR112020025903A2 (pt) | 2018-06-21 | 2019-05-27 | Método e aparelho de modelagem e movimento de objeto, e dispositivo |
SG11202012802RA SG11202012802RA (en) | 2018-06-21 | 2019-05-27 | Object modeling and movement method and apparatus, and device |
CA3104558A CA3104558A1 (en) | 2018-06-21 | 2019-05-27 | Object modeling and movement method and apparatus, and device |
AU2019291441A AU2019291441B2 (en) | 2018-06-21 | 2019-05-27 | Object modeling and movement method and apparatus, and device |
JP2020570722A JP7176012B2 (ja) | 2018-06-21 | 2019-05-27 | オブジェクト・モデリング動作方法及び装置並びにデバイス |
EP19821647.5A EP3726476A4 (en) | 2018-06-21 | 2019-05-27 | OBJECT MODELING PROCESS, APPARATUS AND DEVICE |
US16/931,024 US11436802B2 (en) | 2018-06-21 | 2020-07-16 | Object modeling and movement method and apparatus, and device |
US17/879,164 US20220383579A1 (en) | 2018-06-21 | 2022-08-02 | Object modeling and movement method and apparatus, and device |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810646701.0 | 2018-06-21 | ||
CN201810646701.0A CN110634177A (zh) | 2018-06-21 | 2018-06-21 | 一种物体建模运动方法、装置与设备 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US16/931,024 Continuation US11436802B2 (en) | 2018-06-21 | 2020-07-16 | Object modeling and movement method and apparatus, and device |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019242454A1 true WO2019242454A1 (zh) | 2019-12-26 |
Family
ID=68967803
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/088480 WO2019242454A1 (zh) | 2018-06-21 | 2019-05-27 | 一种物体建模运动方法、装置与设备 |
Country Status (10)
Country | Link |
---|---|
US (2) | US11436802B2 (zh) |
EP (1) | EP3726476A4 (zh) |
JP (1) | JP7176012B2 (zh) |
KR (1) | KR102524422B1 (zh) |
CN (3) | CN111640176A (zh) |
AU (1) | AU2019291441B2 (zh) |
BR (1) | BR112020025903A2 (zh) |
CA (1) | CA3104558A1 (zh) |
SG (1) | SG11202012802RA (zh) |
WO (1) | WO2019242454A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111383309A (zh) * | 2020-03-06 | 2020-07-07 | 腾讯科技(深圳)有限公司 | 一种骨骼动画驱动方法、装置及存储介质 |
JP7086362B1 (ja) | 2021-03-29 | 2022-06-20 | 株式会社セルシス | 情報処理システム、情報処理方法および情報処理プログラム |
JP2022550555A (ja) * | 2020-01-14 | 2022-12-02 | ▲騰▼▲訊▼科技(深▲セン▼)有限公司 | 画像処理方法、装置、電子機器及びコンピュータプログラム |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP7079287B2 (ja) * | 2019-11-07 | 2022-06-01 | 株式会社スクウェア・エニックス | 鑑賞システム、モデル構成装置、制御方法、プログラム及び記録媒体 |
US11636618B2 (en) * | 2019-11-14 | 2023-04-25 | Samsung Electronics Co., Ltd. | Device and method with simultaneous implementation of localization and mapping |
US11080862B2 (en) * | 2019-11-18 | 2021-08-03 | Ncku Research And Development Foundation | Reliability based keyframe switching system and method adaptable to ICP |
US11023730B1 (en) * | 2020-01-02 | 2021-06-01 | International Business Machines Corporation | Fine-grained visual recognition in mobile augmented reality |
EP4191540A4 (en) * | 2020-07-27 | 2024-08-07 | Vrc Inc | 3D DATA SYSTEM AND 3D DATA GENERATION METHODS |
CN112014799B (zh) * | 2020-08-05 | 2024-02-09 | 七海行(深圳)科技有限公司 | 一种数据采集方法及巡检装置 |
CN111951360B (zh) * | 2020-08-14 | 2023-06-23 | 腾讯科技(深圳)有限公司 | 动画模型处理方法、装置、电子设备及可读存储介质 |
CN111921202B (zh) * | 2020-09-16 | 2021-01-08 | 成都完美天智游科技有限公司 | 虚拟场景的数据处理方法、装置、设备及可读存储介质 |
CN112330777B (zh) * | 2020-11-03 | 2022-11-18 | 上海镱可思多媒体科技有限公司 | 基于三维动画的电机仿真运行数据生成方法、系统及终端 |
CN112347540B (zh) * | 2020-11-09 | 2023-09-08 | 重庆智慧之源科技有限公司 | 建筑物智能检测建模系统 |
CN112417746B (zh) * | 2020-11-18 | 2022-11-25 | 中北大学 | 一种基于神经网络预测碰撞检测的方法 |
CN112435316B (zh) * | 2020-11-30 | 2023-05-12 | 上海米哈游天命科技有限公司 | 一种游戏中的防穿模方法、装置、电子设备及存储介质 |
CN118154732A (zh) * | 2020-12-22 | 2024-06-07 | 完美世界(北京)软件科技发展有限公司 | 动画数据的处理方法及装置、存储介质、计算机设备 |
CN112541969B (zh) * | 2021-01-21 | 2022-04-12 | 深圳市桔子智能信息科技有限公司 | 一种三维人体模型骨骼动态转移绑定方法 |
US20220237838A1 (en) * | 2021-01-27 | 2022-07-28 | Nvidia Corporation | Image synthesis using one or more neural networks |
CN113034691A (zh) * | 2021-03-22 | 2021-06-25 | 广州虎牙科技有限公司 | 人体模型的骨骼绑定方法、装置及电子设备 |
CN112927331B (zh) * | 2021-03-31 | 2023-09-22 | 腾讯科技(深圳)有限公司 | 角色模型的动画生成方法和装置、存储介质及电子设备 |
KR102571744B1 (ko) * | 2021-05-06 | 2023-08-29 | 한국전자통신연구원 | 3차원 콘텐츠 생성 방법 및 장치 |
CN113313794B (zh) | 2021-05-19 | 2022-11-08 | 深圳市慧鲤科技有限公司 | 动画迁移方法和装置、设备及存储介质 |
CN114118664A (zh) * | 2021-07-21 | 2022-03-01 | 岭南师范学院 | 一种解决属性权重和时间权重复杂性的动态决策方法 |
WO2023022373A1 (en) * | 2021-08-19 | 2023-02-23 | Samsung Electronics Co., Ltd. | Method and system for generating an animation from a static image |
CN117321637A (zh) | 2021-08-19 | 2023-12-29 | 三星电子株式会社 | 用于从静态图像生成动画的方法和系统 |
CN113744400B (zh) * | 2021-09-09 | 2024-07-16 | 网易(杭州)网络有限公司 | 地形蒙版选区确定方法、装置及计算机设备 |
EP4250243A1 (en) * | 2021-11-25 | 2023-09-27 | CLO Virtual Fashion Inc. | Method and apparatus for determining body part boundary surface of three-dimensional avatar |
US12069228B2 (en) * | 2021-12-28 | 2024-08-20 | Faro Technologies, Inc. | Artificial panorama image production and in-painting for occluded areas in images |
KR102561903B1 (ko) * | 2022-01-25 | 2023-08-02 | 스크린커플스(주) | 클라우드 서버를 이용한 ai 기반의 xr 콘텐츠 서비스 방법 |
WO2023224251A1 (en) * | 2022-05-16 | 2023-11-23 | Samsung Electronics Co., Ltd. | Systems and methods for recognizing non-line-of-sight human actions |
CN116055778B (zh) * | 2022-05-30 | 2023-11-21 | 荣耀终端有限公司 | 视频数据的处理方法、电子设备及可读存储介质 |
CN116452755B (zh) * | 2023-06-15 | 2023-09-22 | 成就医学科技(天津)有限公司 | 一种骨骼模型构建方法、系统、介质及设备 |
CN116664727B (zh) * | 2023-07-27 | 2023-12-08 | 深圳市中手游网络科技有限公司 | 一种游戏动画模型识别方法及处理系统 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102448565A (zh) * | 2009-05-29 | 2012-05-09 | 微软公司 | 将骨架数据实时重新定标到游戏化身 |
CN102915112A (zh) * | 2011-06-23 | 2013-02-06 | 奥美可互动有限责任公司 | 用于近距离动作跟踪的系统和方法 |
US8542252B2 (en) * | 2009-05-29 | 2013-09-24 | Microsoft Corporation | Target digitization, extraction, and tracking |
CN103597516A (zh) * | 2011-06-06 | 2014-02-19 | 微软公司 | 控制虚拟环境中的对象 |
CN103703489A (zh) * | 2011-06-06 | 2014-04-02 | 微软公司 | 对象数字化 |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4229316B2 (ja) * | 2003-05-09 | 2009-02-25 | 株式会社バンダイナムコゲームス | 画像生成システム、プログラム及び情報記憶媒体 |
JP5845830B2 (ja) * | 2011-11-09 | 2016-01-20 | ソニー株式会社 | 情報処理装置、表示制御方法、およびプログラム |
JP6018707B2 (ja) * | 2012-06-21 | 2016-11-02 | マイクロソフト コーポレーション | デプスカメラを使用するアバター構築 |
CN102800126A (zh) * | 2012-07-04 | 2012-11-28 | 浙江大学 | 基于多模态融合的实时人体三维姿态恢复的方法 |
WO2014042121A1 (ja) * | 2012-09-12 | 2014-03-20 | 独立行政法人産業技術総合研究所 | 動作評価装置及びそのプログラム |
CN203240221U (zh) * | 2013-01-17 | 2013-10-16 | 佛山科学技术学院 | 一种电动转台 |
KR102058857B1 (ko) * | 2013-04-08 | 2019-12-26 | 삼성전자주식회사 | 촬영 장치 및 촬영 제어 방법 |
RU2668408C2 (ru) * | 2013-08-04 | 2018-09-28 | Айсмэтч Лтд | Устройства, системы и способы виртуализации зеркала |
CN104021584B (zh) * | 2014-06-25 | 2017-06-06 | 无锡梵天信息技术股份有限公司 | 一种骨骼蒙皮动画的实现方法 |
US9626803B2 (en) * | 2014-12-12 | 2017-04-18 | Qualcomm Incorporated | Method and apparatus for image processing in augmented reality systems |
CN105137973B (zh) * | 2015-08-21 | 2017-12-01 | 华南理工大学 | 一种人机协作场景下的机器人智能躲避人类方法 |
US20170054897A1 (en) * | 2015-08-21 | 2017-02-23 | Samsung Electronics Co., Ltd. | Method of automatically focusing on region of interest by an electronic device |
CN105225269B (zh) * | 2015-09-22 | 2018-08-17 | 浙江大学 | 基于运动机构的三维物体建模系统 |
JP2017080203A (ja) * | 2015-10-29 | 2017-05-18 | キヤノンマーケティングジャパン株式会社 | 情報処理装置、情報処理方法、プログラム |
CN105590096B (zh) * | 2015-12-18 | 2019-05-28 | 运城学院 | 基于深度映射的人体活动识别特征表达方法 |
JP6733267B2 (ja) * | 2016-03-31 | 2020-07-29 | 富士通株式会社 | 情報処理プログラム、情報処理方法および情報処理装置 |
KR101819730B1 (ko) * | 2016-04-19 | 2018-01-17 | 광주과학기술원 | 3차원 객체 검출 및 자세추정 방법 |
CN107577334A (zh) * | 2016-07-04 | 2018-01-12 | 中兴通讯股份有限公司 | 一种移动终端的体感操作方法及装置 |
CN106251389B (zh) * | 2016-08-01 | 2019-12-24 | 北京小小牛创意科技有限公司 | 制作动画的方法和装置 |
ZA201701187B (en) * | 2016-08-10 | 2019-07-31 | Tata Consultancy Services Ltd | Systems and methods for identifying body joint locations based on sensor data analysis |
US20180225858A1 (en) * | 2017-02-03 | 2018-08-09 | Sony Corporation | Apparatus and method to generate realistic rigged three dimensional (3d) model animation for view-point transform |
CN107248195A (zh) * | 2017-05-31 | 2017-10-13 | 珠海金山网络游戏科技有限公司 | 一种增强现实的主播方法、装置和系统 |
CN107274465A (zh) * | 2017-05-31 | 2017-10-20 | 珠海金山网络游戏科技有限公司 | 一种虚拟现实的主播方法、装置和系统 |
CN108053435A (zh) * | 2017-11-29 | 2018-05-18 | 深圳奥比中光科技有限公司 | 基于手持移动设备的动态实时三维重建方法和系统 |
CN108154551B (zh) * | 2017-11-29 | 2021-04-30 | 奥比中光科技集团股份有限公司 | 实时动态重建三维人体模型的方法及系统 |
US9959671B1 (en) * | 2018-01-18 | 2018-05-01 | Scandy, LLC | System and method for capturing, processing and rendering data through a template-driven processing pipeline |
-
2018
- 2018-06-21 CN CN202010312661.3A patent/CN111640176A/zh active Pending
- 2018-06-21 CN CN201810646701.0A patent/CN110634177A/zh active Pending
- 2018-06-21 CN CN202010312594.5A patent/CN111640175A/zh active Pending
-
2019
- 2019-05-27 KR KR1020217001341A patent/KR102524422B1/ko active IP Right Grant
- 2019-05-27 CA CA3104558A patent/CA3104558A1/en active Pending
- 2019-05-27 WO PCT/CN2019/088480 patent/WO2019242454A1/zh unknown
- 2019-05-27 BR BR112020025903-9A patent/BR112020025903A2/pt unknown
- 2019-05-27 SG SG11202012802RA patent/SG11202012802RA/en unknown
- 2019-05-27 EP EP19821647.5A patent/EP3726476A4/en active Pending
- 2019-05-27 AU AU2019291441A patent/AU2019291441B2/en not_active Expired - Fee Related
- 2019-05-27 JP JP2020570722A patent/JP7176012B2/ja active Active
-
2020
- 2020-07-16 US US16/931,024 patent/US11436802B2/en active Active
-
2022
- 2022-08-02 US US17/879,164 patent/US20220383579A1/en not_active Abandoned
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102448565A (zh) * | 2009-05-29 | 2012-05-09 | 微软公司 | 将骨架数据实时重新定标到游戏化身 |
US8542252B2 (en) * | 2009-05-29 | 2013-09-24 | Microsoft Corporation | Target digitization, extraction, and tracking |
CN103597516A (zh) * | 2011-06-06 | 2014-02-19 | 微软公司 | 控制虚拟环境中的对象 |
CN103703489A (zh) * | 2011-06-06 | 2014-04-02 | 微软公司 | 对象数字化 |
CN102915112A (zh) * | 2011-06-23 | 2013-02-06 | 奥美可互动有限责任公司 | 用于近距离动作跟踪的系统和方法 |
Non-Patent Citations (1)
Title |
---|
See also references of EP3726476A4 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2022550555A (ja) * | 2020-01-14 | 2022-12-02 | ▲騰▼▲訊▼科技(深▲セン▼)有限公司 | 画像処理方法、装置、電子機器及びコンピュータプログラム |
JP7299414B2 (ja) | 2020-01-14 | 2023-06-27 | ▲騰▼▲訊▼科技(深▲セン▼)有限公司 | 画像処理方法、装置、電子機器及びコンピュータプログラム |
CN111383309A (zh) * | 2020-03-06 | 2020-07-07 | 腾讯科技(深圳)有限公司 | 一种骨骼动画驱动方法、装置及存储介质 |
CN111383309B (zh) * | 2020-03-06 | 2023-03-17 | 腾讯科技(深圳)有限公司 | 一种骨骼动画驱动方法、装置及存储介质 |
JP7086362B1 (ja) | 2021-03-29 | 2022-06-20 | 株式会社セルシス | 情報処理システム、情報処理方法および情報処理プログラム |
JP2022152058A (ja) * | 2021-03-29 | 2022-10-12 | 株式会社セルシス | 情報処理システム、情報処理方法および情報処理プログラム |
Also Published As
Publication number | Publication date |
---|---|
JP2021527895A (ja) | 2021-10-14 |
US11436802B2 (en) | 2022-09-06 |
CN110634177A (zh) | 2019-12-31 |
CA3104558A1 (en) | 2019-12-26 |
KR20210019552A (ko) | 2021-02-22 |
AU2019291441B2 (en) | 2023-07-06 |
SG11202012802RA (en) | 2021-01-28 |
EP3726476A4 (en) | 2021-04-07 |
US20220383579A1 (en) | 2022-12-01 |
AU2019291441A1 (en) | 2021-01-21 |
JP7176012B2 (ja) | 2022-11-21 |
CN111640176A (zh) | 2020-09-08 |
KR102524422B1 (ko) | 2023-04-20 |
EP3726476A1 (en) | 2020-10-21 |
BR112020025903A2 (pt) | 2021-03-16 |
US20200349765A1 (en) | 2020-11-05 |
CN111640175A (zh) | 2020-09-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019242454A1 (zh) | 一种物体建模运动方法、装置与设备 | |
CN111739146B (zh) | 物体三维模型重建方法及装置 | |
KR20180121494A (ko) | 단안 카메라들을 이용한 실시간 3d 캡처 및 라이브 피드백을 위한 방법 및 시스템 | |
US20220245912A1 (en) | Image display method and device | |
JP2006053694A (ja) | 空間シミュレータ、空間シミュレート方法、空間シミュレートプログラム、記録媒体 | |
Fei et al. | 3d gaussian splatting as new era: A survey | |
EP3533218B1 (en) | Simulating depth of field | |
WO2023066120A1 (zh) | 图像处理方法、装置、电子设备及存储介质 | |
CN113628327A (zh) | 一种头部三维重建方法及设备 | |
EP3980975B1 (en) | Method of inferring microdetail on skin animation | |
LU502672B1 (en) | A method for selecting scene points, distance measurement and a data processing apparatus | |
Nguyen et al. | High resolution 3d content creation using unconstrained and uncalibrated cameras | |
CA2716257A1 (en) | System and method for interactive painting of 2d images for iterative 3d modeling | |
Pan et al. | Research on technology production in Chinese virtual character industry | |
US20220164863A1 (en) | Object virtualization processing method and device, electronic device and storage medium | |
Lechlek et al. | Interactive hdr image-based rendering from unstructured ldr photographs | |
US20230196702A1 (en) | Object Deformation with Bindings and Deformers Interpolated from Key Poses | |
CN117611778A (zh) | 一种直播中背景替换方法、系统、存储介质及直播设备 | |
CN118898680A (zh) | 一种对象模型的构建方法、装置、电子设备和存储介质 | |
CN117726644A (zh) | 轮廓线绘制方法、装置、计算机设备和存储介质 | |
CN117876590A (zh) | 三维模型的重建方法、装置、计算机设备及存储介质 | |
CN116389704A (zh) | 视频处理方法、装置、计算机设备、存储介质和产品 | |
Huynh | Development of a standardized framework for cost-effective communication system based on 3D data streaming and real-time 3D reconstruction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 19821647 Country of ref document: EP Kind code of ref document: A1 |
ENP | Entry into the national phase | Ref document number: 2019821647 Country of ref document: EP Effective date: 20200713 |
ENP | Entry into the national phase | Ref document number: 2020570722 Country of ref document: JP Kind code of ref document: A |
ENP | Entry into the national phase | Ref document number: 3104558 Country of ref document: CA |
NENP | Non-entry into the national phase | Ref country code: DE |
REG | Reference to national code | Ref country code: BR Ref legal event code: B01A Ref document number: 112020025903 Country of ref document: BR |
ENP | Entry into the national phase | Ref document number: 20217001341 Country of ref document: KR Kind code of ref document: A |
ENP | Entry into the national phase | Ref document number: 2019291441 Country of ref document: AU Date of ref document: 20190527 Kind code of ref document: A |
ENP | Entry into the national phase | Ref document number: 112020025903 Country of ref document: BR Kind code of ref document: A2 Effective date: 20201217 |