CN106997236B - 基于多模态输入进行交互的方法和设备 - Google Patents
基于多模态输入进行交互的方法和设备 Download PDFInfo
- Publication number
- CN106997236B CN106997236B CN201610049586.XA CN201610049586A CN106997236B CN 106997236 B CN106997236 B CN 106997236B CN 201610049586 A CN201610049586 A CN 201610049586A CN 106997236 B CN106997236 B CN 106997236B
- Authority
- CN
- China
- Prior art keywords
- information
- input
- module
- element information
- structural data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 62
- 239000011521 glass Substances 0.000 claims abstract description 123
- 230000033001 locomotion Effects 0.000 claims abstract description 29
- 238000012545 processing Methods 0.000 claims abstract description 27
- 238000013527 convolutional neural network Methods 0.000 claims description 34
- 230000009471 action Effects 0.000 claims description 22
- 230000006870 function Effects 0.000 claims description 15
- 238000010801 machine learning Methods 0.000 claims description 12
- 230000006399 behavior Effects 0.000 claims description 10
- 241001269238 Data Species 0.000 claims description 9
- 239000011159 matrix material Substances 0.000 claims description 8
- 238000007637 random forest analysis Methods 0.000 claims description 6
- 239000011551 heat transfer agent Substances 0.000 claims description 4
- 238000004458 analytical method Methods 0.000 abstract description 47
- 230000002452 interceptive effect Effects 0.000 abstract description 34
- 230000003993 interaction Effects 0.000 abstract description 16
- 230000004927 fusion Effects 0.000 description 21
- 238000000605 extraction Methods 0.000 description 18
- 230000003190 augmentative effect Effects 0.000 description 17
- 238000012549 training Methods 0.000 description 15
- 238000013507 mapping Methods 0.000 description 14
- 230000008569 process Effects 0.000 description 13
- 238000010586 diagram Methods 0.000 description 12
- 238000013528 artificial neural network Methods 0.000 description 10
- 238000013135 deep learning Methods 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 210000002569 neuron Anatomy 0.000 description 10
- 239000000284 extract Substances 0.000 description 7
- 238000000926 separation method Methods 0.000 description 7
- 230000008859 change Effects 0.000 description 5
- 230000004913 activation Effects 0.000 description 4
- 238000013529 biological neural network Methods 0.000 description 4
- 238000004590 computer program Methods 0.000 description 4
- 238000003066 decision tree Methods 0.000 description 4
- 238000013136 deep learning model Methods 0.000 description 4
- 238000004925 denaturation Methods 0.000 description 4
- 230000036425 denaturation Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000006073 displacement reaction Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 238000007499 fusion processing Methods 0.000 description 4
- 230000017525 heat dissipation Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 238000000547 structure data Methods 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 230000001133 acceleration Effects 0.000 description 2
- 230000003139 buffering effect Effects 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 210000003128 head Anatomy 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000008447 perception Effects 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 239000011800 void material Substances 0.000 description 2
- 241000208340 Araliaceae Species 0.000 description 1
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 1
- 235000003140 Panax quinquefolius Nutrition 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000013479 data entry Methods 0.000 description 1
- 235000008434 ginseng Nutrition 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000012821 model calculation Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2413—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
- G06F18/24133—Distances to prototypes
- G06F18/24137—Distances to cluster centroïds
- G06F18/2414—Smoothing the distance, e.g. radial basis function networks [RBFN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/251—Fusion techniques of input or preprocessed data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/40—Software arrangements specially adapted for pattern recognition, e.g. user interfaces or toolboxes therefor
- G06F18/41—Interactive pattern learning with a human teacher
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
- G06F3/012—Head tracking input arrangements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/03—Arrangements for converting the position or the displacement of a member into a coded form
- G06F3/033—Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
- G06F3/038—Control and interface arrangements therefor, e.g. drivers or device-embedded control circuitry
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
- G06F3/0488—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
- G06F3/04883—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures for inputting data by handwriting, e.g. gesture or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/803—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of input or preprocessed data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
- G06V40/28—Recognition of hand or arm movements, e.g. recognition of deaf sign language
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2203/00—Indexing scheme relating to G06F3/00 - G06F3/048
- G06F2203/038—Indexing scheme relating to G06F3/038
- G06F2203/0381—Multimodal input, i.e. interface arrangements enabling the user to issue commands by simultaneous use of input devices of different nature, e.g. voice plus gesture on digitizer
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Computing Systems (AREA)
- Medical Informatics (AREA)
- Software Systems (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Psychiatry (AREA)
- Social Psychology (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
Description
Claims (16)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610049586.XA CN106997236B (zh) | 2016-01-25 | 2016-01-25 | 基于多模态输入进行交互的方法和设备 |
PCT/CN2017/078225 WO2017129149A1 (zh) | 2016-01-25 | 2017-03-25 | 基于多模态输入进行交互的方法和设备 |
US16/044,335 US10664060B2 (en) | 2016-01-25 | 2018-07-24 | Multimodal input-based interaction method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610049586.XA CN106997236B (zh) | 2016-01-25 | 2016-01-25 | 基于多模态输入进行交互的方法和设备 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106997236A CN106997236A (zh) | 2017-08-01 |
CN106997236B true CN106997236B (zh) | 2018-07-13 |
Family
ID=59397459
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610049586.XA Active CN106997236B (zh) | 2016-01-25 | 2016-01-25 | 基于多模态输入进行交互的方法和设备 |
Country Status (3)
Country | Link |
---|---|
US (1) | US10664060B2 (zh) |
CN (1) | CN106997236B (zh) |
WO (1) | WO2017129149A1 (zh) |
Families Citing this family (85)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
AU2014214676A1 (en) | 2013-02-07 | 2015-08-27 | Apple Inc. | Voice trigger for a digital assistant |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
CN106997236B (zh) * | 2016-01-25 | 2018-07-13 | 亮风台(上海)信息科技有限公司 | 基于多模态输入进行交互的方法和设备 |
US10303715B2 (en) | 2017-05-16 | 2019-05-28 | Apple Inc. | Intelligent automated assistant for media exploration |
CN107479706B (zh) * | 2017-08-14 | 2020-06-16 | 中国电子科技集团公司第二十八研究所 | 一种基于HoloLens的战场态势信息构建与交互实现方法 |
CN109656655A (zh) * | 2017-08-15 | 2019-04-19 | 北京蓦然认知科技有限公司 | 一种用于执行交互指令的方法、设备及存储介质 |
CN109426860A (zh) * | 2017-08-23 | 2019-03-05 | 幻视互动(北京)科技有限公司 | 一种基于神经网络的mr混合现实信息处理方法及装置 |
CN109583462A (zh) * | 2017-09-28 | 2019-04-05 | 幻视互动(北京)科技有限公司 | 基于深度神经网络的数据流处理方法、装置及系统 |
US11437032B2 (en) | 2017-09-29 | 2022-09-06 | Shanghai Cambricon Information Technology Co., Ltd | Image processing apparatus and method |
CN107831890A (zh) * | 2017-10-11 | 2018-03-23 | 北京华捷艾米科技有限公司 | 基于ar的人机交互方法、装置及设备 |
CN109725699B (zh) * | 2017-10-20 | 2022-05-20 | 荣耀终端有限公司 | 识别码的识别方法、装置和设备 |
CN107784355A (zh) * | 2017-10-26 | 2018-03-09 | 北京光年无限科技有限公司 | 虚拟人多模态交互数据处理方法和系统 |
US10732937B2 (en) * | 2017-10-31 | 2020-08-04 | Fujitsu Limited | Programming by voice |
US10394958B2 (en) * | 2017-11-09 | 2019-08-27 | Conduent Business Services, Llc | Performing semantic analyses of user-generated text content using a lexicon |
US10867054B2 (en) | 2017-11-14 | 2020-12-15 | Thomas STACHURA | Information security/privacy via a decoupled security accessory to an always listening assistant device |
US10872607B2 (en) | 2017-11-14 | 2020-12-22 | Thomas STACHURA | Information choice and security via a decoupled router with an always listening assistant device |
US10999733B2 (en) | 2017-11-14 | 2021-05-04 | Thomas STACHURA | Information security/privacy via a decoupled security accessory to an always listening device |
US11100913B2 (en) | 2017-11-14 | 2021-08-24 | Thomas STACHURA | Information security/privacy via a decoupled security cap to an always listening assistant device |
US10867623B2 (en) | 2017-11-14 | 2020-12-15 | Thomas STACHURA | Secure and private processing of gestures via video input |
CN110018979A (zh) * | 2018-01-09 | 2019-07-16 | 幻视互动(北京)科技有限公司 | 一种基于重构算法集并加速处理混合现实数据流的mr智能眼镜及方法 |
CN108334199A (zh) * | 2018-02-12 | 2018-07-27 | 华南理工大学 | 基于增强现实的移动式多模态交互方法及装置 |
US11663002B2 (en) | 2018-02-13 | 2023-05-30 | Shanghai Cambricon Information Technology Co., Ltd | Computing device and method |
EP3651078B1 (en) | 2018-02-13 | 2021-10-27 | Shanghai Cambricon Information Technology Co., Ltd | Computation device and method |
US11630666B2 (en) | 2018-02-13 | 2023-04-18 | Shanghai Cambricon Information Technology Co., Ltd | Computing device and method |
CN110162162B (zh) | 2018-02-14 | 2023-08-18 | 上海寒武纪信息科技有限公司 | 处理器的控制装置、方法及设备 |
US10839214B2 (en) * | 2018-03-13 | 2020-11-17 | International Business Machines Corporation | Automated intent to action mapping in augmented reality environments |
CN108406848A (zh) * | 2018-03-14 | 2018-08-17 | 安徽果力智能科技有限公司 | 一种基于场景分析的智能机器人及其运动控制方法 |
US11307880B2 (en) | 2018-04-20 | 2022-04-19 | Meta Platforms, Inc. | Assisting users with personalized and contextual communication content |
US11676220B2 (en) | 2018-04-20 | 2023-06-13 | Meta Platforms, Inc. | Processing multimodal user input for assistant systems |
US11715042B1 (en) | 2018-04-20 | 2023-08-01 | Meta Platforms Technologies, Llc | Interpretability of deep reinforcement learning models in assistant systems |
US10782986B2 (en) | 2018-04-20 | 2020-09-22 | Facebook, Inc. | Assisting users with personalized and contextual communication content |
US11886473B2 (en) | 2018-04-20 | 2024-01-30 | Meta Platforms, Inc. | Intent identification for agent matching by assistant systems |
EP3624020A4 (en) | 2018-05-18 | 2021-05-05 | Shanghai Cambricon Information Technology Co., Ltd | CALCULATION PROCEDURES AND RELATED PRODUCTS |
CN108874126B (zh) * | 2018-05-30 | 2021-08-31 | 北京致臻智造科技有限公司 | 基于虚拟现实设备的交互方法及系统 |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
KR102470893B1 (ko) | 2018-06-27 | 2022-11-25 | 상하이 캠브리콘 인포메이션 테크놀로지 컴퍼니 리미티드 | 온 칩 코드의 브레이크 포인트에 의한 디버그 방법, 온 칩 프로세서 및 브레이크 포인트에 의한 칩 디버그 시스템 |
CN108921081B (zh) * | 2018-06-27 | 2020-10-09 | 百度在线网络技术(北京)有限公司 | 用户操作的检测方法和装置 |
KR102519467B1 (ko) | 2018-08-28 | 2023-04-06 | 캠브리콘 테크놀로지스 코퍼레이션 리미티드 | 데이터 전처리 방법, 장치, 컴퓨터 설비 및 저장 매체 |
US11703939B2 (en) | 2018-09-28 | 2023-07-18 | Shanghai Cambricon Information Technology Co., Ltd | Signal processing device and related products |
CN111385462A (zh) | 2018-12-28 | 2020-07-07 | 上海寒武纪信息科技有限公司 | 信号处理装置、信号处理方法及相关产品 |
CN109858524B (zh) * | 2019-01-04 | 2020-10-16 | 北京达佳互联信息技术有限公司 | 手势识别方法、装置、电子设备及存储介质 |
CN113728380A (zh) | 2019-02-07 | 2021-11-30 | 托马斯·斯塔胡拉 | 用于智能扬声器的隐私装置 |
US11176935B2 (en) * | 2019-02-15 | 2021-11-16 | Wipro Limited | System and method for controlling devices through voice interaction |
US11741951B2 (en) * | 2019-02-22 | 2023-08-29 | Lenovo (Singapore) Pte. Ltd. | Context enabled voice commands |
CN109814726B (zh) * | 2019-02-28 | 2022-07-01 | 亮风台(上海)信息科技有限公司 | 一种执行智能交互处理模块的方法与设备 |
US11847554B2 (en) | 2019-04-18 | 2023-12-19 | Cambricon Technologies Corporation Limited | Data processing method and related products |
CN111832737B (zh) | 2019-04-18 | 2024-01-09 | 中科寒武纪科技股份有限公司 | 一种数据处理方法及相关产品 |
CN110109541B (zh) * | 2019-04-25 | 2022-04-05 | 广州智伴人工智能科技有限公司 | 一种多模态交互的方法 |
US11676028B2 (en) | 2019-06-12 | 2023-06-13 | Shanghai Cambricon Information Technology Co., Ltd | Neural network quantization parameter determination method and related products |
CN112085192B (zh) | 2019-06-12 | 2024-03-29 | 上海寒武纪信息科技有限公司 | 一种神经网络的量化参数确定方法及相关产品 |
CN110288016B (zh) * | 2019-06-21 | 2021-09-28 | 济南大学 | 一种多模态意图融合方法及应用 |
CN110196642B (zh) * | 2019-06-21 | 2022-05-17 | 济南大学 | 一种基于意图理解模型的导航式虚拟显微镜 |
CN110597382B (zh) * | 2019-08-08 | 2023-03-17 | 中广核工程有限公司 | 一种核电站控制室多通道融合人机交互方法以及系统 |
US12001955B2 (en) | 2019-08-23 | 2024-06-04 | Anhui Cambricon Information Technology Co., Ltd. | Data processing method, device, computer equipment and storage medium |
JP7146953B2 (ja) | 2019-08-27 | 2022-10-04 | 安徽寒武紀信息科技有限公司 | データ処理方法、装置、コンピュータデバイス、及び記憶媒体 |
TWI731442B (zh) * | 2019-10-18 | 2021-06-21 | 宏碁股份有限公司 | 電子裝置及其利用觸控資料的物件資訊辨識方法 |
CN111177346B (zh) * | 2019-12-19 | 2022-10-14 | 爱驰汽车有限公司 | 人机交互方法、装置、电子设备、存储介质 |
CN111143539B (zh) * | 2019-12-31 | 2023-06-23 | 重庆和贯科技有限公司 | 基于知识图谱的教学领域问答方法 |
CN111274910B (zh) * | 2020-01-16 | 2024-01-30 | 腾讯科技(深圳)有限公司 | 场景互动方法、装置及电子设备 |
CN112306352A (zh) * | 2020-02-24 | 2021-02-02 | 北京字节跳动网络技术有限公司 | 用于处理信息的系统、方法和装置 |
CN113917687A (zh) * | 2020-07-08 | 2022-01-11 | 佐臻股份有限公司 | 智能眼镜轻量化装置 |
US11495226B2 (en) * | 2020-07-14 | 2022-11-08 | Disney Enterprises, Inc. | System and method for configurable control of voice command systems |
CN111857370B (zh) * | 2020-07-27 | 2022-03-15 | 吉林大学 | 一种多通道交互设备研发平台 |
CN111736709A (zh) * | 2020-08-25 | 2020-10-02 | 北京深光科技有限公司 | Ar眼镜控制方法、设备、存储介质及装置 |
CN111968470B (zh) * | 2020-09-02 | 2022-05-17 | 济南大学 | 一种面向虚实融合的闯关交互式实验方法和系统 |
CN112099630B (zh) * | 2020-09-11 | 2024-04-05 | 济南大学 | 一种多模态意图逆向主动融合的人机交互方法 |
CN112099633A (zh) * | 2020-09-16 | 2020-12-18 | 济南大学 | 一种多模态感知的智能实验方法及装置 |
CN112506125B (zh) * | 2020-11-19 | 2024-07-09 | 北京海云捷迅科技股份有限公司 | 一种多模态控制方法、装置和系统 |
CN112835447A (zh) * | 2021-01-22 | 2021-05-25 | 哈尔滨工业大学 | 穿戴式计算机多通道人机交互方法、装置、设备及系统 |
CN113518114B (zh) * | 2021-05-12 | 2024-07-12 | 江苏力行电力电子科技有限公司 | 一种基于多模态自组网的人工智能控制方法及系统 |
CN113656546A (zh) * | 2021-08-17 | 2021-11-16 | 百度在线网络技术(北京)有限公司 | 多模态搜索方法、装置、设备、存储介质以及程序产品 |
US20230076716A1 (en) * | 2021-09-03 | 2023-03-09 | Apple Inc. | Multi-device gesture control |
CN113806609B (zh) * | 2021-09-26 | 2022-07-12 | 郑州轻工业大学 | 一种基于mit和fsm的多模态情感分析方法 |
CN116522947A (zh) * | 2022-01-20 | 2023-08-01 | 北京邮电大学 | 基于智简网络的信息发送方法、装置、电子设备及介质 |
CN114881179B (zh) * | 2022-07-08 | 2022-09-06 | 济南大学 | 一种基于意图理解的智能实验方法 |
CN115329578A (zh) * | 2022-08-19 | 2022-11-11 | 南京邮电大学 | 基于多模态融合的三维建模系统及建模方法 |
JP2024048680A (ja) * | 2022-09-28 | 2024-04-09 | キヤノン株式会社 | 制御装置、制御方法、プログラム |
CN115756161B (zh) * | 2022-11-15 | 2023-09-26 | 华南理工大学 | 多模态交互结构力学分析方法、系统、计算机设备及介质 |
CN115797655B (zh) * | 2022-12-13 | 2023-11-07 | 南京恩博科技有限公司 | 一种人物交互检测模型、方法、系统及装置 |
CN116994069B (zh) * | 2023-09-22 | 2023-12-22 | 武汉纺织大学 | 一种基于多模态信息的图像解析方法及系统 |
CN118035689A (zh) * | 2024-04-11 | 2024-05-14 | 中国信息通信研究院 | 基于实时三维模型重建的智能设备线上操作系统 |
CN118535023B (zh) * | 2024-07-26 | 2024-10-01 | 杭州联创信息技术有限公司 | 一种基于多模态下的视觉交互系统 |
CN118585071B (zh) * | 2024-08-07 | 2024-10-22 | 杭州李未可科技有限公司 | 基于ar眼镜的多模态大模型的主动交互系统 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102298694A (zh) * | 2011-06-21 | 2011-12-28 | 广东爱科数字科技有限公司 | 一种应用于远程信息服务的人机交互识别系统 |
CN102824092A (zh) * | 2012-08-31 | 2012-12-19 | 华南理工大学 | 一种窗帘的智能手势和语音控制系统及其控制方法 |
CN103793060A (zh) * | 2014-02-14 | 2014-05-14 | 杨智 | 一种用户交互系统和方法 |
CN104238726A (zh) * | 2013-06-17 | 2014-12-24 | 腾讯科技(深圳)有限公司 | 智能眼镜控制方法、装置及一种智能眼镜 |
CN104965592A (zh) * | 2015-07-08 | 2015-10-07 | 苏州思必驰信息科技有限公司 | 基于语音和手势识别的多模态非触摸人机交互方法及系统 |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5712658A (en) * | 1993-12-28 | 1998-01-27 | Hitachi, Ltd. | Information presentation apparatus and information display apparatus |
JP2000132305A (ja) * | 1998-10-23 | 2000-05-12 | Olympus Optical Co Ltd | 操作入力装置 |
US7148879B2 (en) * | 2000-07-06 | 2006-12-12 | At&T Corp. | Bioacoustic control system, method and apparatus |
US8570378B2 (en) * | 2002-07-27 | 2013-10-29 | Sony Computer Entertainment Inc. | Method and apparatus for tracking three-dimensional movements of an object using a depth sensing camera |
US8313380B2 (en) * | 2002-07-27 | 2012-11-20 | Sony Computer Entertainment America Llc | Scheme for translating movements of a hand-held controller into inputs for a system |
US20100194694A1 (en) * | 2009-01-30 | 2010-08-05 | Nokia Corporation | Method and Apparatus for Continuous Stroke Input |
JP5617246B2 (ja) * | 2010-01-12 | 2014-11-05 | ソニー株式会社 | 画像処理装置、物体選択方法及びプログラム |
TWI590133B (zh) * | 2010-12-31 | 2017-07-01 | 樂金顯示科技股份有限公司 | 驅動觸控感測器之設備及方法 |
CN103412640A (zh) * | 2013-05-16 | 2013-11-27 | 胡三清 | 牙齿控制的字符或命令输入的装置及方法 |
CN105518262B (zh) * | 2013-09-05 | 2018-02-13 | 斗山英维高株式会社 | 用于去除硫氧化物的排气后处理装置及方法 |
US9405415B2 (en) * | 2013-10-01 | 2016-08-02 | Synaptics Incorporated | Targeted transcapacitance sensing for a matrix sensor |
US9594433B2 (en) * | 2013-11-05 | 2017-03-14 | At&T Intellectual Property I, L.P. | Gesture-based controls via bone conduction |
US10338678B2 (en) * | 2014-01-07 | 2019-07-02 | Nod, Inc. | Methods and apparatus for recognition of start and/or stop portions of a gesture using an auxiliary sensor |
KR101749070B1 (ko) * | 2015-11-02 | 2017-06-20 | 현대자동차주식회사 | 사용자 인터페이스 평가 장치 및 그 평가 방법 |
CN106997236B (zh) * | 2016-01-25 | 2018-07-13 | 亮风台(上海)信息科技有限公司 | 基于多模态输入进行交互的方法和设备 |
CN106997235B (zh) * | 2016-01-25 | 2018-07-13 | 亮风台(上海)信息科技有限公司 | 用于实现增强现实交互和展示的方法、设备 |
-
2016
- 2016-01-25 CN CN201610049586.XA patent/CN106997236B/zh active Active
-
2017
- 2017-03-25 WO PCT/CN2017/078225 patent/WO2017129149A1/zh active Application Filing
-
2018
- 2018-07-24 US US16/044,335 patent/US10664060B2/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102298694A (zh) * | 2011-06-21 | 2011-12-28 | 广东爱科数字科技有限公司 | 一种应用于远程信息服务的人机交互识别系统 |
CN102824092A (zh) * | 2012-08-31 | 2012-12-19 | 华南理工大学 | 一种窗帘的智能手势和语音控制系统及其控制方法 |
CN104238726A (zh) * | 2013-06-17 | 2014-12-24 | 腾讯科技(深圳)有限公司 | 智能眼镜控制方法、装置及一种智能眼镜 |
CN103793060A (zh) * | 2014-02-14 | 2014-05-14 | 杨智 | 一种用户交互系统和方法 |
CN104965592A (zh) * | 2015-07-08 | 2015-10-07 | 苏州思必驰信息科技有限公司 | 基于语音和手势识别的多模态非触摸人机交互方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
US20180329512A1 (en) | 2018-11-15 |
US10664060B2 (en) | 2020-05-26 |
WO2017129149A1 (zh) | 2017-08-03 |
CN106997236A (zh) | 2017-08-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106997236B (zh) | 基于多模态输入进行交互的方法和设备 | |
US20210056764A1 (en) | Transmodal input fusion for a wearable system | |
US11797105B2 (en) | Multi-modal hand location and orientation for avatar movement | |
US9081419B2 (en) | Natural gesture based user interface methods and systems | |
JP2021163456A (ja) | クロスモーダル処理方法、装置、電子機器及びコンピュータ記憶媒体 | |
CN111651035B (zh) | 一种基于多模态交互的虚拟实验系统及方法 | |
CN108475113B (zh) | 用于检测用户的手部姿态的方法、系统和介质 | |
CN102222431A (zh) | 基于机器的手语翻译器 | |
CN111418198A (zh) | 提供文本相关图像的电子装置及其操作方法 | |
Krishnaswamy et al. | Communicating and acting: Understanding gesture in simulation semantics | |
WO2024078088A1 (zh) | 互动处理方法及装置 | |
CN109416570A (zh) | 使用有限状态机和姿态语言离散值的手部姿态api | |
Mohd et al. | Multi-modal data fusion in enhancing human-machine interaction for robotic applications: a survey | |
Shahzad et al. | Role of zoning in facial expression using deep learning | |
Farhadi et al. | Domain adaptation in reinforcement learning: a comprehensive and systematic study | |
Nguen et al. | Deep CNN-based recognition of JSL finger spelling | |
WO2020195017A1 (ja) | 経路認識方法、経路認識装置、経路認識プログラム、及び経路認識プログラム記録媒体 | |
WO2017116878A1 (en) | Multimodal interaction using a state machine and hand gestures discrete values | |
CN113176822A (zh) | 虚拟用户检测 | |
Carrino et al. | Gesture-based hybrid approach for HCI in ambient intelligent environmments | |
CN117369649B (zh) | 基于本体感觉的虚拟现实交互系统及方法 | |
Meng et al. | A smart glove of combining virtual and real environments for chemical experiment | |
JP2001229398A (ja) | パフォーマンス動画ジェスチャーの取得及び動画キャラクター上での再生方法及び装置 | |
Piumsomboon | Natural hand interaction for augmented reality. | |
Nowosielski | Swipe-like text entry by head movements and a single row keyboard |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Liao Chunyuan Inventor after: Tang Rongxing Inventor after: Huang Mei Inventor before: Liao Chunyuan Inventor before: Tang Rongxing Inventor before: Ling Haibin Inventor before: Huang Mei |
|
CB03 | Change of inventor or designer information | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Methods and equipment for interaction based on multimodal input Effective date of registration: 20221008 Granted publication date: 20180713 Pledgee: Industrial Bank Co.,Ltd. Shanghai Xuhui sub branch Pledgor: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Registration number: Y2022310000277 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
CP02 | Change in the address of a patent holder |
Address after: 7th Floor, No. 1, Lane 5005, Shenjiang Road, Pudong New Area Free Trade Pilot Zone, Shanghai, October 2012 Patentee after: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Address before: Room 1109, No. 570, Shengxia Road, Zhangjiang High-tech Park, Pudong New Area, Shanghai, March 2012 Patentee before: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. |
|
CP02 | Change in the address of a patent holder | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20230906 Granted publication date: 20180713 Pledgee: Industrial Bank Co.,Ltd. Shanghai Xuhui sub branch Pledgor: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Registration number: Y2022310000277 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Methods and devices for interacting based on multimodal inputs Effective date of registration: 20231107 Granted publication date: 20180713 Pledgee: Industrial Bank Co.,Ltd. Shanghai Caohejing sub branch Pledgor: HISCENE INFORMATION TECHNOLOGY Co.,Ltd. Registration number: Y2023310000719 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |