CN102629246A - Server used for recognizing browser voice commands and browser voice command recognition system - Google Patents

Server used for recognizing browser voice commands and browser voice command recognition system

Info

Publication number
CN102629246A
Authority
CN
China
Prior art keywords
browser
text
voice command
recognition
command
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012100297926A
Other languages
Chinese (zh)
Other versions
CN102629246B (en)
Inventor
喻俨
王瑜
杨永智
刘铁锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
All China (wuhan) Information Technology Co Ltd
Original Assignee
BEIJING MOBO TAP TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING MOBO TAP TECHNOLOGY Co Ltd
Priority to CN201210029792.6A
Publication of CN102629246A
Application granted
Publication of CN102629246B
Legal status: Expired - Fee Related
Anticipated expiration legal-status Critical

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The invention provides a server for recognizing browser voice commands and a browser voice command recognition system, which allow a user to control by voice the web pages browsed on a user terminal, and to open web pages and obtain search results directly by voice. The server comprises a communication device for receiving the browser voice commands sent by the user terminal, a speech recognition device for recognizing the speech of the browser voice commands as text, and a semantic recognition device for performing semantic recognition on the recognized text so as to convert it into browser commands. The invention further provides a browser voice command recognition method.

Description

Server for recognizing browser voice commands and browser voice command recognition method
Technical field
The present invention relates to the field of speech processing. More specifically, it relates to a server and a method for recognizing browser voice commands, and to a browser voice command recognition system and its method, which can intelligently recognize the browser voice commands input by a user so that the user can control by voice the web pages browsed on a user terminal.
Background art
In recent years, with the rapid development and widespread application of speech recognition technology, voice has become a convenient means of interaction for users. Speech recognition converts the lexical content of human speech into text (speech to text), so that a user can enter text by speaking. On mobile phones, speech recognition makes interaction between people and the phone easier. With voice dialing, for example, the user only needs to say the callee's name and the phone dials the callee automatically, which saves the user the time of looking up the number. Semantic recognition is the intelligent analysis and judgment of the meaning of text, and it is usually built on top of accurate speech recognition; an example is the Siri voice assistant used on Apple's iPhone. Siri lets phone users control the phone by voice, and it provides a question-and-answer service through natural language understanding, learning, and the use of context. Speech and semantic recognition technologies are gradually being applied in browsers: Google has added voice search to the Chrome browser, and Tencent and UCWeb have each released voice versions of their mobile browsers. However, these applications still leave something to be desired in human-computer interaction, mainly in the following two respects:
1. The existing process of accessing the Internet by voice through a browser can only map recognized text to a URL. When browsing by voice, the user is limited to opening websites that are already known. For example, when the user says "open Sina", the browser looks up a text-to-website mapping table and opens "www.sina.com".
2. In addition, most web pages provide no voice interaction interface. Google provides voice search, but its scope is limited to input in Google's search box. When the user wants to click a button or a link on a web page, submit a form, and so on, a mouse and keyboard are still needed.
Summary of the invention
The present invention was made to enable free interaction between the user and the user terminal and intelligent voice-based web browsing. Its object is to provide a server and a method for recognizing browser voice commands, and a browser voice command recognition system and its method, in which both speech recognition and semantic recognition are performed on the user's browser voice command. The user can thus control by voice the web pages browsed on the user terminal, and can open web pages and obtain search results directly by voice. The user terminal becomes more intelligent and user-friendly, "communication" between the user and the user terminal becomes convenient and timely, and auxiliary devices such as a mouse and keyboard are not needed.
According to a first aspect of the invention, a server for recognizing browser voice commands is provided, comprising: a communication device for receiving a browser voice command sent by a user terminal; a speech recognition device for recognizing the speech of the browser voice command as text; and a semantic recognition device for performing semantic recognition on the recognized text so as to convert it into a browser command.
According to a second aspect of the invention, a browser voice command recognition method is provided, comprising: a communication step of receiving a browser voice command sent by a user terminal; a speech recognition step of recognizing the speech of the browser voice command as text; and a semantic recognition step of performing semantic recognition on the recognized text so as to convert it into a browser command.
According to a third aspect of the invention, a browser voice command recognition system is provided, comprising a user terminal and a server connected to the user terminal through a network, wherein the user terminal comprises: an input device for receiving a browser voice command input by the user; a speech recognition device for recognizing the speech of the browser voice command as text; and a first communication device for sending the recognized text to the server; and the server comprises: a second communication device for receiving the recognized text; and a semantic recognition device for performing semantic recognition on the recognized text so as to convert it into a browser command.
According to a fourth aspect of the invention, a browser voice command recognition method is provided, comprising: an input step in which the user terminal receives a browser voice command input by the user; a speech recognition step in which the user terminal recognizes the speech of the browser voice command as text; a first communication step in which the user terminal sends the recognized text to the server; a second communication step in which the server receives the recognized text; and a semantic recognition step in which the server performs semantic recognition on the recognized text so as to convert it into a browser command.
Brief description of the drawings
The above features and advantages of the invention will become more apparent from the following detailed description taken in conjunction with the accompanying drawings, in which:
Fig. 1a is a schematic diagram of the browser voice command recognition system according to the first embodiment of the invention;
Fig. 1b is a schematic diagram of the browser voice command recognition system according to the second embodiment of the invention;
Fig. 2 is a schematic diagram of the semantic recognition device of the browser voice command recognition system;
Fig. 3 is an example of the order of the taggers used by the part-of-speech tagging unit of the semantic recognition device;
Fig. 4 is a flowchart of the browser voice command recognition method performed by the browser voice command recognition system according to the first embodiment of the invention;
Fig. 5 is a flowchart of the semantic recognition method;
Fig. 6 is a schematic diagram of the browser voice command recognition system according to the third embodiment of the invention;
Fig. 7 is an example of a keyword-based browser voice command;
Fig. 8a is an example of the current web page in an interactive operation by browser voice command;
Fig. 8b is an example of the interactive operation database matching table of the invention;
Fig. 9 is an example of recognizing a browser voice command.
Detailed description of the embodiments
Preferred embodiments of the invention are described below with reference to the drawings. In the drawings, identical elements are denoted by identical reference symbols or numerals. In addition, detailed descriptions of known functions and configurations are omitted from the following description so as not to obscure the subject matter of the invention.
Fig. 1a shows the browser voice command recognition system according to the first embodiment of the invention. The system comprises a user terminal 1 and a server 2 connected to the user terminal through a communication network (not shown). The user terminal 1 comprises: a speech recognition device 10 for using a model library to recognize the speech input by the user as text (natural language text); a first matching device 12 for matching the recognized text against a stored mapping table subset; and a first judgment device 14 for deciding, according to the matching result, whether to execute on the user terminal the browser command that matches the recognized text or to send the recognized text to the server. The user terminal 1 also comprises an input/output device, a communication device, a storage device, and so on, which are not shown here for clarity. The server 2 comprises: a second matching device 22 for matching the received recognized text against a stored mapping table; a second judgment device 24 for deciding, according to the matching result, whether to perform semantic recognition on the recognized text (on an exact match, the second judgment device 24 looks up the corresponding command in the mapping table and sends it to the user terminal; otherwise it decides that semantic recognition is to be performed); and a semantic recognition device 20 for performing semantic recognition on the recognized text. The server also comprises a communication device and a storage device (not shown) that stores databases such as a dictionary, a corpus, a URL database, a parameter database, a relation database, and an interactive operation database.
The user terminal 1 includes, but is not limited to, wired and wireless communication devices such as mobile phones, PDAs (personal digital assistants), and computers. It will be clear to those skilled in the art that the first matching device 12 and the first judgment device 14, as well as the second matching device 22 and the second judgment device 24, are optional.
Fig. 2 is a schematic diagram of the semantic recognition device of the browser voice command recognition system. The semantic recognition device 20 converts text into specific instructions that the browser can understand. It comprises: a data preprocessing unit 201, a word segmentation unit 202, a part-of-speech tagging unit 203, a parsing unit 204, an extraction unit 205, and a conversion unit 206.
How the browser voice command recognition system recognizes a browser voice command input by the user is described in detail below with reference to Figs. 2 to 5.
Referring first to Fig. 4, recognition by the browser voice command recognition system comprises two stages: speech recognition and semantic recognition. After the user's browser voice command is received, the speech recognition stage performs the conversion from speech to text, and the semantic recognition stage converts the text into specific instructions that the browser can understand. During recognition, the user terminal serves as a local cache: if matching succeeds there, the command is executed directly on the user terminal; otherwise the server performs semantic analysis. This speeds up responses and reduces the user's data traffic. It will be understood that the speech recognition stage can be performed either on the user terminal or on the server.
Specifically, at step S401, the user terminal 1 receives the browser voice command input by the user. At step S402, the speech recognition device 10 extracts features from the browser voice command and matches them against the model library to convert the command into text. The recognition technique used is known and is not described again here. At step S403, the first matching device 12 performs an exact match between the converted text and a mapping table subset. The mapping table subset is a subset of the text-to-browser-command mapping table, in which the browser commands include adding a bookmark, opening a bookmark, and so on. At step S404, when matching succeeds, the first judgment device 14 passes the command corresponding to the text to the browser on the client for direct execution (S408); when matching fails, it sends the text to the server 2 for processing.
At step S405, the second matching device 22 of the server 2 first performs an exact match between the recognized text and the mapping table. The mapping table maps text to browser commands, and the browser commands include, for example, refreshing the page, going forward, going back, querying the history, and opening a bookmark. At step S406, when matching succeeds, the second judgment device 24 sends the command corresponding to the text to the browser on the client for execution; when matching fails, it passes the text to the semantic recognition device 20. At step S407, the semantic recognition device 20 performs semantic recognition on the text and matches the semantic recognition result against the databases. The server 2 then sends the command obtained by matching to the client browser for execution.
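The two-level matching of steps S403 to S407 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the table entries, the command names, and the placeholder `semantic_recognition` function are all hypothetical.

```python
# Hypothetical mapping tables; the patent does not list the actual entries.
CLIENT_SUBSET = {"add bookmark": "BOOKMARK_ADD", "open bookmarks": "BOOKMARK_OPEN"}
SERVER_TABLE = {
    **CLIENT_SUBSET,
    "refresh": "RELOAD",
    "go forward": "FORWARD",
    "go back": "BACK",
    "show history": "HISTORY",
}


def semantic_recognition(text):
    """Placeholder for the semantic recognition device 20 (step S407)."""
    return "SEARCH " + text


def server_resolve(text):
    # S405/S406: exact match against the full mapping table first.
    command = SERVER_TABLE.get(text)
    if command is not None:
        return command, "server-table"
    # S407: fall back to semantic recognition when no exact match exists.
    return semantic_recognition(text), "semantic"


def resolve(text):
    # S403/S404: the terminal tries its cached subset before using the network.
    command = CLIENT_SUBSET.get(text)
    if command is not None:
        return command, "client-cache"
    return server_resolve(text)
```

The second element of the returned pair only records which level answered, which makes the cache behaviour easy to observe: `resolve("add bookmark")` is answered on the terminal, while `resolve("refresh")` requires the server table.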
How the semantic recognition device 20 performs semantic recognition is described in detail below with reference to Fig. 5. Although the invention is illustrated with semantic recognition of Chinese and English, those skilled in the art will understand that the invention can also perform semantic recognition on other languages.
First, at step S501, the data preprocessing unit 201 uses a homophone dictionary to clean and correct the recognized text. The reasons are as follows. 1. Because of interference from the language itself, the speaker's accent, background noise, and so on, the accuracy of speech recognition cannot reach 100%, so there is room for error correction. 2. Text converted from speech is affected by various factors, such as the tone of speech and meaningless interjections, so it cannot be guaranteed to be a grammatically well-formed sentence. The text therefore needs preprocessing that does not damage the existing data: on the one hand, modal particles and other words irrelevant to recognition are removed; on the other hand, some of the data in the text is corrected through statistical fuzzy matching of homophones, which improves confidence.
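The cleanup in step S501 can be illustrated with a small sketch. It is a simplified stand-in: the patent describes statistical fuzzy matching of homophones, whereas this example uses a plain dictionary lookup, and the filler list, homophone dictionary, and command vocabulary are all assumptions made for illustration.

```python
import re

FILLERS = {"uh", "um", "er", "hmm"}         # assumed filler words
HOMOPHONES = {"by": "buy", "bye": "buy"}    # assumed homophone dictionary
COMMAND_WORDS = {"buy", "open", "search"}   # assumed command vocabulary


def preprocess(text):
    """Drop filler words and replace homophones of known command words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    cleaned = []
    for token in tokens:
        if token in FILLERS:
            continue
        # Only correct a token when its homophone is a known command word.
        candidate = HOMOPHONES.get(token)
        if candidate in COMMAND_WORDS:
            token = candidate
        cleaned.append(token)
    return " ".join(cleaned)
```

For example, `preprocess("Uh, by down jackets")` removes the interjection and corrects the misrecognized verb, giving `"buy down jackets"`.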
Then, at step S502, the word segmentation unit 202 segments the preprocessed text into words. The smallest linguistic unit on which semantic recognition of text relies is the word, not the single character. The accuracy of word segmentation depends on two things: the algorithm and the dictionary. The dictionary used by the invention contains all the commands supported by the browser, which improves segmentation accuracy for browser commands.
Different languages are built differently and therefore need different segmentation techniques. English, for example, takes the word as its unit, and words are separated by spaces, whereas Chinese takes the character as its unit: adjacent characters join to form a word, and there is no explicit delimiter. The word segmentation unit 202 of the invention therefore uses simple regular-expression segmentation together with the dictionary-based segmentation algorithm MMSEG (A Word Identification System for Mandarin Chinese Text Based on Two Variants of the Maximum Matching Algorithm), and thereby segments both English and Chinese.
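The two segmentation strategies can be sketched as follows. The Chinese part shows only plain forward maximum matching, the basic idea that MMSEG builds on; the ambiguity-resolution rules of MMSEG itself are not reproduced here, and the small dictionary is hypothetical.

```python
import re


def segment_english(text):
    """Regular-expression segmentation: words are runs of letters, digits, or apostrophes."""
    return re.findall(r"[A-Za-z0-9']+", text)


def segment_chinese(text, dictionary, max_len=4):
    """Forward maximum matching: take the longest dictionary word at each position."""
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted so that the scan advances.
            if size == 1 or piece in dictionary:
                words.append(piece)
                i += size
                break
    return words
```

With the assumed dictionary `{"打开", "新浪", "首页"}`, the input "打开新浪首页" ("open Sina homepage") is split into the three words 打开, 新浪, and 首页. Adding browser command words to the dictionary, as the text describes, is what lets the matcher keep such commands intact.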
At step S503, the part-of-speech tagging unit 203 tags the segmentation result with parts of speech. The same word may have different parts of speech in different contexts (that is, depending on the text before and after it in the sentence). Part-of-speech tagging is a process of statistics and training over a large amount of corpus data. The corpus used by the invention collects data from a large number of test users using browser commands, and the taggers are then trained on the annotated browser-related commands. The part-of-speech tagging unit 203 uses several taggers to perform chained N-gram tagging. The order of the taggers is set as shown in Fig. 3. The special tagger can be adjusted manually and is used to correct tagging errors or to force a specific part of speech. The default tagger gives a special tag to every word that was not successfully tagged and records it in the server log for later analysis and processing.
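A chained tagger of this kind can be sketched in a few lines. The order used here (special tagger, then bigram, then unigram, then default) is an assumption, since Fig. 3 is not reproduced in the text, and the tag names and training data are invented for illustration.

```python
from collections import Counter, defaultdict


def most_frequent(table):
    """Reduce each Counter of tags to its single most frequent tag."""
    return {key: counts.most_common(1)[0][0] for key, counts in table.items()}


def train(tagged_sentences):
    """Build unigram (word) and bigram (previous tag, word) lookup tables."""
    unigram, bigram = defaultdict(Counter), defaultdict(Counter)
    for sentence in tagged_sentences:
        prev = "<s>"
        for word, tag in sentence:
            unigram[word][tag] += 1
            bigram[(prev, word)][tag] += 1
            prev = tag
    return most_frequent(unigram), most_frequent(bigram)


def tag(words, special, unigram, bigram, default="UNK"):
    """Try each tagger in turn and fall back to the next one on a miss."""
    tagged, prev = [], "<s>"
    for word in words:
        chosen = (special.get(word)              # manually maintained overrides
                  or bigram.get((prev, word))    # most context-specific model
                  or unigram.get(word)           # context-free fallback
                  or default)                    # marks words that were not tagged
        tagged.append((word, chosen))
        prev = chosen
    return tagged
```

Words that end up with the default tag are the ones the text says should be logged on the server for later analysis; the sketch only marks them.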
At step S504, the parsing unit 204 parses and chunks the tagged words. Steps S501 to S503 complete the fine-grained processing of information at the word level; the parsing unit 204 performs parsing and chunking at the level of language structure, that is, it analyzes the text and resolves ambiguity at the syntactic level. The algorithm used by the parsing unit 204 is Earley chart parsing. For the browser commands of each language, the browser voice command recognition system of the invention defines a set of dynamically adjustable rules for parsing, for example a CFG (context-free grammar). The final result of rule-based analysis is a syntax parse tree.
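The following sketch shows the core of Earley chart parsing on a toy context-free grammar. It is only a recognizer (it answers whether a token sequence is a valid command and does not build the parse tree), it assumes a grammar without empty productions, and the grammar itself is hypothetical.

```python
GRAMMAR = {  # hypothetical toy grammar for browser commands
    "CMD": [["ACTION", "TARGET"], ["ACTION"]],
    "ACTION": [["open"], ["search"], ["refresh"]],
    "TARGET": [["SITE"], ["SITE", "PAGE"]],
    "SITE": [["sina"], ["taobao"]],
    "PAGE": [["homepage"]],
}


def earley_recognize(tokens, grammar, start="CMD"):
    # A state is (lhs, rhs, dot, origin); chart[i] holds states ending at position i.
    chart = [set() for _ in range(len(tokens) + 1)]
    chart[0] = {(start, tuple(rhs), 0, 0) for rhs in grammar[start]}
    for i in range(len(tokens) + 1):
        agenda = list(chart[i])
        while agenda:
            lhs, rhs, dot, origin = agenda.pop()
            if dot < len(rhs):
                symbol = rhs[dot]
                if symbol in grammar:
                    # Predictor: expand the non-terminal after the dot.
                    new, target = [(symbol, tuple(r), 0, i) for r in grammar[symbol]], i
                elif i < len(tokens) and tokens[i] == symbol:
                    # Scanner: the next token matches the terminal after the dot.
                    new, target = [(lhs, rhs, dot + 1, origin)], i + 1
                else:
                    continue
            else:
                # Completer: advance every state that was waiting for this rule.
                new = [(l, r, d + 1, o) for (l, r, d, o) in chart[origin]
                       if d < len(r) and r[d] == lhs]
                target = i
            for state in new:
                if state not in chart[target]:
                    chart[target].add(state)
                    if target == i:
                        agenda.append(state)
    return any(l == start and d == len(r) and o == 0
               for (l, r, d, o) in chart[len(tokens)])
```

Because the grammar is plain data, rules can be added or changed without touching the parser, which matches the text's point that the rules are dynamically adjustable.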
At step S505, the extraction unit 205 performs entity extraction on the syntax parse tree. The entities extracted by the extraction unit 205 are all objects that the browser voice command needs to pay attention to. Entity extraction is performed as a chain: the text corresponding to the browser voice command goes through the extraction stages one after another in order of priority. The extraction unit 205 first uses the keyword database to extract keywords from the text. If there is no matching keyword, it extracts the corresponding action and arguments. As soon as one stage succeeds, the result is returned. If no entity can be extracted by the end of the chain, the text is used as a search argument and a search command is executed on the whole text. For example, if the text corresponding to the user's voice input is "butterfly" and no keyword in the keyword database matches, the user's browser automatically opens the Baidu page and searches for "butterfly". Whether Baidu, Google, or another search engine is used depends on the settings of the user's browser. The keyword database contains social networking sites (Facebook, Renren, and so on), e-commerce sites (Amazon, Taobao, and so on), and the search terms users commonly use when browsing (such as "watch movies" and "novels"), which the server obtains through back-end data analysis. The keyword database can also be a database of the most frequently searched words on the network.
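The priority chain described above can be sketched as a function that tries each extraction stage in turn. For brevity the sketch works on a flat word list rather than on a parse tree, and the keyword set, the action vocabulary, and the shape of the returned dictionary are assumptions.

```python
KEYWORDS = {"facebook", "taobao", "novel"}   # assumed keyword database
ACTIONS = {"open", "buy", "search"}          # assumed action vocabulary


def extract_entities(words):
    # Stage 1: keyword extraction has the highest priority.
    keywords = [w for w in words if w in KEYWORDS]
    if keywords:
        context = [w for w in words if w not in KEYWORDS]
        return {"type": "keyword", "keywords": keywords, "context": context}
    # Stage 2: otherwise look for an action followed by its arguments.
    if words and words[0] in ACTIONS:
        return {"type": "action", "action": words[0], "arguments": words[1:]}
    # Stage 3: nothing matched, so the whole text becomes a search query.
    return {"type": "search", "arguments": list(words)}
```

For the text's own example, `extract_entities(["butterfly"])` falls through to the last stage and yields a search entity whose argument is the whole text.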
Finally, at step S506, the conversion unit 206 uses the URL database and the parameter database to convert the extracted entities. The entities obtained by extraction are still rather abstract, for example "Sina homepage"; only after conversion does an entity become a concrete object that the browser can recognize directly (such as "http://www.sina.com.cn"). If the extraction unit 205 has extracted a keyword, the conversion unit 206 searches the relation database to determine which entities form the context of the keyword, and then uses the keyword and its context to search the URL database and obtain the web page the user wants to browse. The relation database contains relations between a keyword and its context, such as nesting or progressive relations. The URL database contains a very large number of URLs.
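A minimal sketch of the conversion step, assuming the extracted entity is a dictionary with a `type` field and, for actions, `action` and `arguments` fields. Only the Sina entry comes from the text; the search URL template and the command format are hypothetical.

```python
from urllib.parse import quote

URL_DB = {"sina homepage": "http://www.sina.com.cn"}    # example taken from the text
SEARCH_TEMPLATE = "https://search.example/?q={query}"   # hypothetical search engine


def to_browser_command(entity):
    """Turn an abstract entity into a concrete navigation command."""
    if entity["type"] == "action" and entity["action"] == "open":
        url = URL_DB.get(" ".join(entity["arguments"]))
        if url:
            return {"command": "navigate", "url": url}
    # Fallback: treat the arguments as a query for the configured search engine.
    query = " ".join(entity.get("arguments", []))
    return {"command": "navigate", "url": SEARCH_TEMPLATE.format(query=quote(query))}
```

In a fuller version the keyword case would consult the relation database before the URL lookup; that step is left out here because the text does not specify the structure of that database.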
Fig. 1b shows the browser voice command recognition system according to the second embodiment of the invention. Unlike the system of the first embodiment shown in Fig. 1a, which performs speech recognition on the user terminal, the system of this embodiment performs speech recognition on the server. Descriptions of the components in Fig. 1b that are identical to those in Fig. 1a are omitted so as not to obscure the invention. The speech recognition device 10 of the server 2 uses the model library to recognize the speech input by the user as text and sends the text to the user terminal 1 through the communication device. The first matching device 12 of the user terminal 1 matches the recognized text against the stored mapping table subset. When the first judgment device 14 determines that the match is exact, the user terminal 1 executes the browser command corresponding to the recognized text. When the first judgment device 14 determines that there is no match, the user terminal 1 sends the recognized text to the server. The second matching device 22 matches the received recognized text against the stored mapping table. On an exact match, the second judgment device 24 looks up the corresponding command in the mapping table and sends it to the user terminal; otherwise it decides that semantic recognition is to be performed. The semantic recognition device 20 then performs semantic recognition on the recognized text. The process is the same as described above and is not repeated here. Alternatively, the speech recognition device 10 of the server 2 uses the model library to recognize the speech input by the user as text and passes the text directly to the second matching device 22 for processing.
Fig. 7 is an example of a keyword-based browser voice command. When searching for information by voice through the browser, the user may say a series of keywords. These keywords usually have a fairly clear nesting or progressive relation within the context, and in this case the server of the invention can recognize and match the keywords in such a context. When the user says "Facebook John Doe graduated from Harvard", the semantic recognition device 20 of the server recognizes the keyword and the context information: "Facebook" (keyword), "John Doe" (context), and "Harvard" (context). By searching the relation database, it can determine from the keyword and its context that the user wants to browse the Facebook home page of John Doe, and that John Doe was once a student at Harvard University. On this basis, the browser can directly open this person's home page URL: http://www.facebook.com/pages/JohnDoeHarvard
Fig. 6 shows the browser voice command recognition system according to the third embodiment of the invention. Compared with the system of the second embodiment, the difference is that the user terminal in the system shown in Fig. 6 also comprises a context acquisition device 16 and a command execution interface 18, and the server also comprises a command injection device 26. Descriptions of the components in Fig. 6 that are identical to those in Fig. 1b are omitted so as not to obscure the invention.
When accessing a web page through a browser, the user often needs to interact with the page content. On a PC this interaction is usually triggered by mouse clicks. The browser voice command recognition system according to the third embodiment of the invention implements voice interaction, so the user can browse the required pages without clicking the mouse. The context acquisition device 16 of the user terminal 1 obtains context information and sends it to the server 2 through the communication device 19. Optionally, the context information can include the web page the user is currently browsing or the decibel level of the user's speech.
The semantic recognition device 20 of the server 2 uses the received context information to perform semantic recognition on the text based on the interactive operation database matching table: the command obtained by entity extraction is looked up in the matching table to obtain JavaScript content. The command injection device 26 then returns the content of the voice command from the server as a dynamic JavaScript script and injects it into the web page the user is currently browsing, so as to produce the effect of a trigger. The command execution interface 18 of the user terminal executes the script automatically on the current page, so the user can open the required page without clicking the mouse. Referring to Fig. 8a, the user says "I want to buy" while browsing a Taobao product page. After semantic processing by the browser voice command recognition system described above, this speech is converted into a "purchase" instruction. After this instruction is matched against the database together with the context, the JavaScript script content shown in Fig. 8b is obtained. The server 2 returns the script content and injects it into the page the user is currently browsing, and the user terminal executes it directly on the product page through the script execution interface provided by the browser, opening the purchase link; the effect is the same as the user clicking the "Buy now" button. The table of Fig. 8b may include the decibel level of the user's speech as context, so that the server can match different decibel levels in the context to different JavaScript scripts and return them to the user terminal.
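The lookup in the interactive operation matching table can be sketched as follows. Fig. 8b is not reproduced in the text, so the table layout, the page patterns, and the JavaScript snippets (including the element selectors) are all hypothetical; the decibel level mentioned as optional context is not modeled.

```python
# Hypothetical matching table: (page pattern, command, JavaScript to inject).
INTERACTION_TABLE = [
    ("shop.example/item", "purchase", "document.querySelector('#buy-now').click();"),
    ("shop.example/item", "add to cart", "document.querySelector('#add-cart').click();"),
    ("shop.example/search", "next page", "document.querySelector('.next').click();"),
]


def script_for(page_url, command):
    """Return the script registered for this command on this kind of page, if any."""
    for pattern, registered_command, script in INTERACTION_TABLE:
        if registered_command == command and pattern in page_url:
            return script
    return None
```

The server would send the returned string to the terminal, which runs it through the browser's script execution interface. Keying the table on the page as well as the command is what lets the same spoken instruction trigger different scripts on different sites.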
It will be understood that the block diagram shown in Fig. 6 is an exemplary browser voice command recognition system. In the invention, the context acquisition device 16, the command execution interface 18, and the command injection device 26 can be optional.
Fig. 9 is an example of recognizing a browser voice command. Taking as an example a user who activates voice commands on the Taobao mobile home page and says "uh, buy down jackets", it illustrates how the browser voice command recognition system recognizes the voice command.
The browser voice command recognition system and method of the invention implement intelligent voice-based web browsing: the user can control the web pages to be browsed by voice alone, without auxiliary devices such as a mouse and keyboard, which strengthens the interaction between the user and the user terminal.
It should be noted that the invention is not limited to the embodiments described above and can be extended to other technical fields. The invention can be considered in any field that involves speech signal processing, and its technical solution can be applied to other related products or methods. Although the invention has been described in conjunction with preferred embodiments, it should be understood that this description is for illustration only, and those skilled in the art can make other modifications, substitutions, and changes without departing from the spirit and scope of the appended claims.

Claims (20)

1. server that is used to discern the browser voice command comprises:
Communicator is used to receive the browser voice command that user terminal sends;
Speech recognition equipment, being used for the speech recognition of said browser voice command is text; With
Semantic recognition device is used for the text of said speech recognition is carried out semanteme identification, to convert browser command into.
2. the server that is used to discern the browser voice command as claimed in claim 1, wherein
Said server also comprises:
The order injection device is used for the browser command of conversion is sent to said user terminal and is infused in the webpage of user's current browsing.
3. according to claim 1 or claim 2 the server that is used to discern the browser voice command, wherein communicator also receives contextual information that user terminal sends to offer semantic recognition device.
4. the server that is used to discern the browser voice command as claimed in claim 3, wherein contextual information comprises the decibels that current web page that the user browses or user speak.
5. like the described server that is used to discern the browser voice command of one of claim 1 to 4, wherein said server also comprises:
Coalignment is used for the text and the mapping table of the said speech recognition that receives are mated; With
Judgment means is used for judging it is that the browser command corresponding with the text of said speech recognition sent to user terminal or the text of said speech recognition is carried out semantic identification according to matching result.
6. The server for recognizing browser voice commands as claimed in any one of claims 1 to 5, wherein said semantic recognition device comprises:
a data preprocessing unit for performing data cleaning and error correction on the speech-recognized text;
a word segmentation unit for segmenting the preprocessed text into words;
a part-of-speech tagging unit for tagging the segmented text with parts of speech;
a parsing unit for parsing and grouping the part-of-speech-tagged words;
an extraction unit for performing entity extraction on the parsed words; and
a conversion unit for converting the extracted entities into a browser command based on a database.
7. The server for recognizing browser voice commands as claimed in any one of claims 1 to 4, wherein
said speech recognition device obtains keywords from the recognized text, analyzes the context of the keywords based on a relation library, and searches a database using the keywords and their context so as to convert the recognized text into a browser command.
8. A browser voice command recognition method, comprising:
a communication step of receiving a browser voice command sent by a user terminal;
a speech recognition step of recognizing the speech of said browser voice command as text; and
a semantic recognition step of performing semantic recognition on the speech-recognized text so as to convert it into a browser command.
9. The browser voice command recognition method as claimed in claim 8, further comprising:
a command injection step of sending the converted browser command to said user terminal and injecting it into the web page the user is currently browsing.
10. The browser voice command recognition method as claimed in claim 8 or 9, wherein
the communication step further comprises a step of receiving context information sent by the user terminal; and
the semantic recognition step further comprises a step of performing semantic recognition on the speech-recognized text based on the context information.
11. The browser voice command recognition method as claimed in claim 10, wherein the context information comprises the web page the user is currently browsing or the decibel level at which the user speaks.
12. The browser voice command recognition method as claimed in any one of claims 8 to 11, further comprising:
a matching step of matching the received speech-recognized text against a mapping table; and
a judging step of determining, according to the matching result, whether to send the browser command corresponding to the speech-recognized text to the user terminal or to perform semantic recognition on the speech-recognized text.
13. The browser voice command recognition method as claimed in any one of claims 8 to 12, wherein said semantic recognition step comprises:
a data preprocessing step of performing data cleaning and error correction on the speech-recognized text;
a word segmentation step of segmenting the preprocessed text into words;
a part-of-speech tagging step of tagging the segmented text with parts of speech;
a parsing step of parsing and grouping the part-of-speech-tagged words;
an extraction step of performing entity extraction on the parsed words; and
a conversion step of converting the extracted entities into a browser command based on a database.
14. The browser voice command recognition method as claimed in any one of claims 8 to 11, wherein the semantic recognition step comprises:
a step of obtaining keywords from the recognized text, analyzing the context of the keywords based on a relation library, and searching a database using the keywords and their context so as to convert the recognized text into a browser command.
15. A browser voice command recognition system, comprising a user terminal and a server connected to the user terminal through a network, wherein:
said user terminal comprises:
an input device for receiving a browser voice command input by the user;
a speech recognition device for recognizing the speech of said browser voice command as text; and
a first communication device for sending the speech-recognized text to said server; and
said server comprises:
a second communication device for receiving the speech-recognized text; and
a semantic recognition device for performing semantic recognition on the speech-recognized text so as to convert it into a browser command.
16. The browser voice command recognition system as claimed in claim 15, wherein said user terminal further comprises:
a context acquisition device for obtaining context information and sending it to the server.
17. The browser voice command recognition system as claimed in claim 15 or 16, wherein
said server further comprises:
a command injection device for sending the converted browser command to the user terminal and injecting it into the web page said user is currently browsing; and
said user terminal further comprises:
a command execution interface for triggering execution of said browser command on the web page said user is currently browsing.
18. A browser voice command recognition method, comprising:
an input step in which a user terminal receives a browser voice command input by the user;
a speech recognition step in which the user terminal recognizes the speech of said browser voice command as text;
a first communication step in which the user terminal sends the speech-recognized text to a server;
a second communication step in which the server receives the speech-recognized text; and
a semantic recognition step in which the server performs semantic recognition on the speech-recognized text so as to convert it into a browser command.
19. The browser voice command recognition method as claimed in claim 18, further comprising:
a context acquisition step of obtaining context information and sending it to the server.
20. The browser voice command recognition method as claimed in claim 18 or 19, further comprising:
a command injection step of sending the converted browser command to the user terminal and injecting it into the web page said user is currently browsing; and
a command execution step of triggering execution of said browser command on the web page said user is currently browsing.
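
The mapping-table match with a fallback to semantic recognition, as recited in claims 5 and 12, can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration only: the table contents, the keyword set, the `recognize` function, and the shape of the returned command are hypothetical and are not taken from the patent. The semantic step is reduced to a keyword lookup and stands in for the fuller segmentation, tagging, parsing, and entity extraction pipeline of claims 6 and 13.

```python
import re
from typing import Optional

# Hypothetical mapping table: exact recognized text -> browser command.
MAPPING_TABLE = {
    "go back": "history.back()",
    "refresh": "location.reload()",
}

# Hypothetical keyword database used by the simplified semantic fallback.
ACTION_KEYWORDS = {"open": "navigate", "search": "search"}


def preprocess(text: str) -> str:
    """Data cleaning: lowercase, drop punctuation, collapse whitespace."""
    text = re.sub(r"[^\w\s]", " ", text.lower())
    return re.sub(r"\s+", " ", text).strip()


def semantic_recognize(text: str) -> Optional[dict]:
    """Simplified semantic step: find an action keyword and treat the
    words that follow it as the target entity."""
    tokens = text.split()  # whitespace split stands in for word segmentation
    for i, token in enumerate(tokens):
        if token in ACTION_KEYWORDS:
            target = " ".join(tokens[i + 1:])
            if target:
                return {"action": ACTION_KEYWORDS[token], "target": target}
    return None


def recognize(text: str) -> Optional[dict]:
    """Try the mapping table first; fall back to semantic recognition."""
    cleaned = preprocess(text)
    if cleaned in MAPPING_TABLE:
        return {"action": "script", "target": MAPPING_TABLE[cleaned]}
    return semantic_recognize(cleaned)
```

In this sketch a frequent fixed phrase such as "go back" is answered directly from the table, while a free-form utterance such as "please search weather today" falls through to the semantic step. An utterance that matches neither path yields `None`, which a caller could treat as an unrecognized command.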
CN201210029792.6A 2012-02-10 2012-02-10 Server for recognizing browser voice commands and browser voice command recognition method Expired - Fee Related CN102629246B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210029792.6A CN102629246B (en) 2012-02-10 2012-02-10 Server for recognizing browser voice commands and browser voice command recognition method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210029792.6A CN102629246B (en) 2012-02-10 2012-02-10 Server for recognizing browser voice commands and browser voice command recognition method

Publications (2)

Publication Number Publication Date
CN102629246A true CN102629246A (en) 2012-08-08
CN102629246B CN102629246B (en) 2017-06-27

Family

ID=46587506

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210029792.6A Expired - Fee Related CN102629246B (en) 2012-02-10 2012-02-10 Server for recognizing browser voice commands and browser voice command recognition method

Country Status (1)

Country Link
CN (1) CN102629246B (en)

Cited By (158)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102833633A (en) * 2012-09-04 2012-12-19 深圳创维-Rgb电子有限公司 System and method for controlling television voice
CN102902719A (en) * 2012-08-31 2013-01-30 四川长虹电器股份有限公司 Voice-control webpage browsing method for embedded browser
CN103077234A (en) * 2013-01-07 2013-05-01 季雪峰 Voice website navigation system and method
WO2013189342A2 (en) * 2013-01-22 2013-12-27 中兴通讯股份有限公司 Information processing method and mobile terminal
CN103631800A (en) * 2012-08-23 2014-03-12 腾讯科技(深圳)有限公司 Information processing method and device
CN103680498A (en) * 2012-09-26 2014-03-26 华为技术有限公司 Speech recognition method and speech recognition equipment
CN103714815A (en) * 2013-12-09 2014-04-09 何永 Voice control method and device thereof
CN103839549A (en) * 2012-11-22 2014-06-04 腾讯科技(深圳)有限公司 Voice instruction control method and system
CN104183237A (en) * 2014-09-04 2014-12-03 百度在线网络技术(北京)有限公司 Speech processing method and device for portable terminal
CN104240700A (en) * 2014-08-26 2014-12-24 智歌科技(北京)有限公司 Global voice interaction method and system for vehicle-mounted terminal device
CN104240707A (en) * 2012-11-26 2014-12-24 北京奇虎科技有限公司 Browser and voice identification processing method for same
CN104239296A (en) * 2013-06-06 2014-12-24 深圳市快播科技有限公司 Method and system for remotely controlling multi-screen adapter browser
CN104281609A (en) * 2013-07-08 2015-01-14 腾讯科技(深圳)有限公司 Voice input instruction matching rule configuration method and device
CN104462186A (en) * 2014-10-17 2015-03-25 百度在线网络技术(北京)有限公司 Method and device for voice search
CN104575499A (en) * 2013-10-09 2015-04-29 携程计算机技术(上海)有限公司 Voice control method of mobile terminal and mobile terminal
CN104766606A (en) * 2015-03-24 2015-07-08 上海修源网络科技有限公司 Method and device for translating natural languages into commands and navigation application of method and device
CN104966513A (en) * 2014-06-09 2015-10-07 腾讯科技(深圳)有限公司 Language command processing method and apparatus thereof
CN105096942A (en) * 2014-05-21 2015-11-25 清华大学 Semantic analysis method and semantic analysis device
TWI512655B (en) * 2014-04-18 2015-12-11
CN105161106A (en) * 2015-08-20 2015-12-16 深圳Tcl数字技术有限公司 Voice control method of intelligent terminal, voice control device and television system
CN105320726A (en) * 2014-05-30 2016-02-10 苹果公司 Reducing the need for manual start/end-pointing and trigger phrases
CN105320663A (en) * 2014-06-11 2016-02-10 安凯(广州)微电子技术有限公司 Control method and device and intelligent device
CN105404617A (en) * 2014-09-15 2016-03-16 华为技术有限公司 Remote desktop control method, controlled end and control system
CN105550308A (en) * 2015-12-14 2016-05-04 联想(北京)有限公司 Information processing method, retrieval method and electronic device
CN105549948A (en) * 2015-12-18 2016-05-04 合肥寰景信息技术有限公司 Voice operation method applied to network community
CN105551489A (en) * 2015-12-18 2016-05-04 合肥寰景信息技术有限公司 Voice control platform of network community
CN105869634A (en) * 2016-03-31 2016-08-17 重庆大学 Field-based method and system for feeding back text error correction after speech recognition
CN105940399A (en) * 2014-03-28 2016-09-14 松下知识产权经营株式会社 Speech retrieval device, speech retrieval method, and display device
CN105957530A (en) * 2016-04-28 2016-09-21 海信集团有限公司 Speech control method, device and terminal equipment
CN106164904A (en) * 2014-03-31 2016-11-23 微软技术许可有限责任公司 Client-side individual voice WEB navigates
WO2017000696A1 (en) * 2015-06-30 2017-01-05 中兴通讯股份有限公司 Interactive processing method and device
CN106683677A (en) * 2015-11-06 2017-05-17 阿里巴巴集团控股有限公司 Method and device for recognizing voice
CN106874185A (en) * 2016-12-27 2017-06-20 中车株洲电力机车研究所有限公司 A kind of automated testing method driven based on voiced keyword and system
CN106980614A (en) * 2016-01-15 2017-07-25 中国科学院声学研究所 A kind of Web page speech control implementation method extended based on JavaScript
WO2017156893A1 (en) * 2016-03-18 2017-09-21 深圳Tcl数字技术有限公司 Voice control method and smart television
CN107578776A (en) * 2017-09-25 2018-01-12 咪咕文化科技有限公司 Voice interaction awakening method and device and computer readable storage medium
CN107608957A (en) * 2017-09-06 2018-01-19 百度在线网络技术(北京)有限公司 Text modification method, apparatus and its equipment based on voice messaging
CN107844470A (en) * 2016-09-18 2018-03-27 腾讯科技(深圳)有限公司 A kind of voice data processing method and its equipment
CN108074565A (en) * 2016-11-11 2018-05-25 上海诺悦智能科技有限公司 Phonetic order redirects the method and system performed with detailed instructions
CN108304153A (en) * 2017-03-02 2018-07-20 腾讯科技(深圳)有限公司 Voice interactive method and device
WO2018197939A1 (en) * 2017-04-24 2018-11-01 International Business Machines Corporation Adding voice commands to invoke web services
CN108959255A (en) * 2018-06-28 2018-12-07 北京百度网讯科技有限公司 Entity labeled data collection construction method, device and equipment
CN108959511A (en) * 2018-06-27 2018-12-07 北京小度信息科技有限公司 Voice-based information search method, device, equipment and computer storage medium
CN109302473A (en) * 2018-09-28 2019-02-01 重庆赢者科技有限公司 A kind of voice SMS transmission system and method
CN109410932A (en) * 2018-10-17 2019-03-01 百度在线网络技术(北京)有限公司 Voice operating method and apparatus based on HTML5 webpage
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
US10354652B2 (en) 2015-12-02 2019-07-16 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
US10390213B2 (en) 2014-09-30 2019-08-20 Apple Inc. Social reminders
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10403283B1 (en) 2018-06-01 2019-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US10417405B2 (en) 2011-03-21 2019-09-17 Apple Inc. Device access using voice authentication
US10417344B2 (en) 2014-05-30 2019-09-17 Apple Inc. Exemplar-based natural language processing
US10438595B2 (en) 2014-09-30 2019-10-08 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US10453443B2 (en) 2014-09-30 2019-10-22 Apple Inc. Providing an indication of the suitability of speech recognition
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
US10496705B1 (en) 2018-06-03 2019-12-03 Apple Inc. Accelerated task performance
US10497365B2 (en) 2014-05-30 2019-12-03 Apple Inc. Multi-command single utterance input method
WO2020005241A1 (en) * 2018-06-27 2020-01-02 Google Llc Rendering responses to a spoken utterance of a user utilizing a local text-response map
US10529332B2 (en) 2015-03-08 2020-01-07 Apple Inc. Virtual assistant activation
CN110737817A (en) * 2018-07-02 2020-01-31 中兴通讯股份有限公司 Information processing method and device of browser, intelligent device and storage medium
US10553215B2 (en) 2016-09-23 2020-02-04 Apple Inc. Intelligent automated assistant
US10580409B2 (en) 2016-06-11 2020-03-03 Apple Inc. Application integration with a digital assistant
US10592604B2 (en) 2018-03-12 2020-03-17 Apple Inc. Inverse text normalization for automatic speech recognition
CN111095892A (en) * 2017-09-15 2020-05-01 三星电子株式会社 Electronic device and control method thereof
US10657961B2 (en) 2013-06-08 2020-05-19 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US10657966B2 (en) 2014-05-30 2020-05-19 Apple Inc. Better resolution when referencing to concepts
US10681212B2 (en) 2015-06-05 2020-06-09 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US10684703B2 (en) 2018-06-01 2020-06-16 Apple Inc. Attention aware virtual assistant dismissal
US10692504B2 (en) 2010-02-25 2020-06-23 Apple Inc. User profiling for voice input processing
US10699717B2 (en) 2014-05-30 2020-06-30 Apple Inc. Intelligent assistant for home automation
US10714117B2 (en) 2013-02-07 2020-07-14 Apple Inc. Voice trigger for a digital assistant
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
US10733375B2 (en) 2018-01-31 2020-08-04 Apple Inc. Knowledge-based framework for improving natural language understanding
US10733993B2 (en) 2016-06-10 2020-08-04 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10741181B2 (en) 2017-05-09 2020-08-11 Apple Inc. User interface for correcting recognition errors
US10741185B2 (en) 2010-01-18 2020-08-11 Apple Inc. Intelligent automated assistant
US10748546B2 (en) 2017-05-16 2020-08-18 Apple Inc. Digital assistant services based on device capabilities
US10769385B2 (en) 2013-06-09 2020-09-08 Apple Inc. System and method for inferring user intent from speech inputs
US10789959B2 (en) 2018-03-02 2020-09-29 Apple Inc. Training speaker recognition models for digital assistants
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction
US10839159B2 (en) 2018-09-28 2020-11-17 Apple Inc. Named entity normalization in a spoken dialog system
US10892996B2 (en) 2018-06-01 2021-01-12 Apple Inc. Variable latency device coordination
US10909171B2 (en) 2017-05-16 2021-02-02 Apple Inc. Intelligent automated assistant for media exploration
US10909331B2 (en) 2018-03-30 2021-02-02 Apple Inc. Implicit identification of translation payload with neural machine translation
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
US10930282B2 (en) 2015-03-08 2021-02-23 Apple Inc. Competing devices responding to voice triggers
US10942703B2 (en) 2015-12-23 2021-03-09 Apple Inc. Proactive assistance based on dialog communication between devices
US10942702B2 (en) 2016-06-11 2021-03-09 Apple Inc. Intelligent device arbitration and control
CN112506405A (en) * 2020-12-03 2021-03-16 浪潮云信息技术股份公司 Artificial intelligent voice large screen command method based on Internet supervision field
US10956666B2 (en) 2015-11-09 2021-03-23 Apple Inc. Unconventional virtual assistant interactions
US10984780B2 (en) 2018-05-21 2021-04-20 Apple Inc. Global semantic word embeddings using bi-directional recurrent neural networks
CN112700779A (en) * 2020-12-29 2021-04-23 南方电网深圳数字电网研究院有限公司 Voice interaction method, system, browser and storage medium
US11010127B2 (en) 2015-06-29 2021-05-18 Apple Inc. Virtual assistant for media playback
US11010561B2 (en) 2018-09-27 2021-05-18 Apple Inc. Sentiment prediction from textual data
US11048473B2 (en) 2013-06-09 2021-06-29 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
CN113068410A (en) * 2019-10-15 2021-07-02 谷歌有限责任公司 Efficient and low latency automated assistant control for smart devices
US11069336B2 (en) 2012-03-02 2021-07-20 Apple Inc. Systems and methods for name pronunciation
US11070949B2 (en) 2015-05-27 2021-07-20 Apple Inc. Systems and methods for proactively identifying and surfacing relevant content on an electronic device with a touch-sensitive display
US11069347B2 (en) 2016-06-08 2021-07-20 Apple Inc. Intelligent automated assistant for media exploration
US11120372B2 (en) 2011-06-03 2021-09-14 Apple Inc. Performing actions associated with task items that represent tasks to perform
US11126400B2 (en) 2015-09-08 2021-09-21 Apple Inc. Zero latency digital assistant
US11127397B2 (en) 2015-05-27 2021-09-21 Apple Inc. Device voice control
US11140099B2 (en) 2019-05-21 2021-10-05 Apple Inc. Providing message response suggestions
US11145294B2 (en) 2018-05-07 2021-10-12 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US11170166B2 (en) 2018-09-28 2021-11-09 Apple Inc. Neural typographical error modeling via generative adversarial networks
US11217251B2 (en) 2019-05-06 2022-01-04 Apple Inc. Spoken notifications
US11227589B2 (en) 2016-06-06 2022-01-18 Apple Inc. Intelligent list reading
US11231904B2 (en) 2015-03-06 2022-01-25 Apple Inc. Reducing response latency of intelligent automated assistants
US11237797B2 (en) 2019-05-31 2022-02-01 Apple Inc. User activity shortcut suggestions
US11269678B2 (en) 2012-05-15 2022-03-08 Apple Inc. Systems and methods for integrating third party services with a digital assistant
US11289073B2 (en) 2019-05-31 2022-03-29 Apple Inc. Device text to speech
US11301477B2 (en) 2017-05-12 2022-04-12 Apple Inc. Feedback analysis of a digital assistant
US11307752B2 (en) 2019-05-06 2022-04-19 Apple Inc. User configurable task triggers
US11314370B2 (en) 2013-12-06 2022-04-26 Apple Inc. Method for extracting salient dialog usage from live data
CN114489557A (en) * 2021-12-15 2022-05-13 青岛海尔科技有限公司 Voice interaction method, device, equipment and storage medium
US11348582B2 (en) 2008-10-02 2022-05-31 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
US11350253B2 (en) 2011-06-03 2022-05-31 Apple Inc. Active transport based notifications
US11360641B2 (en) 2019-06-01 2022-06-14 Apple Inc. Increasing the relevance of new available information
US11380310B2 (en) 2017-05-12 2022-07-05 Apple Inc. Low-latency intelligent automated assistant
US11386266B2 (en) 2018-06-01 2022-07-12 Apple Inc. Text correction
US11388291B2 (en) 2013-03-14 2022-07-12 Apple Inc. System and method for processing voicemail
US11405466B2 (en) 2017-05-12 2022-08-02 Apple Inc. Synchronization and task delegation of a digital assistant
US11423908B2 (en) 2019-05-06 2022-08-23 Apple Inc. Interpreting spoken requests
US11423886B2 (en) 2010-01-18 2022-08-23 Apple Inc. Task flow identification based on user intent
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
US11467802B2 (en) 2017-05-11 2022-10-11 Apple Inc. Maintaining privacy of personal information
US11468282B2 (en) 2015-05-15 2022-10-11 Apple Inc. Virtual assistant in a communication session
US11475898B2 (en) 2018-10-26 2022-10-18 Apple Inc. Low-latency multi-speaker speech recognition
US11475884B2 (en) 2019-05-06 2022-10-18 Apple Inc. Reducing digital assistant latency when a language is incorrectly determined
US11488406B2 (en) 2019-09-25 2022-11-01 Apple Inc. Text detection using global geometry estimators
US11496600B2 (en) 2019-05-31 2022-11-08 Apple Inc. Remote execution of machine-learned models
US11495218B2 (en) 2018-06-01 2022-11-08 Apple Inc. Virtual assistant operation in multi-device environments
US11500672B2 (en) 2015-09-08 2022-11-15 Apple Inc. Distributed personal assistant
US11516537B2 (en) 2014-06-30 2022-11-29 Apple Inc. Intelligent automated assistant for TV user interactions
US11526368B2 (en) 2015-11-06 2022-12-13 Apple Inc. Intelligent automated assistant in a messaging environment
US11532306B2 (en) 2017-05-16 2022-12-20 Apple Inc. Detecting a trigger of a digital assistant
US11580990B2 (en) 2017-05-12 2023-02-14 Apple Inc. User-specific acoustic models
US11638059B2 (en) 2019-01-04 2023-04-25 Apple Inc. Content playback on multiple devices
US11656884B2 (en) 2017-01-09 2023-05-23 Apple Inc. Application integration with a digital assistant
US11657813B2 (en) 2019-05-31 2023-05-23 Apple Inc. Voice identification in digital assistant systems
US11671920B2 (en) 2007-04-03 2023-06-06 Apple Inc. Method and system for operating a multifunction portable electronic device using voice-activation
US11696060B2 (en) 2020-07-21 2023-07-04 Apple Inc. User identification using headphones
US11755276B2 (en) 2020-05-12 2023-09-12 Apple Inc. Reducing description length based on confidence
US11765209B2 (en) 2020-05-11 2023-09-19 Apple Inc. Digital assistant hardware abstraction
US11790914B2 (en) 2019-06-01 2023-10-17 Apple Inc. Methods and user interfaces for voice-based control of electronic devices
US11798547B2 (en) 2013-03-15 2023-10-24 Apple Inc. Voice activated device for use with a voice-based digital assistant
US11809483B2 (en) 2015-09-08 2023-11-07 Apple Inc. Intelligent automated assistant for media search and playback
US11838734B2 (en) 2020-07-20 2023-12-05 Apple Inc. Multi-device audio adjustment coordination
US11853536B2 (en) 2015-09-08 2023-12-26 Apple Inc. Intelligent automated assistant in a media environment
US11914848B2 (en) 2020-05-11 2024-02-27 Apple Inc. Providing relevant data items based on context
US11928604B2 (en) 2005-09-08 2024-03-12 Apple Inc. Method and apparatus for building an intelligent automated assistant
US12010262B2 (en) 2013-08-06 2024-06-11 Apple Inc. Auto-activating smart responses based on activities from remote devices
US12014118B2 (en) 2017-05-15 2024-06-18 Apple Inc. Multi-modal interfaces having selection disambiguation and text modification capability
WO2024152185A1 (en) * 2023-01-17 2024-07-25 康佳集团股份有限公司 Internet-of-things device control system and method, and storage medium
US12051413B2 (en) 2015-09-30 2024-07-30 Apple Inc. Intelligent device identification

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1521953A (en) * 2003-01-29 2004-08-18 台达电子工业股份有限公司 Apparatus and method for computation process of natural language using hand held communication equipment
CN1526132A (en) * 2000-10-10 2004-09-01 英特尔公司 Language independent voice-based search system
CN1735929A (en) * 2003-01-09 2006-02-15 摩托罗拉公司 Voice browser dialog enabler for a communication system
CN201402461Y (en) * 2008-12-30 2010-02-10 同济大学第一附属中学 Sound-control web-page browser and voice control module thereof

Cited By (250)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11928604B2 (en) 2005-09-08 2024-03-12 Apple Inc. Method and apparatus for building an intelligent automated assistant
US11671920B2 (en) 2007-04-03 2023-06-06 Apple Inc. Method and system for operating a multifunction portable electronic device using voice-activation
US11348582B2 (en) 2008-10-02 2022-05-31 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US11900936B2 (en) 2008-10-02 2024-02-13 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US12087308B2 (en) 2010-01-18 2024-09-10 Apple Inc. Intelligent automated assistant
US11423886B2 (en) 2010-01-18 2022-08-23 Apple Inc. Task flow identification based on user intent
US10741185B2 (en) 2010-01-18 2020-08-11 Apple Inc. Intelligent automated assistant
US10692504B2 (en) 2010-02-25 2020-06-23 Apple Inc. User profiling for voice input processing
US10417405B2 (en) 2011-03-21 2019-09-17 Apple Inc. Device access using voice authentication
US11120372B2 (en) 2011-06-03 2021-09-14 Apple Inc. Performing actions associated with task items that represent tasks to perform
US11350253B2 (en) 2011-06-03 2022-05-31 Apple Inc. Active transport based notifications
US11069336B2 (en) 2012-03-02 2021-07-20 Apple Inc. Systems and methods for name pronunciation
US11269678B2 (en) 2012-05-15 2022-03-08 Apple Inc. Systems and methods for integrating third party services with a digital assistant
US11321116B2 (en) 2012-05-15 2022-05-03 Apple Inc. Systems and methods for integrating third party services with a digital assistant
CN103631800A (en) * 2012-08-23 2014-03-12 腾讯科技(深圳)有限公司 Information processing method and device
CN102902719A (en) * 2012-08-31 2013-01-30 四川长虹电器股份有限公司 Voice-control webpage browsing method for embedded browser
CN102833633A (en) * 2012-09-04 2012-12-19 深圳创维-Rgb电子有限公司 System and method for controlling television voice
CN103680498A (en) * 2012-09-26 2014-03-26 华为技术有限公司 Speech recognition method and speech recognition equipment
US9368108B2 (en) 2012-09-26 2016-06-14 Huawei Technologies Co., Ltd. Speech recognition method and device
CN103839549A (en) * 2012-11-22 2014-06-04 腾讯科技(深圳)有限公司 Voice instruction control method and system
CN104240707A (en) * 2012-11-26 2014-12-24 北京奇虎科技有限公司 Browser and voice identification processing method for same
CN103077234A (en) * 2013-01-07 2013-05-01 季雪峰 Voice website navigation system and method
CN103945044A (en) * 2013-01-22 2014-07-23 中兴通讯股份有限公司 Information processing method and mobile terminal
WO2013189342A3 (en) * 2013-01-22 2014-02-13 中兴通讯股份有限公司 Information processing method and mobile terminal
WO2013189342A2 (en) * 2013-01-22 2013-12-27 中兴通讯股份有限公司 Information processing method and mobile terminal
US10978090B2 (en) 2013-02-07 2021-04-13 Apple Inc. Voice trigger for a digital assistant
US11557310B2 (en) 2013-02-07 2023-01-17 Apple Inc. Voice trigger for a digital assistant
US11862186B2 (en) 2013-02-07 2024-01-02 Apple Inc. Voice trigger for a digital assistant
US11636869B2 (en) 2013-02-07 2023-04-25 Apple Inc. Voice trigger for a digital assistant
US10714117B2 (en) 2013-02-07 2020-07-14 Apple Inc. Voice trigger for a digital assistant
US11388291B2 (en) 2013-03-14 2022-07-12 Apple Inc. System and method for processing voicemail
US11798547B2 (en) 2013-03-15 2023-10-24 Apple Inc. Voice activated device for use with a voice-based digital assistant
CN104239296B (en) * 2013-06-06 2018-04-06 深圳市快播科技有限公司 The remote control method and system of multi-screen adapter browser
CN104239296A (en) * 2013-06-06 2014-12-24 深圳市快播科技有限公司 Method and system for remotely controlling multi-screen adapter browser
US10657961B2 (en) 2013-06-08 2020-05-19 Apple Inc. Interpreting and acting upon commands that involve sharing information with remote devices
US11048473B2 (en) 2013-06-09 2021-06-29 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
US10769385B2 (en) 2013-06-09 2020-09-08 Apple Inc. System and method for inferring user intent from speech inputs
US11727219B2 (en) 2013-06-09 2023-08-15 Apple Inc. System and method for inferring user intent from speech inputs
US12073147B2 (en) 2013-06-09 2024-08-27 Apple Inc. Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant
CN104281609B (en) * 2013-07-08 2020-03-17 腾讯科技(深圳)有限公司 Configuration method and device for voice input instruction matching rule
CN104281609A (en) * 2013-07-08 2015-01-14 腾讯科技(深圳)有限公司 Voice input instruction matching rule configuration method and device
US12010262B2 (en) 2013-08-06 2024-06-11 Apple Inc. Auto-activating smart responses based on activities from remote devices
CN104575499A (en) * 2013-10-09 2015-04-29 携程计算机技术(上海)有限公司 Voice control method of mobile terminal and mobile terminal
CN104575499B (en) * 2013-10-09 2019-12-20 上海携程商务有限公司 Voice control method of mobile terminal and mobile terminal
US11314370B2 (en) 2013-12-06 2022-04-26 Apple Inc. Method for extracting salient dialog usage from live data
CN103714815A (en) * 2013-12-09 2014-04-09 何永 Voice control method and device thereof
CN105940399B (en) * 2014-03-28 2019-04-26 松下知识产权经营株式会社 Voice search device, speech search method and display device
CN105940399A (en) * 2014-03-28 2016-09-14 松下知识产权经营株式会社 Speech retrieval device, speech retrieval method, and display device
CN106164904A (en) * 2014-03-31 2016-11-23 微软技术许可有限责任公司 Client-side individual voice WEB navigates
TWI512655B (en) * 2014-04-18 2015-12-11
CN105096942A (en) * 2014-05-21 2015-11-25 清华大学 Semantic analysis method and semantic analysis device
US12118999B2 (en) 2014-05-30 2024-10-15 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
CN105320726A (en) * 2014-05-30 2016-02-10 苹果公司 Reducing the need for manual start/end-pointing and trigger phrases
US10699717B2 (en) 2014-05-30 2020-06-30 Apple Inc. Intelligent assistant for home automation
US10657966B2 (en) 2014-05-30 2020-05-19 Apple Inc. Better resolution when referencing to concepts
US11810562B2 (en) 2014-05-30 2023-11-07 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US10373617B2 (en) 2014-05-30 2019-08-06 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US11257504B2 (en) 2014-05-30 2022-02-22 Apple Inc. Intelligent assistant for home automation
CN105320726B (en) * 2014-05-30 2019-08-20 苹果公司 Reducing the need for manual start/end-pointing and trigger phrases
US10497365B2 (en) 2014-05-30 2019-12-03 Apple Inc. Multi-command single utterance input method
US11133008B2 (en) 2014-05-30 2021-09-28 Apple Inc. Reducing the need for manual start/end-pointing and trigger phrases
US10878809B2 (en) 2014-05-30 2020-12-29 Apple Inc. Multi-command single utterance input method
US10714095B2 (en) 2014-05-30 2020-07-14 Apple Inc. Intelligent assistant for home automation
US11670289B2 (en) 2014-05-30 2023-06-06 Apple Inc. Multi-command single utterance input method
US10417344B2 (en) 2014-05-30 2019-09-17 Apple Inc. Exemplar-based natural language processing
US11699448B2 (en) 2014-05-30 2023-07-11 Apple Inc. Intelligent assistant for home automation
CN104966513A (en) * 2014-06-09 2015-10-07 腾讯科技(深圳)有限公司 Language command processing method and apparatus thereof
CN104966513B (en) * 2014-06-09 2018-09-04 腾讯科技(深圳)有限公司 Verbal order treating method and apparatus
CN105320663A (en) * 2014-06-11 2016-02-10 安凯(广州)微电子技术有限公司 Control method and device and intelligent device
US11516537B2 (en) 2014-06-30 2022-11-29 Apple Inc. Intelligent automated assistant for TV user interactions
US11838579B2 (en) 2014-06-30 2023-12-05 Apple Inc. Intelligent automated assistant for TV user interactions
CN104240700A (en) * 2014-08-26 2014-12-24 智歌科技(北京)有限公司 Global voice interaction method and system for vehicle-mounted terminal device
CN104183237A (en) * 2014-09-04 2014-12-03 百度在线网络技术(北京)有限公司 Speech processing method and device for portable terminal
CN105404617B (en) * 2014-09-15 2018-12-14 华为技术有限公司 Remote desktop control method, controlled end and control system
CN105404617A (en) * 2014-09-15 2016-03-16 华为技术有限公司 Remote desktop control method, controlled end and control system
US10438595B2 (en) 2014-09-30 2019-10-08 Apple Inc. Speaker identification and unsupervised speaker adaptation techniques
US10453443B2 (en) 2014-09-30 2019-10-22 Apple Inc. Providing an indication of the suitability of speech recognition
US10390213B2 (en) 2014-09-30 2019-08-20 Apple Inc. Social reminders
CN104462186A (en) * 2014-10-17 2015-03-25 百度在线网络技术(北京)有限公司 Method and device for voice search
US11231904B2 (en) 2015-03-06 2022-01-25 Apple Inc. Reducing response latency of intelligent automated assistants
US10529332B2 (en) 2015-03-08 2020-01-07 Apple Inc. Virtual assistant activation
US10930282B2 (en) 2015-03-08 2021-02-23 Apple Inc. Competing devices responding to voice triggers
US11842734B2 (en) 2015-03-08 2023-12-12 Apple Inc. Virtual assistant activation
US11087759B2 (en) 2015-03-08 2021-08-10 Apple Inc. Virtual assistant activation
CN104766606B (en) * 2015-03-24 2018-03-16 上海修源网络科技有限公司 Method and device for translating natural languages into commands and navigation application of method and device
CN104766606A (en) * 2015-03-24 2015-07-08 上海修源网络科技有限公司 Method and device for translating natural languages into commands and navigation application of method and device
US11468282B2 (en) 2015-05-15 2022-10-11 Apple Inc. Virtual assistant in a communication session
US12001933B2 (en) 2015-05-15 2024-06-04 Apple Inc. Virtual assistant in a communication session
US11127397B2 (en) 2015-05-27 2021-09-21 Apple Inc. Device voice control
US11070949B2 (en) 2015-05-27 2021-07-20 Apple Inc. Systems and methods for proactively identifying and surfacing relevant content on an electronic device with a touch-sensitive display
US10681212B2 (en) 2015-06-05 2020-06-09 Apple Inc. Virtual assistant aided communication with 3rd party service in a communication session
US11947873B2 (en) 2015-06-29 2024-04-02 Apple Inc. Virtual assistant for media playback
US11010127B2 (en) 2015-06-29 2021-05-18 Apple Inc. Virtual assistant for media playback
WO2017000696A1 (en) * 2015-06-30 2017-01-05 中兴通讯股份有限公司 Interactive processing method and device
CN105161106A (en) * 2015-08-20 2015-12-16 深圳Tcl数字技术有限公司 Voice control method of intelligent terminal, voice control device and television system
US11954405B2 (en) 2015-09-08 2024-04-09 Apple Inc. Zero latency digital assistant
US11809483B2 (en) 2015-09-08 2023-11-07 Apple Inc. Intelligent automated assistant for media search and playback
US11853536B2 (en) 2015-09-08 2023-12-26 Apple Inc. Intelligent automated assistant in a media environment
US11126400B2 (en) 2015-09-08 2021-09-21 Apple Inc. Zero latency digital assistant
US11550542B2 (en) 2015-09-08 2023-01-10 Apple Inc. Zero latency digital assistant
US11500672B2 (en) 2015-09-08 2022-11-15 Apple Inc. Distributed personal assistant
US12051413B2 (en) 2015-09-30 2024-07-30 Apple Inc. Intelligent device identification
US11809886B2 (en) 2015-11-06 2023-11-07 Apple Inc. Intelligent automated assistant in a messaging environment
CN106683677A (en) * 2015-11-06 2017-05-17 阿里巴巴集团控股有限公司 Method and device for recognizing voice
US11664020B2 (en) 2015-11-06 2023-05-30 Alibaba Group Holding Limited Speech recognition method and apparatus
US11526368B2 (en) 2015-11-06 2022-12-13 Apple Inc. Intelligent automated assistant in a messaging environment
US10956666B2 (en) 2015-11-09 2021-03-23 Apple Inc. Unconventional virtual assistant interactions
US11886805B2 (en) 2015-11-09 2024-01-30 Apple Inc. Unconventional virtual assistant interactions
US10354652B2 (en) 2015-12-02 2019-07-16 Apple Inc. Applying neural network language models to weighted finite state transducers for automatic speech recognition
CN105550308A (en) * 2015-12-14 2016-05-04 联想(北京)有限公司 Information processing method, retrieval method and electronic device
CN105550308B (en) * 2015-12-14 2019-07-26 联想(北京)有限公司 Information processing method, retrieval method and electronic device
CN105549948A (en) * 2015-12-18 2016-05-04 合肥寰景信息技术有限公司 Voice operation method applied to network community
CN105551489A (en) * 2015-12-18 2016-05-04 合肥寰景信息技术有限公司 Voice control platform of network community
US10942703B2 (en) 2015-12-23 2021-03-09 Apple Inc. Proactive assistance based on dialog communication between devices
CN106980614A (en) * 2016-01-15 2017-07-25 中国科学院声学研究所 Web page voice control implementation method based on JavaScript extension
CN106980614B (en) * 2016-01-15 2019-09-24 中国科学院声学研究所 Web page voice control implementation method based on JavaScript extension
WO2017156893A1 (en) * 2016-03-18 2017-09-21 深圳Tcl数字技术有限公司 Voice control method and smart television
CN105869634A (en) * 2016-03-31 2016-08-17 重庆大学 Field-based method and system for feeding back text error correction after speech recognition
CN105869634B (en) * 2016-03-31 2019-11-19 重庆大学 Field-based method and system for feeding back text error correction after speech recognition
CN105957530A (en) * 2016-04-28 2016-09-21 海信集团有限公司 Speech control method, device and terminal equipment
CN105957530B (en) * 2016-04-28 2020-01-03 海信集团有限公司 Voice control method and device and terminal equipment
US10600413B2 (en) 2016-04-28 2020-03-24 Hisense Co., Ltd. Voice control method, device and terminal
US11227589B2 (en) 2016-06-06 2022-01-18 Apple Inc. Intelligent list reading
US11069347B2 (en) 2016-06-08 2021-07-20 Apple Inc. Intelligent automated assistant for media exploration
US11037565B2 (en) 2016-06-10 2021-06-15 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10733993B2 (en) 2016-06-10 2020-08-04 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US11657820B2 (en) 2016-06-10 2023-05-23 Apple Inc. Intelligent digital assistant in a multi-tasking environment
US10580409B2 (en) 2016-06-11 2020-03-03 Apple Inc. Application integration with a digital assistant
US11152002B2 (en) 2016-06-11 2021-10-19 Apple Inc. Application integration with a digital assistant
US10942702B2 (en) 2016-06-11 2021-03-09 Apple Inc. Intelligent device arbitration and control
US11809783B2 (en) 2016-06-11 2023-11-07 Apple Inc. Intelligent device arbitration and control
US11749275B2 (en) 2016-06-11 2023-09-05 Apple Inc. Application integration with a digital assistant
US10474753B2 (en) 2016-09-07 2019-11-12 Apple Inc. Language identification using recurrent neural networks
CN107844470B (en) * 2016-09-18 2021-04-30 腾讯科技(深圳)有限公司 Voice data processing method and equipment thereof
CN107844470A (en) * 2016-09-18 2018-03-27 腾讯科技(深圳)有限公司 Voice data processing method and equipment thereof
US10553215B2 (en) 2016-09-23 2020-02-04 Apple Inc. Intelligent automated assistant
CN108074565A (en) * 2016-11-11 2018-05-25 上海诺悦智能科技有限公司 Phonetic order redirects the method and system performed with detailed instructions
CN106874185A (en) * 2016-12-27 2017-06-20 中车株洲电力机车研究所有限公司 Automated testing method and system driven by voice keywords
US11656884B2 (en) 2017-01-09 2023-05-23 Apple Inc. Application integration with a digital assistant
CN108304153A (en) * 2017-03-02 2018-07-20 腾讯科技(深圳)有限公司 Voice interactive method and device
US10574517B2 (en) 2017-04-24 2020-02-25 International Business Machines Corporation Adding voice commands to invoke web services
WO2018197939A1 (en) * 2017-04-24 2018-11-01 International Business Machines Corporation Adding voice commands to invoke web services
US10741181B2 (en) 2017-05-09 2020-08-11 Apple Inc. User interface for correcting recognition errors
US10417266B2 (en) 2017-05-09 2019-09-17 Apple Inc. Context-aware ranking of intelligent response suggestions
US11467802B2 (en) 2017-05-11 2022-10-11 Apple Inc. Maintaining privacy of personal information
US10726832B2 (en) 2017-05-11 2020-07-28 Apple Inc. Maintaining privacy of personal information
US10847142B2 (en) 2017-05-11 2020-11-24 Apple Inc. Maintaining privacy of personal information
US10395654B2 (en) 2017-05-11 2019-08-27 Apple Inc. Text normalization based on a data-driven learning network
US11599331B2 (en) 2017-05-11 2023-03-07 Apple Inc. Maintaining privacy of personal information
US11538469B2 (en) 2017-05-12 2022-12-27 Apple Inc. Low-latency intelligent automated assistant
US11862151B2 (en) 2017-05-12 2024-01-02 Apple Inc. Low-latency intelligent automated assistant
US11580990B2 (en) 2017-05-12 2023-02-14 Apple Inc. User-specific acoustic models
US11301477B2 (en) 2017-05-12 2022-04-12 Apple Inc. Feedback analysis of a digital assistant
US11405466B2 (en) 2017-05-12 2022-08-02 Apple Inc. Synchronization and task delegation of a digital assistant
US11380310B2 (en) 2017-05-12 2022-07-05 Apple Inc. Low-latency intelligent automated assistant
US12014118B2 (en) 2017-05-15 2024-06-18 Apple Inc. Multi-modal interfaces having selection disambiguation and text modification capability
US11675829B2 (en) 2017-05-16 2023-06-13 Apple Inc. Intelligent automated assistant for media exploration
US10909171B2 (en) 2017-05-16 2021-02-02 Apple Inc. Intelligent automated assistant for media exploration
US11532306B2 (en) 2017-05-16 2022-12-20 Apple Inc. Detecting a trigger of a digital assistant
US10403278B2 (en) 2017-05-16 2019-09-03 Apple Inc. Methods and systems for phonetic matching in digital assistant services
US10748546B2 (en) 2017-05-16 2020-08-18 Apple Inc. Digital assistant services based on device capabilities
US10311144B2 (en) 2017-05-16 2019-06-04 Apple Inc. Emoji word sense disambiguation
CN107608957A (en) * 2017-09-06 2018-01-19 百度在线网络技术(北京)有限公司 Text modification method, apparatus and its equipment based on voice messaging
CN111095892A (en) * 2017-09-15 2020-05-01 三星电子株式会社 Electronic device and control method thereof
US11874904B2 (en) 2017-09-15 2024-01-16 Samsung Electronics Co., Ltd. Electronic device including mode for using an artificial intelligence assistant function of another electronic device
CN111095892B (en) * 2017-09-15 2022-06-24 三星电子株式会社 Electronic device and control method thereof
CN107578776B (en) * 2017-09-25 2021-08-06 咪咕文化科技有限公司 Voice interaction awakening method and device and computer readable storage medium
CN107578776A (en) * 2017-09-25 2018-01-12 咪咕文化科技有限公司 Voice interaction awakening method and device and computer readable storage medium
US10733375B2 (en) 2018-01-31 2020-08-04 Apple Inc. Knowledge-based framework for improving natural language understanding
US10789959B2 (en) 2018-03-02 2020-09-29 Apple Inc. Training speaker recognition models for digital assistants
US10592604B2 (en) 2018-03-12 2020-03-17 Apple Inc. Inverse text normalization for automatic speech recognition
US10818288B2 (en) 2018-03-26 2020-10-27 Apple Inc. Natural assistant interaction
US11710482B2 (en) 2018-03-26 2023-07-25 Apple Inc. Natural assistant interaction
US10909331B2 (en) 2018-03-30 2021-02-02 Apple Inc. Implicit identification of translation payload with neural machine translation
US11145294B2 (en) 2018-05-07 2021-10-12 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US11900923B2 (en) 2018-05-07 2024-02-13 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US11907436B2 (en) 2018-05-07 2024-02-20 Apple Inc. Raise to speak
US11169616B2 (en) 2018-05-07 2021-11-09 Apple Inc. Raise to speak
US11487364B2 (en) 2018-05-07 2022-11-01 Apple Inc. Raise to speak
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
US11854539B2 (en) 2018-05-07 2023-12-26 Apple Inc. Intelligent automated assistant for delivering content from user experiences
US10984780B2 (en) 2018-05-21 2021-04-20 Apple Inc. Global semantic word embeddings using bi-directional recurrent neural networks
US10892996B2 (en) 2018-06-01 2021-01-12 Apple Inc. Variable latency device coordination
US10403283B1 (en) 2018-06-01 2019-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US11009970B2 (en) 2018-06-01 2021-05-18 Apple Inc. Attention aware virtual assistant dismissal
US12080287B2 (en) 2018-06-01 2024-09-03 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US10720160B2 (en) 2018-06-01 2020-07-21 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US11495218B2 (en) 2018-06-01 2022-11-08 Apple Inc. Virtual assistant operation in multi-device environments
US10984798B2 (en) 2018-06-01 2021-04-20 Apple Inc. Voice interaction at a primary device to access call functionality of a companion device
US12067985B2 (en) 2018-06-01 2024-08-20 Apple Inc. Virtual assistant operations in multi-device environments
US10684703B2 (en) 2018-06-01 2020-06-16 Apple Inc. Attention aware virtual assistant dismissal
US11630525B2 (en) 2018-06-01 2023-04-18 Apple Inc. Attention aware virtual assistant dismissal
US11386266B2 (en) 2018-06-01 2022-07-12 Apple Inc. Text correction
US11360577B2 (en) 2018-06-01 2022-06-14 Apple Inc. Attention aware virtual assistant dismissal
US11431642B2 (en) 2018-06-01 2022-08-30 Apple Inc. Variable latency device coordination
US10504518B1 (en) 2018-06-03 2019-12-10 Apple Inc. Accelerated task performance
US10944859B2 (en) 2018-06-03 2021-03-09 Apple Inc. Accelerated task performance
US10496705B1 (en) 2018-06-03 2019-12-03 Apple Inc. Accelerated task performance
WO2020005241A1 (en) * 2018-06-27 2020-01-02 Google Llc Rendering responses to a spoken utterance of a user utilizing a local text-response map
EP3800633A1 (en) * 2018-06-27 2021-04-07 Google LLC Rendering responses to a spoken utterance of a user utilizing a local text-response map
US10891958B2 (en) 2018-06-27 2021-01-12 Google Llc Rendering responses to a spoken utterance of a user utilizing a local text-response map
CN108959511A (en) * 2018-06-27 2018-12-07 北京小度信息科技有限公司 Voice-based information search method, device, equipment and computer storage medium
CN108959255A (en) * 2018-06-28 2018-12-07 北京百度网讯科技有限公司 Entity labeled data collection construction method, device and equipment
CN110737817A (en) * 2018-07-02 2020-01-31 中兴通讯股份有限公司 Information processing method and device of browser, intelligent device and storage medium
US11010561B2 (en) 2018-09-27 2021-05-18 Apple Inc. Sentiment prediction from textual data
US10839159B2 (en) 2018-09-28 2020-11-17 Apple Inc. Named entity normalization in a spoken dialog system
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
US11893992B2 (en) 2018-09-28 2024-02-06 Apple Inc. Multi-modal inputs for voice commands
US11170166B2 (en) 2018-09-28 2021-11-09 Apple Inc. Neural typographical error modeling via generative adversarial networks
CN109302473A (en) * 2018-09-28 2019-02-01 重庆赢者科技有限公司 Voice SMS transmission system and method
CN109410932A (en) * 2018-10-17 2019-03-01 百度在线网络技术(北京)有限公司 Voice operating method and apparatus based on HTML5 webpage
CN109410932B (en) * 2018-10-17 2022-03-01 百度在线网络技术(北京)有限公司 Voice operation method and device based on HTML5 webpage
US11475898B2 (en) 2018-10-26 2022-10-18 Apple Inc. Low-latency multi-speaker speech recognition
US11638059B2 (en) 2019-01-04 2023-04-25 Apple Inc. Content playback on multiple devices
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
US11783815B2 (en) 2019-03-18 2023-10-10 Apple Inc. Multimodality in digital assistant systems
US11705130B2 (en) 2019-05-06 2023-07-18 Apple Inc. Spoken notifications
US11475884B2 (en) 2019-05-06 2022-10-18 Apple Inc. Reducing digital assistant latency when a language is incorrectly determined
US11423908B2 (en) 2019-05-06 2022-08-23 Apple Inc. Interpreting spoken requests
US11307752B2 (en) 2019-05-06 2022-04-19 Apple Inc. User configurable task triggers
US11675491B2 (en) 2019-05-06 2023-06-13 Apple Inc. User configurable task triggers
US11217251B2 (en) 2019-05-06 2022-01-04 Apple Inc. Spoken notifications
US11140099B2 (en) 2019-05-21 2021-10-05 Apple Inc. Providing message response suggestions
US11888791B2 (en) 2019-05-21 2024-01-30 Apple Inc. Providing message response suggestions
US11657813B2 (en) 2019-05-31 2023-05-23 Apple Inc. Voice identification in digital assistant systems
US11360739B2 (en) 2019-05-31 2022-06-14 Apple Inc. User activity shortcut suggestions
US11496600B2 (en) 2019-05-31 2022-11-08 Apple Inc. Remote execution of machine-learned models
US11237797B2 (en) 2019-05-31 2022-02-01 Apple Inc. User activity shortcut suggestions
US11289073B2 (en) 2019-05-31 2022-03-29 Apple Inc. Device text to speech
US11360641B2 (en) 2019-06-01 2022-06-14 Apple Inc. Increasing the relevance of new available information
US11790914B2 (en) 2019-06-01 2023-10-17 Apple Inc. Methods and user interfaces for voice-based control of electronic devices
US11488406B2 (en) 2019-09-25 2022-11-01 Apple Inc. Text detection using global geometry estimators
JP2022153537A (en) * 2019-10-15 2022-10-12 グーグル エルエルシー Efficient, low-latency, automated assistant control of smart device
JP7536834B2 (en) 2019-10-15 2024-08-20 グーグル エルエルシー Efficient, low-latency automated assistant control for smart devices
JP7114722B2 (en) 2019-10-15 2022-08-08 グーグル エルエルシー Efficient, low-latency automated assistant control of smart devices
JP2022500682A (en) * 2019-10-15 2022-01-04 グーグル エルエルシー Efficient, low-latency automatic assistant control for smart devices
US11783814B2 (en) 2019-10-15 2023-10-10 Google Llc Efficient and low latency automated assistant control of smart devices
CN113068410A (en) * 2019-10-15 2021-07-02 谷歌有限责任公司 Efficient and low latency automated assistant control for smart devices
US11765209B2 (en) 2020-05-11 2023-09-19 Apple Inc. Digital assistant hardware abstraction
US11914848B2 (en) 2020-05-11 2024-02-27 Apple Inc. Providing relevant data items based on context
US11924254B2 (en) 2020-05-11 2024-03-05 Apple Inc. Digital assistant hardware abstraction
US11755276B2 (en) 2020-05-12 2023-09-12 Apple Inc. Reducing description length based on confidence
US11838734B2 (en) 2020-07-20 2023-12-05 Apple Inc. Multi-device audio adjustment coordination
US11750962B2 (en) 2020-07-21 2023-09-05 Apple Inc. User identification using headphones
US11696060B2 (en) 2020-07-21 2023-07-04 Apple Inc. User identification using headphones
CN112506405A (en) * 2020-12-03 2021-03-16 浪潮云信息技术股份公司 Artificial intelligent voice large screen command method based on Internet supervision field
CN112700779A (en) * 2020-12-29 2021-04-23 南方电网深圳数字电网研究院有限公司 Voice interaction method, system, browser and storage medium
CN114489557A (en) * 2021-12-15 2022-05-13 青岛海尔科技有限公司 Voice interaction method, device, equipment and storage medium
CN114489557B (en) * 2021-12-15 2024-03-22 青岛海尔科技有限公司 Voice interaction method, device, equipment and storage medium
WO2024152185A1 (en) * 2023-01-17 2024-07-25 康佳集团股份有限公司 Internet-of-things device control system and method, and storage medium

Also Published As

Publication number Publication date
CN102629246B (en) 2017-06-27

Similar Documents

Publication Publication Date Title
CN102629246A (en) Server used for recognizing browser voice commands and browser voice command recognition system
KR101359715B1 (en) Method and apparatus for providing mobile voice web
JP6440732B2 (en) Automatic task classification based on machine learning
US20130219277A1 (en) Gesture and Voice Controlled Browser
CN100524295C (en) An arrangement and a method relating to access to internet content
US9564122B2 (en) Language model adaptation based on filtered data
US7742922B2 (en) Speech interface for search engines
US20100005086A1 (en) Resource locator suggestions from input character sequence
CN103106287B (en) Method and system for processing user search sentences
KR101709055B1 (en) Apparatus and Method for Question Analysis for Open web Question-Answering
CN104142831A (en) Application program searching method and device
US10303747B2 (en) Method, apparatus and system for controlling address input
RU2631975C2 (en) Method and system for user input command processing
CN102982025A (en) Identification method and device for searching requirement
US20090182759A1 (en) Extracting entities from a web page
JP5139499B2 (en) Extraction apparatus, extraction method, program, and information processing apparatus for distributing program
KR20110017760A (en) System for extracting multimedia contents descriptor and method therefor
KR100519748B1 (en) Method and apparatus for internet navigation through continuous voice command
CN106021516A (en) Search method and device
Lei et al. Leveraging Zipf’s law to analyze statistical distribution of Chinese corpus
US11551674B2 (en) Multi-pipeline language processing platform
TWI767197B (en) Method and server for providing interactive voice tutorial
US11403340B2 (en) Hierarchical multi-tier language processing platform
Brøndsted et al. Mobile information access with spoken query answering
Kwon et al. Novel Speech Web Architecture Based on Information Selection Agent

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: BAINA (WUHAN) INFORMATION TECHNOLOGY CO., LTD.

Free format text: FORMER OWNER: BEIJING BAINA INFORMATION TECHNOLOGY CO., LTD.

Effective date: 20130924

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100083 HAIDIAN, BEIJING TO: 430074 WUHAN, HUBEI PROVINCE

TA01 Transfer of patent application right

Effective date of registration: 20130924

Address after: 430074, 3rd Floor, Building A2, Optics Valley Financial Harbor, No. 77 Optics Valley Avenue, Wuhan, Hubei

Applicant after: Baina (Wuhan) Information Technology Co., Ltd.

Address before: 100083, Beijing, Haidian District, a clear road No. 38 Gold Hotel, room 607-608, room 6

Applicant before: Beijing Mobo Tap Technology Co., Ltd.

GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170627

Termination date: 20190210