CN107316639A - Information input method and device based on speech recognition, and electronic device - Google Patents

Information input method and device based on speech recognition, and electronic device

Info

Publication number
CN107316639A
Authority
CN
China
Prior art keywords
recognition result
voice
word
words
words recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710358744.4A
Other languages
Chinese (zh)
Inventor
梁伟文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xinmeihutong Technology Co
Original Assignee
Beijing Xinmeihutong Technology Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Xinmeihutong Technology Co
Priority to CN201710358744.4A
Publication of CN107316639A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L2015/088 Word spotting
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225 Feedback of the input speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention provides an information input method based on speech recognition, which solves the prior-art problem of low voice input efficiency. The method includes: recognizing input speech to obtain at least two speech recognition results, where each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech; determining all word recognition results, across the at least two speech recognition results, that correspond to each spoken word; displaying at least some of the word recognition results corresponding to each word, in the order of the word recognition result sequence; and generating the information to be input according to the user's selection among the displayed word recognition results. By displaying the multiple word recognition results corresponding to each spoken word and composing the information to be input from the user's selections, the disclosed method generates the information to be input accurately and completes the input, which improves the efficiency of voice input.

Description

Information input method and device based on speech recognition, and electronic device
Technical field
The present invention relates to the field of information input, and more particularly to an information input method and device based on speech recognition, and an electronic device.
Background
With the development of speech recognition technology, more and more electronic devices and applications accept information input through speech recognition. A prior-art voice input method typically works as follows: first, the speech data input by the user is obtained; then the obtained speech data is recognized; finally, the speech recognition result is input directly. Alternatively, when there are multiple speech recognition results, they are shown as a list and the result the user selects is input. However, when the user's speech is non-standard or contains homophones, the recognition result for one or more words may be inaccurate or may have multiple candidates. The directly input result, or the results shown in the list, may therefore not be the information the user actually intends to input. In that case the user can only delete the incorrect speech recognition result and then input the speech again. Another prior-art approach displays multiple speech recognition results as a list for the user to choose from; it has the same problem, since none of the listed results may be the information the user actually intends to input.
It can be seen that prior-art information input methods based on speech recognition have at least the following problem: because the recognition result of a single spoken word is inaccurate or has multiple candidates, the result that is input or shown to the user is inaccurate and the user has to input again, which makes input inefficient.
Summary of the invention
The technical problem to be solved by this application is that prior-art information input methods based on speech recognition have low input efficiency.
To solve this problem, an embodiment of the present invention provides an information input method based on speech recognition, including:
recognizing input speech to obtain at least two speech recognition results, where each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech;
determining all word recognition results, in the at least two speech recognition results, that correspond to each spoken word;
displaying, in the input order of the spoken words, at least some of the word recognition results corresponding to each spoken word;
generating the information to be input according to the user's selection operation on the displayed word recognition results.
In another aspect, an embodiment of the present invention provides an information input device based on speech recognition, including:
a speech recognition module, configured to recognize input speech and obtain at least two speech recognition results, where each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech;
a word recognition result determining module, configured to determine all word recognition results, in the at least two speech recognition results, that correspond to each spoken word;
a word recognition result display module, configured to display, in the input order of the spoken words, at least some of the word recognition results corresponding to each spoken word as determined by the word recognition result determining module;
an input information generating module, configured to generate the information to be input according to the user's selection operation on the word recognition results displayed by the word recognition result display module.
In a third aspect, an embodiment of the present invention provides an electronic device including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor, when executing the computer program, implements the information input method based on speech recognition described in the embodiments of the present invention.
In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium storing a computer program that, when executed by a processor, implements the steps of the information input method based on speech recognition described in the embodiments of the present invention.
The information input method based on speech recognition disclosed in the embodiments of the present invention recognizes input speech to obtain at least two speech recognition results, where each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech; determines all word recognition results, in the at least two speech recognition results, that correspond to each spoken word; displays at least some of the word recognition results corresponding to each word, in the order of the word recognition result sequence; and finally generates the information to be input according to the user's selection operation on the displayed word recognition results. This solves the low input efficiency of prior-art information input methods based on speech recognition. By displaying the multiple word recognition results corresponding to each spoken word and composing the information to be input from the user's selections, the method generates the information to be input accurately and completes the input, which improves the efficiency of voice input.
Brief description of the drawings
To explain the technical solutions of the embodiments of the present invention more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of the information input method based on speech recognition of Embodiment one of the present invention;
Fig. 2 is a flowchart of the information input method based on speech recognition of Embodiment two of the present invention;
Fig. 3 is the first schematic diagram of word recognition result display in Embodiment two;
Fig. 4 is the second schematic diagram of word recognition result display in Embodiment two;
Fig. 5 is a flowchart of the information input method based on speech recognition of Embodiment three of the present invention;
Fig. 6 is the first schematic diagram of word recognition result display in Embodiment three;
Fig. 7 is the second schematic diagram of word recognition result display in Embodiment three;
Fig. 8 is the first structural diagram of the information input device based on speech recognition of Embodiment four of the present invention;
Fig. 9 is the second structural diagram of the information input device based on speech recognition of Embodiment four of the present invention;
Fig. 10 is the third structural diagram of the information input device based on speech recognition of Embodiment four of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
Embodiment one
An information input method based on speech recognition disclosed by the present invention, as shown in Fig. 1, includes steps 100 to 130.
Step 100: recognize the input speech to obtain at least two speech recognition results.
Each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech.
After the user inputs speech through a voice input control of an application, such as the one on an input method's interface, the user's input speech can be obtained by calling an interface provided by the system. The application then calls a speech recognition engine to recognize the obtained input speech and obtain speech recognition results.
Typically, the speech recognition engine first segments the input speech according to the acoustic characteristics of the speech data and then recognizes each segment separately. When inputting speech, a user usually leaves a brief pause between different words, so after segmentation each segment corresponds to the speech of one word. After recognizing each segment, the speech recognition engine obtains the word recognition result of each spoken word. Because users differ in articulation and accent, the same spoken word may in practice be recognized as multiple word recognition results. Likewise, when the user inputs a homophonous character or word, the engine may obtain multiple word recognition results. In general, therefore, each spoken word yields at least one word recognition result.
The speech recognition engine then uses a language model to correct the word recognition results corresponding to each spoken word in the input speech and combines them into word sequences. When some spoken word has multiple word recognition results, the speech recognition engine obtains at least two speech recognition results for the input speech.
Step 110: determine all word recognition results, in the at least two speech recognition results, that correspond to each spoken word.
After the speech recognition engine, drawing on the word recognition results of each speech segment and the language model, obtains at least two speech recognition results, the word recognition results corresponding to each segment, that is, to each spoken word, can be determined from those results. Take as an example the speech recognition results “Can’t you help me” and “Can you help me” output by the engine. For the input speech it can be determined that the word recognition results corresponding to the first spoken word are “Can’t” and “Can”, the result for the second spoken word is “you”, the result for the third is “help”, and the result for the fourth is “me”. In a specific implementation, the word recognition results corresponding to each spoken word can be determined from the data format of the speech recognition results output by the engine.
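Step 110 can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that each speech recognition result arrives as a space-separated string and that all results contain the same number of words; the patent only says the mapping follows from the engine's output data format. The function name `word_candidates` is hypothetical.

```python
def word_candidates(hypotheses):
    """Collect, for each word position, the distinct words seen across all results.

    Assumption of this sketch: results are space-separated strings of equal length.
    """
    if not hypotheses:
        return []
    token_lists = [h.split() for h in hypotheses]
    length = len(token_lists[0])
    if any(len(tokens) != length for tokens in token_lists):
        raise ValueError("this sketch assumes every result has the same number of words")
    slots = []
    for position in range(length):
        seen = []  # keeps first-seen order and drops duplicates
        for tokens in token_lists:
            if tokens[position] not in seen:
                seen.append(tokens[position])
        slots.append(seen)
    return slots


slots = word_candidates(["Can't you help me", "Can you help me"])
```

Here `slots[0]` holds both candidates for the first spoken word, while the remaining positions hold one word each.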
Step 120: display, in the input order of the spoken words, at least some of the word recognition results corresponding to each spoken word.
Each spoken word in the input speech may correspond to one word recognition result or to several. To make it easy to compose the input information for the input speech by selecting the word recognition result for each spoken word, the present invention displays the word recognition results of each spoken word in a corresponding manner. For example, the word recognition results of each spoken word are shown at the position that matches the order of that spoken word in the input speech. If a spoken word has more than one word recognition result, all of its results, or the subset with the highest recognition scores, may be shown as a list. Alternatively, all of its results, or the subset with the highest recognition scores, may be shown on a dial. As a further alternative, the result with the highest recognition score may be shown highlighted, indicating that the user can select another word recognition result for that spoken word by triggering the highlighted one.
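Showing only the results with the highest recognition scores could be sketched as follows. The `(word, score)` pair format, the numeric scores, and the name `top_candidates` are assumptions made for illustration; the patent does not specify how scores are represented.

```python
def top_candidates(scored, k=2):
    """Return the k words with the highest recognition score, best first."""
    ranked = sorted(scored, key=lambda pair: pair[1], reverse=True)
    return [word for word, _score in ranked[:k]]


# Hypothetical scores for the first spoken word.
shown = top_candidates([("Can", 0.30), ("Can't", 0.61), ("Count", 0.09)], k=2)
```

With `k` set to the number of candidates, the same function covers the case where all word recognition results are shown.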
Step 130: generate the information to be input according to the user's selection operation on the displayed word recognition results.
The user can select displayed word recognition results by operations such as clicking or sliding; the selected word recognition results then form a phrase or sentence that serves as the information to be input.
The information input method based on speech recognition disclosed in this embodiment recognizes input speech to obtain at least two speech recognition results, where each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech; determines all word recognition results, in the at least two speech recognition results, that correspond to each spoken word; displays at least some of the word recognition results corresponding to each word, in the order of the word recognition result sequence; and finally generates the information to be input according to the user's selection operation on the displayed word recognition results. This solves the low input efficiency of prior-art information input methods based on speech recognition. By displaying the multiple word recognition results corresponding to each spoken word and composing the information to be input from the user's selections, the method generates the information to be input accurately and completes the input, which improves the efficiency of voice input.
Embodiment two
An information input method based on speech recognition disclosed in this embodiment, as shown in Fig. 2, includes steps 200 to 230.
Step 200: recognize the input speech to obtain at least two speech recognition results.
Each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech.
For how the input speech is recognized to obtain at least two speech recognition results, see Embodiment one; the details are not repeated here.
Step 210: determine all word recognition results, in the at least two speech recognition results, that correspond to each spoken word.
For how all word recognition results corresponding to each spoken word are determined, see Embodiment one; the details are not repeated here.
In this embodiment, take as an example the speech recognition results “Can’t you hit it” and “Can you heat it” output by the speech recognition engine. For the input speech it can be determined that the word recognition results corresponding to the first spoken word are “Can’t” and “Can”, the result for the second spoken word is “you”, the results for the third are “hit” and “heat”, and the result for the fourth is “it”.
Step 220: in the input order of the spoken words, display side by side at the corresponding position the multiple word recognition results that correspond to the same spoken word.
When the speech recognition results output by the speech recognition engine contain multiple word recognition results for the same spoken word, the step of displaying at least some of the word recognition results corresponding to each spoken word further includes: displaying side by side, at the corresponding position, the multiple word recognition results that correspond to the same spoken word.
Continuing with the word recognition results determined in the previous step, the first spoken word and the third spoken word each correspond to multiple word recognition results. The two results for the first spoken word, “Can’t” and “Can”, are therefore shown side by side at the first position, and the two results for the third spoken word, “hit” and “heat”, are shown side by side at the third position. The second and fourth spoken words each correspond to exactly one word recognition result, so only one result is shown at each of their positions. A schematic of this display is shown in Fig. 3.
In a specific implementation, to convey intuitively both the input order of the spoken words and the ways the word recognition results can be combined into input information, the word recognition results can be connected by arrows to form word chains, as shown in Fig. 4. That is, the word recognition results corresponding to each spoken word are displayed as word chains, where multiple word recognition results corresponding to the same spoken word serve as parallel nodes of different word chains, and a word recognition result in one-to-one correspondence with a spoken word serves as a node shared by the different word chains. As the results displayed in Fig. 4 show, these word recognition results can form four possible inputs: “Can’t you hit it”, “Can you heat it”, “Can’t you heat it” and “Can you hit it”. Displaying the multiple word recognition results for the same spoken word side by side at the corresponding position gives the user more options for the input information.
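The four possible inputs are the Cartesian product of the per-position candidates, which a short Python sketch can enumerate. The `slots` list-of-lists representation is an assumption of this sketch, not a data structure named in the patent.

```python
from itertools import product

# One list of candidates per spoken word, in input order (assumed representation).
slots = [["Can't", "Can"], ["you"], ["hit", "heat"], ["it"]]

# Every path through the word chains picks exactly one node per position.
sentences = [" ".join(path) for path in product(*slots)]
```

The number of paths is the product of the candidate counts, here 2 × 1 × 2 × 1 = 4, so the count grows quickly when many positions are ambiguous.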
Step 230: according to the user's selection operation on the multiple word recognition results shown side by side, generate information to be input that includes the word recognition results targeted by the selection operation.
In a specific implementation, when the speech recognition results output by the speech recognition engine contain multiple word recognition results for the same spoken word, generating the information to be input according to the user's selection operation on the displayed word recognition results further includes: generating, according to the user's selection operation on the multiple word recognition results shown side by side, information to be input that includes the word recognition results targeted by the selection operation.
In a specific implementation, the selection operation includes clicking the word recognition results shown side by side, or performing a slide gesture that passes through word recognition results shown side by side and word recognition results in one-to-one correspondence with a spoken word. Taking the results displayed in Fig. 3 and Fig. 4 as an example, the user can select word recognition results by clicking “Can’t” and “hit” in turn; according to that selection, information to be input that includes the targeted results “Can’t” and “hit” is generated, for example “Can’t you hit it”. The user can also select word recognition results with a slide gesture. For example, the user slides a finger along a word chain shown in Fig. 4 so that the finger passes in turn through the first to fourth nodes on the chain, that is, the nodes containing “Can”, “you”, “heat” and “it”; the word recognition results of the nodes the slide passes through form the information to be input: “Can you heat it”.
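Both selection operations can be sketched in Python as follows. Filling single-candidate positions automatically for a click selection is inferred from the “Can’t you hit it” example; the function names and the `slots` representation are hypothetical.

```python
def compose_from_clicks(slots, clicked):
    """Build the input from clicked words; unambiguous positions fill themselves."""
    words = []
    for candidates in slots:
        if len(candidates) == 1:
            words.append(candidates[0])
            continue
        chosen = [word for word in candidates if word in clicked]
        if len(chosen) != 1:
            raise ValueError("exactly one candidate must be clicked per ambiguous position")
        words.append(chosen[0])
    return " ".join(words)


def compose_from_slide(slots, path):
    """Build the input from the nodes a slide gesture passed through, in order."""
    if len(path) != len(slots):
        raise ValueError("the slide must pass through one node per position")
    for word, candidates in zip(path, slots):
        if word not in candidates:
            raise ValueError(f"{word!r} is not a node at this position")
    return " ".join(path)


slots = [["Can't", "Can"], ["you"], ["hit", "heat"], ["it"]]
by_click = compose_from_clicks(slots, {"Can't", "hit"})
by_slide = compose_from_slide(slots, ["Can", "you", "heat", "it"])
```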
The user can select displayed word recognition results by operations such as clicking or sliding; the selected word recognition results then form a phrase or sentence that serves as the information to be input.
The information input method based on speech recognition disclosed in this embodiment recognizes input speech to obtain at least two speech recognition results, where each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech; determines all word recognition results, in the at least two speech recognition results, that correspond to each spoken word; displays side by side at the corresponding position, in the input order of the spoken words, the multiple word recognition results that correspond to the same spoken word; and finally generates, according to the user's selection operation on the results shown side by side, information to be input that includes the targeted word recognition results. This solves the low input efficiency of prior-art information input methods based on speech recognition. By displaying the multiple word recognition results corresponding to each spoken word and composing the information to be input from the user's selections, the method generates the information to be input accurately and completes the input, which improves the efficiency of voice input. Displaying multiple word recognition results side by side also gives the user more options for the information to be input.
Embodiment three
Correspondingly, this embodiment also discloses an information input method based on speech recognition which, as shown in Fig. 5, includes steps 500 to 540.
Step 500: recognize the input speech to obtain at least two speech recognition results.
Each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each spoken word in the input speech.
For how the input speech is recognized to obtain at least two speech recognition results, see Embodiment one; the details are not repeated here.
Step 510: determine all word recognition results, in the at least two speech recognition results, that correspond to each spoken word.
For how all word recognition results corresponding to each spoken word are determined, see Embodiment one; the details are not repeated here.
In this embodiment, take as an example the speech recognition results “Can’t you help me” and “Can you help me” output by the speech recognition engine. For the input speech it can be determined that the word recognition results corresponding to the first spoken word are “Can’t” and “Can”, the result for the second spoken word is “you”, the result for the third is “help”, and the result for the fourth is “me”.
Step 520: in the input order of the spoken words, display in a first style the multiple word recognition results that correspond to the same spoken word, and display in a second style the word recognition results that are in one-to-one correspondence with a spoken word.
The first style differs from the second style, and the first style indicates that the word recognition result is editable.
When the speech recognition results output by the speech recognition engine contain multiple word recognition results for the same spoken word, the step of displaying at least some of the word recognition results corresponding to each spoken word further includes: displaying in the first style the multiple word recognition results that correspond to the same spoken word, and displaying in the second style the word recognition results that are in one-to-one correspondence with a spoken word.
Continuing with the word recognition results determined in the previous step, the first spoken word corresponds to multiple word recognition results, “Can’t” and “Can”, so the word recognition results of the first spoken word are displayed at the first position in the first style. The second, third and fourth spoken words each correspond to exactly one word recognition result, so one word recognition result is displayed in the second style at each of their positions. In a specific implementation, the first style is any display style that differs from the second style, for example highlighting, underlining, or a bubble. Fig. 6 is a schematic of the display in which the first style is underlining.
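As a plain-text stand-in for the two display styles, the following Python sketch marks editable positions with surrounding underscores. Showing the first candidate at each position and the underscore convention are assumptions of this sketch; a real interface would apply underlining, highlighting, or a bubble through its UI toolkit.

```python
def render(slots):
    """Render one word per position; ambiguous positions get the 'first style'."""
    parts = []
    for candidates in slots:
        top = candidates[0]  # assumed to be the best-scoring candidate
        # Underscores stand in for the first style (editable); plain text is the second style.
        parts.append(f"_{top}_" if len(candidates) > 1 else top)
    return " ".join(parts)


line = render([["Can't", "Can"], ["you"], ["help"], ["me"]])
```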
In a specific implementation, when the speech recognition results output by the speech recognition engine contain multiple word recognition results for the same spoken word, generating the information to be input according to the user's selection operation on the displayed word recognition results further includes: displaying, according to the user's trigger operation, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation, where the candidate word recognition results are selected from the multiple word recognition results of the spoken word corresponding to the targeted word recognition result; and generating the information to be input according to the user's selection operation on the displayed candidate word recognition results.
Step 530: according to the user's trigger operation, display the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation.
In a specific implementation, when the user triggers a word recognition result displayed in the first style, for example by clicking “Can’t” in Fig. 6, a drop-down menu or sub-window pops up, such as 700 in Fig. 7, and the candidate word recognition results corresponding to the triggered result “Can’t”, such as “Can”, are shown in it.
Step 540, according to selection operation of the user to candidate's word recognition result of display, generation includes the choosing Select the information to be entered for candidate's word recognition result that operation is directed to.
A selection operation of the user on the candidate word recognition results displayed in the drop-down menu or pop-up window is detected; for example, an operation in which the user clicks a candidate word recognition result is detected. The clicked candidate word recognition result is then taken as the candidate word recognition result selected by the user, and the selected candidate word recognition result and the other word recognition results together compose the information to be entered. The candidate word recognition results are selected from the multiple word recognition results of the word voice corresponding to the targeted word recognition result “Can’t”, for example “Can’t”, “Can” and “Count”. Taking Fig. 7 as an example, after the user clicks the word recognition result “Can’t”, a drop-down menu pops up showing “Can” and “Count”; the user can then click the candidate word recognition result “Can” to select “Can” as the word recognition result of the first word voice. Finally, the candidate word recognition result “Can” selected by the user and the other word recognition results “you”, “help” and “me” together compose the information to be entered: “Can you help me”.
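Steps 530 and 540 can be illustrated with the following minimal Python sketch. It assumes the displayed words and the per-position candidates are held in plain lists and that words are joined with spaces; the function names are hypothetical and are not taken from the application.

```python
from typing import List


def dropdown_candidates(candidates: List[str], displayed: str) -> List[str]:
    """Step 530 (sketch): list the candidates to show in the drop-down
    menu, i.e. every candidate other than the word already displayed."""
    return [c for c in candidates if c != displayed]


def compose_input(displayed: List[str], position: int,
                  candidates: List[str], choice: str) -> str:
    """Step 540 (sketch): replace the word at `position` with the chosen
    candidate and join it with the other word recognition results."""
    if choice not in candidates:
        raise ValueError("choice must be one of the candidates")
    words = list(displayed)  # copy so the displayed list is not mutated
    words[position] = choice
    return " ".join(words)


displayed = ["Can't", "you", "help", "me"]
first_word_candidates = ["Can't", "Can", "Count"]

menu = dropdown_candidates(first_word_candidates, displayed[0])
text = compose_input(displayed, 0, first_word_candidates, "Can")
```

With the example values from Fig. 7, the menu holds "Can" and "Count", and choosing "Can" produces the information to be entered "Can you help me".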
In a specific implementation, in the process of generating the information to be entered according to the user's selection operation on the displayed at least part of the word recognition results, after the step of displaying, according to the trigger operation of the user, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation, the method further comprises: displaying a voice input control for re-inputting the word voice corresponding to the word recognition result targeted by the trigger operation, such as the voice input icon 710 in Fig. 7. Displaying the voice input control indicates to the user that the voice can be re-input in order to correct the current word recognition result. The user's trigger operation on the voice input control is detected in real time, and the output of the voice input control, namely the re-input word voice, is obtained.
Updating, by the word recognition result of the re-input word voice, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation comprises: performing speech recognition on the re-input word voice in combination with the input voice to obtain an updated word recognition result; and updating, with the updated word recognition result, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation. After the re-input word voice is obtained, it replaces the corresponding segment of the original input voice, thereby fusing the re-input voice with the original voice; the speech recognition engine is then called to recognize the fused voice again, yielding the updated word recognition result of the re-input word voice. Finally, the updated word recognition result is used to update the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation.
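The fusion and re-recognition described above can be sketched as follows. This is only an illustration under stated assumptions: the audio is modelled as a list of samples, the segment boundaries of the word voice are known, the recognizer is an arbitrary callable standing in for the speech recognition engine, and the new candidates replace the old ones outright. None of these names or choices come from the application.

```python
from typing import Callable, List, Tuple

Recognizer = Callable[[List[int]], List[List[str]]]


def splice_segment(original: List[int], start: int, end: int,
                   reinput: List[int]) -> List[int]:
    """Replace samples [start, end) of the original input voice with the
    re-input word voice, producing the fused voice."""
    if not 0 <= start <= end <= len(original):
        raise ValueError("segment must lie inside the original voice")
    return original[:start] + reinput + original[end:]


def update_candidates(original: List[int], segment: Tuple[int, int],
                      reinput: List[int], recognize: Recognizer,
                      word_index: int) -> List[str]:
    """Recognize the fused voice again and return the distinct word
    recognition results found at `word_index`, in first-seen order."""
    fused = splice_segment(original, segment[0], segment[1], reinput)
    updated: List[str] = []
    for sequence in recognize(fused):
        word = sequence[word_index]
        if word not in updated:
            updated.append(word)
    return updated


def fake_recognizer(audio: List[int]) -> List[List[str]]:
    """Stand-in for a real engine; returns fixed results for the demo."""
    return [["Can", "you", "help", "me"], ["Count", "you", "help", "me"]]


original_voice = [0, 1, 2, 3, 4, 5, 6, 7]
new_candidates = update_candidates(original_voice, (0, 2), [9, 9, 9],
                                   fake_recognizer, 0)
```

Recognizing the whole fused voice, rather than the re-input segment alone, lets the engine use the surrounding words as context, which is the reason the text gives for combining the re-input word voice with the input voice.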
The data inputting method based on speech recognition disclosed in the embodiments of the present invention recognizes the input voice to obtain at least two speech recognition results, wherein each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each word voice in the input voice; determines all the word recognition results, in the at least two speech recognition results, that correspond to each word voice; displays, in the input order of the word voices, multiple word recognition results corresponding to the same word voice in the first style, and displays word recognition results that correspond one-to-one with a word voice in the second style; then, according to a trigger operation of the user, displays the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation; and finally generates the information to be entered according to the user's selection operation on the displayed candidate word recognition results. This solves the problem of low input efficiency in prior-art data inputting methods based on speech recognition.
By displaying word recognition results that have multiple candidates in a different style, the display style indicates to the user whether a word recognition result is selectable, and the candidate word recognition results are displayed only when a selection is needed and triggered, which effectively saves the display space occupied by word recognition results. By displaying the voice input control together with the candidate word recognition results, the user can quickly update the input voice when the speech recognition result of a word voice is not the information the user actually intended to input, which further improves the efficiency of inputting information by voice.
Embodiment Four
Correspondingly, an embodiment of the present invention also discloses a data input device based on speech recognition. As shown in Fig. 8, the device includes:
a speech recognition module 810, configured to recognize an input voice to obtain at least two speech recognition results, wherein each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each word voice in the input voice;
a word recognition result determining module 820, configured to determine all the word recognition results, in the at least two speech recognition results, that correspond to each word voice;
a word recognition result display module 830, configured to display, in the input order of the word voices, at least part of the word recognition results corresponding to each word voice as determined by the word recognition result determining module 820;
an input information generating module 840, configured to generate information to be entered according to a selection operation of the user on the at least part of the word recognition results displayed by the word recognition result display module 830.
Optionally, as shown in Fig. 9, the word recognition result display module 830 includes:
a first display unit 8301, configured to display side by side, at the corresponding position, the multiple word recognition results that correspond to the same word voice.
Optionally, the input information generating module 840 further includes:
a first input information generating unit 8401, configured to generate, according to the user's selection operation on the multiple word recognition results displayed side by side, information to be entered that includes the word recognition result targeted by the selection operation.
Optionally, the selection operation includes: clicking a word recognition result displayed side by side; or performing a slide gesture that passes through a word recognition result displayed side by side and the word recognition results that correspond one-to-one with word voices.
Optionally, as shown in Fig. 10, the word recognition result display module 830 includes:
a second display unit 8302, configured to display multiple word recognition results corresponding to the same word voice in the first style, and to display word recognition results that correspond one-to-one with a word voice in the second style, wherein the first style is used to indicate that the word recognition result is editable.
Optionally, the word recognition result display module 830 further includes:
a third display unit 8303, configured to display, according to a trigger operation of the user, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation, the candidate word recognition results being selected from the multiple word recognition results of the word voice corresponding to the targeted word recognition result.
Optionally, the input information generating module 840 further includes:
a second input information generating unit 8402, configured to generate, according to the user's selection operation on the displayed candidate word recognition results, information to be entered that includes the candidate word recognition result targeted by the selection operation.
Optionally, as shown in Fig. 10, the word recognition result display module 830 also includes:
a fourth display unit 8304, configured to display a voice input control for re-inputting the word voice corresponding to the word recognition result targeted by the trigger operation;
a word recognition result updating unit 8305, configured to update, by the word recognition result of the re-input word voice, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation.
Optionally, the word recognition result updating unit 8305 is specifically configured to:
perform speech recognition on the re-input word voice in combination with the input voice to obtain an updated word recognition result; and update, with the updated word recognition result, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation.
The data input device based on speech recognition disclosed in the embodiments of the present invention recognizes the input voice to obtain at least two speech recognition results, wherein each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each word voice in the input voice; determines all the word recognition results, in the at least two speech recognition results, that correspond to each word voice; displays, in the input order of the word voices, multiple word recognition results corresponding to the same word voice in the first style, and displays word recognition results that correspond one-to-one with a word voice in the second style; then, according to a trigger operation of the user, displays the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation; and finally generates the information to be entered according to the user's selection operation on the displayed candidate word recognition results. This solves the problem of low input efficiency in prior-art data inputting methods based on speech recognition.
By displaying word recognition results that have multiple candidates in a different style, the display style indicates to the user whether a word recognition result is selectable, and the candidate word recognition results are displayed only when a selection is needed and triggered, which effectively saves the display space occupied by word recognition results. By displaying the voice input control together with the candidate word recognition results, the user can quickly update the input voice when the speech recognition result of a word voice is not the information the user actually intended to input, which further improves the efficiency of inputting information by voice.
Correspondingly, the present invention also discloses an electronic device, including a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the computer program, implements the data inputting method based on speech recognition described in Embodiments One to Three of the present invention. The electronic device may be a smartphone, a tablet computer, a game console, an e-reader, or the like.
Correspondingly, the present invention also discloses a computer-readable storage medium on which a computer program is stored, wherein the program, when executed by a processor, implements the steps of the data inputting method based on speech recognition described in Embodiments One to Three of the present invention.
The embodiments in this specification are described in a progressive manner; each embodiment focuses on its differences from the other embodiments, and for identical or similar parts the embodiments may refer to one another. Because the device and electronic device embodiments are substantially similar to the method embodiments, they are described relatively briefly; for related details, refer to the corresponding description of the method embodiments.
The data inputting method, device and electronic equipment based on speech recognition provided by the present application have been described in detail above. Specific examples are used herein to explain the principles and implementations of the application; the description of the above embodiments is intended only to help in understanding the method of the application and its core idea. Meanwhile, for those of ordinary skill in the art, changes may be made to the specific implementations and the scope of application in accordance with the idea of the application. In summary, the contents of this specification should not be construed as limiting the application.
From the above description of the embodiments, those skilled in the art can clearly understand that each embodiment can be implemented by software plus a necessary general-purpose hardware platform, and of course can also be implemented by hardware. Based on this understanding, the essence of the above technical solution, or the part that contributes to the prior art, can be embodied in the form of a software product. The computer software product can be stored in a computer-readable storage medium, such as ROM/RAM, a magnetic disk, or an optical disc, and includes a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform the methods described in the embodiments or in certain parts of the embodiments.

Claims (11)

1. A data inputting method based on speech recognition, characterized by comprising:
recognizing an input voice to obtain at least two speech recognition results, wherein each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each word voice in the input voice;
determining all the word recognition results, in the at least two speech recognition results, that correspond to each word voice;
displaying, in the input order of the word voices, at least part of the word recognition results corresponding to each word voice; and
generating information to be entered according to a selection operation of the user on the displayed at least part of the word recognition results.
2. The method of claim 1, characterized in that the step of displaying at least part of the word recognition results corresponding to each word voice comprises:
displaying side by side, at the corresponding position, the multiple word recognition results that correspond to the same word voice.
3. The method of claim 2, characterized in that the step of generating information to be entered according to a selection operation of the user on the displayed at least part of the word recognition results comprises:
generating, according to the user's selection operation on the multiple word recognition results displayed side by side, information to be entered that includes the word recognition result targeted by the selection operation.
4. The method of claim 3, characterized in that the selection operation comprises: clicking a word recognition result displayed side by side; or performing a slide gesture that passes through a word recognition result displayed side by side and the word recognition results that correspond one-to-one with word voices.
5. The method of claim 1, characterized in that the step of displaying at least part of the word recognition results corresponding to each word voice comprises:
displaying multiple word recognition results corresponding to the same word voice in a first style, and displaying word recognition results that correspond one-to-one with a word voice in a second style, wherein the first style is used to indicate that the word recognition result is editable.
6. The method of claim 5, characterized in that the step of generating information to be entered according to a selection operation of the user on the displayed at least part of the word recognition results comprises:
displaying, according to a trigger operation of the user, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation, the candidate word recognition results being selected from the multiple word recognition results of the word voice corresponding to the targeted word recognition result; and
generating, according to the user's selection operation on the displayed candidate word recognition results, information to be entered that includes the candidate word recognition result targeted by the selection operation.
7. The method of claim 6, characterized in that, after the step of displaying, according to the trigger operation of the user, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation, the method further comprises:
displaying a voice input control for re-inputting the word voice corresponding to the word recognition result targeted by the trigger operation; and
updating, by the word recognition result of the re-input word voice, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation.
8. The method of claim 7, characterized in that the step of updating, by the word recognition result of the re-input word voice, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation comprises:
performing speech recognition on the re-input word voice in combination with the input voice to obtain an updated word recognition result; and
updating, with the updated word recognition result, the candidate word recognition results corresponding to the word recognition result targeted by the trigger operation.
9. A data input device based on speech recognition, characterized by comprising:
a speech recognition module, configured to recognize an input voice to obtain at least two speech recognition results, wherein each speech recognition result is a word recognition result sequence composed of the word recognition results corresponding to each word voice in the input voice;
a word recognition result determining module, configured to determine all the word recognition results, in the at least two speech recognition results, that correspond to each word voice;
a word recognition result display module, configured to display, in the input order of the word voices, at least part of the word recognition results corresponding to each word voice as determined by the word recognition result determining module; and
an input information generating module, configured to generate information to be entered according to a selection operation of the user on the at least part of the word recognition results displayed by the word recognition result display module.
10. An electronic device, comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, characterized in that the processor, when executing the computer program, implements the data inputting method based on speech recognition of any one of claims 1 to 8.
11. A computer-readable storage medium on which a computer program is stored, characterized in that the program, when executed by a processor, implements the steps of the data inputting method based on speech recognition of any one of claims 1 to 8.
CN201710358744.4A 2017-05-19 2017-05-19 A kind of data inputting method and device based on speech recognition, electronic equipment Pending CN107316639A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710358744.4A CN107316639A (en) 2017-05-19 2017-05-19 A kind of data inputting method and device based on speech recognition, electronic equipment

Publications (1)

Publication Number Publication Date
CN107316639A true CN107316639A (en) 2017-11-03

Family

ID=60181507

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710358744.4A Pending CN107316639A (en) 2017-05-19 2017-05-19 A kind of data inputting method and device based on speech recognition, electronic equipment

Country Status (1)

Country Link
CN (1) CN107316639A (en)

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20171103