Nothing Special   »   [go: up one dir, main page]

CN107564528A - A kind of speech recognition text and the method and apparatus of order word text matches - Google Patents

A kind of speech recognition text and the method and apparatus of order word text matches Download PDF

Info

Publication number
CN107564528A
CN107564528A CN201710849743.XA CN201710849743A CN107564528A CN 107564528 A CN107564528 A CN 107564528A CN 201710849743 A CN201710849743 A CN 201710849743A CN 107564528 A CN107564528 A CN 107564528A
Authority
CN
China
Prior art keywords
word
text
similarity
order
phonetic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710849743.XA
Other languages
Chinese (zh)
Other versions
CN107564528B (en
Inventor
姚佳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Hui He science and Technology Development Co., Ltd.
Original Assignee
Shenzhen City Artificial Intelligence Technology Co Secluded Orchid In A Deserted Valley
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen City Artificial Intelligence Technology Co Secluded Orchid In A Deserted Valley filed Critical Shenzhen City Artificial Intelligence Technology Co Secluded Orchid In A Deserted Valley
Priority to CN201710849743.XA priority Critical patent/CN107564528B/en
Publication of CN107564528A publication Critical patent/CN107564528A/en
Application granted granted Critical
Publication of CN107564528B publication Critical patent/CN107564528B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention proposes the method and apparatus of a kind of speech recognition text and order word text matches, and method includes:Obtain the text obtained by speech recognition;Cutting word is carried out to text, to generate multiple text words;And cutting word is carried out to default order word text, to generate order word;It is determined that with each text Word similarity highest order word, and the first corresponding relation is generated with this;The Word similarity of text and order word text is determined based on each highest similarity;Determine the phonetic of text and each word in order word text;It is determined that the phonetic with the word in the phonetic similarity degree highest order word of each word in text, and the second corresponding relation is generated with this;The pinyin similarity of text and order word text is determined based on each highest pinyin similarity;The similarity of text and order word text is determined based on Word similarity and pinyin similarity.This programme, it is higher to the tolerance of speech recognition errors without labeled data, and ensure the subsequently matching with order word.

Description

A kind of speech recognition text and the method and apparatus of order word text matches
Technical field
The present invention relates to identification field, the method for more particularly to a kind of speech recognition text and order word text matches and set It is standby.
Background technology
In current interactive voice, by speech recognition it is text first with speech recognition technology, then carries out phase again The processing answered, in this process, just it is frequently necessary to match the text of speech recognition with function command word, determines user Target;But in actual applications, due to some function command words are too short, interrogatory, be not accordant to the old routine expression the problems such as, The text that can cause to identify is with actually there is larger difference.
At present, it is typically all directly to consider the similarity between text, current text in general text matches algorithm The scheme of this matching algorithm has:
A, character string rank, such as according to (cum rights) editing distance, retrieval model etc.;
B, shallow semantic rank, for example on the basis of A, build similar dictionary;Or interdependent syntax point is carried out to text Analysis etc., introduce word order information and be compared;
C, Deep Semantics rank, entered based on deep learning model more fiery at present, such as RNN, Bi-LSTM, GRU, CNN etc. Row Deep Semantics compare.
The objective shortcoming of prior art:
Current text matches algorithm is not very suitable in fact in the matching scene of speech recognition text and order word With.In the case that certain mistake occurs in speech recognition, substantially existing all methods can all go wrong;Wherein, with regard to character string For rank, enough semantic informations can not be matched, and in the case of having certain error rate in speech recognition, crash rate is just Can be very high;And for shallow semantic rank, the Shallow Semantic Parsing of interdependent syntactic analysis etc is built, can all be consumed more Processing time, and this can cause the reduction of environmental efficiency on whole line, be not very actual;And based on shallow semantic point Analysis, for colloquial style (or even speech recognition errors) expression way adaptability there is also certain otherness, and for structure Similar dictionary is built, then for the comparison aspect of character string rank is carried out, can not similarly solve asking for speech recognition errors Topic;As for Deep Semantics level otherwise, it is required for substantial amounts of labeled data, and in such a more emerging field, The acquisition of labeled data is great, or even is directly impossible in the short time.
Thus, user with during robot interactive, it is necessary to saying function command word can match, also just can be with Corresponding function is entered, but is based on aforesaid way, in the case that voice recognition information is insufficient, the result meeting of speech recognition There is larger deviation.
The content of the invention
For in the prior art the defects of, the present invention proposes the side of speech recognition text and order word text matches a kind of Method and equipment, speech recognition errors can greatly be tolerated by realizing, and ensure the subsequently matching with order word.
Specifically, the present invention proposes embodiment in detail below:
A kind of method that the embodiment of the present invention proposes speech recognition text and order word text matches, applied to man-machine friendship Mutual scene, this method include:
Obtain the text obtained by speech recognition;
Cutting word is carried out to the text, to generate multiple text words;And cutting word is carried out to default order word text, with life Into order word;
It is determined that with each text Word similarity highest order word, and the first corresponding relation is generated with this;Wherein, in institute State in the first corresponding relation, identified order word only corresponds to a text word;
The Word similarity of the text and the order word text is determined based on similarity described in each highest;
Determine the phonetic of the text and each word in the order word text;
It is determined that the phonetic with the word in order word described in the phonetic similarity degree highest of each word in the text, and with This second corresponding relation of generation;Wherein, in second corresponding relation, the phonetic of each word only corresponds to one in the text The phonetic of word in the order word;
The pinyin similarity of the text and the order word text is determined based on each highest pinyin similarity;
The similarity of the text and the order word text is determined based on the Word similarity and the pinyin similarity.
In a specific embodiment, in addition to:
Obtain the text data that quantity exceedes certain value;
First similarity of word and word in the text data is determined based on word2vec;
Second similarity of word and word in the text data is determined based on hownet;
Based on first similarity and second similarity, the phase between any two word in the text data is obtained Like spending and build the similar vocabulary of near synonym;
Determine all phonetics of all words in the text data;
Collect the Pinyin information that pronunciation meets default condition of similarity;
Similarity between two phonetic is determined based on acquired phonetic and Pinyin information and builds the bag text The similar table of phonetic of all word phonetics in data;
The cutting word is carried out based on the similarity between any two word;
In a specific embodiment, " it is determined that with each text Word similarity highest order word " includes:
For each text word, determine that the text word is similar to each order word based on the similar vocabulary of near synonym Degree;
Size sequence is carried out to the similarity, to determine and the text Word similarity highest order word;
It is described " it is determined that the spelling with the word in order word described in the phonetic similarity degree highest of each word in the text Sound " includes:
For the word of each text, based on the similar table of the phonetic determine with the phonetic of the word of the text with it is each described The pinyin similarity of word phonetic in order word;
Size sequence is carried out to the pinyin similarity, to determine the pinyin similarity with the phonetic of the word in the text The phonetic of word in highest order word.
It is described " text and the life to be determined based on similarity described in each highest in a specific embodiment Make the Word similarity of word text " include:
Obtain the number of words of the word of similarity described in similarity described in each highest and each highest;
Word length band weight average is determined based on similarity described in the number of words, the number of words of the text, each highest;
The word length is arranged to the Word similarity of the text and the order word text with weight average.
It is described " text and the life to be determined based on each highest pinyin similarity in a specific embodiment Make the pinyin similarity of word text " include:
Determine the average value of each highest pinyin similarity;
The average value is defined as to the pinyin similarity of the text and the order word text.
It is described " text to be determined with the pinyin similarity based on the Word similarity in a specific embodiment Sheet and the similarity of the order word text " includes:
Summation is weighted to the Word similarity and the pinyin similarity, to obtain the text and the order word The similarity of text;Wherein, the weighted sum is completed by following equation;
Sim_x_y=0.6*sim_w_x_y+0.4*pinyin_sim_x_y;Wherein, the sim_x_y is the text With the similarity of the order word text;The sim_w_x_y is the Word similarity;The pinyin_sim_x_y is described The similarity of order word text.
The equipment that the embodiment of the present invention also proposed a kind of speech recognition text and order word text matches, applied to man-machine Interactive scene, the equipment include:
Acquisition module, for obtaining the text obtained by speech recognition;
Cutting word module, cutting word is carried out to the text, to generate multiple text words;And default order word text is carried out Cutting word, to generate order word;
First generation module, for determination and each text Word similarity highest order word, and first is generated with this Corresponding relation;Wherein, in first corresponding relation, identified order word only corresponds to a text word;
Word similarity determining module, for determining the text and order word text based on similarity described in each highest This Word similarity;
Phonetic determining module, for determining the phonetic of the text and each word in the order word text;
Second generation module, for determining and order word described in the phonetic similarity degree highest of each word in the text In word phonetic, and the second corresponding relation is generated with this;Wherein, it is each in the text in second corresponding relation The phonetic of word only corresponds to the phonetic of the word in an order word;
Pinyin similarity determining module, for determining the text and the order word based on each highest pinyin similarity The pinyin similarity of text;
Processing module, for determining the text and the order word based on the Word similarity and the pinyin similarity The similarity of text.
In a specific embodiment, in addition to:Pretreatment module, it is used for
Obtain the text data that quantity exceedes certain value;
First similarity of word and word in the text data is determined based on word2vec;
Second similarity of word and word in the text data is determined based on hownet;
Based on first similarity and second similarity, the phase between any two word in the text data is obtained Like spending and build the similar vocabulary of near synonym;
Determine all phonetics of all words in the text data;
Collect the Pinyin information that pronunciation meets default condition of similarity;
Similarity between two phonetic is determined based on acquired phonetic and Pinyin information and builds the bag text The similar table of phonetic of all word phonetics in data;
The cutting word is carried out based on the similarity between any two word;
First generation module " it is determined that with each text Word similarity highest order word " includes:
For each text word, determine that the text word is similar to each order word based on the similar vocabulary of near synonym Degree;
Size sequence is carried out to the similarity, to determine and the text Word similarity highest order word;
Second generation module " it is determined that with order word described in the phonetic similarity degree highest of each word in the text In word phonetic " include:
For the word of each text, based on the similar table of the phonetic determine with the phonetic of the word of the text with it is each described The pinyin similarity of word phonetic in order word;
Size sequence is carried out to the pinyin similarity, to determine the pinyin similarity with the phonetic of the word in the text The phonetic of word in highest order word.
In a specific embodiment, the Word similarity determining module, it is used for:
Obtain the number of words of the word of similarity described in similarity described in each highest and each highest;
Word length band weight average is determined based on similarity described in the number of words, the number of words of the text, each highest;
The word length is arranged to the Word similarity of the text and the order word text with weight average.
In a specific embodiment, the pinyin similarity determining module, it is used for:
Determine the average value of each highest pinyin similarity;
The average value is defined as to the pinyin similarity of the text and the order word text.
In a specific embodiment, the processing module, it is used for:
Summation is weighted to the Word similarity and the pinyin similarity, to obtain the text and the order word The similarity of text;Wherein, the weighted sum is completed by following equation;
Sim_x_y=0.6*sim_w_x_y+0.4*pinyin_sim_x_y;Wherein, the sim_x_y is the text With the similarity of the order word text;The sim_w_x_y is the Word similarity;The pinyin_sim_x_y is described The similarity of order word text.
With this, the embodiment of the present invention proposes the method and apparatus of a kind of speech recognition text and order word text matches, Applied to the scene of man-machine interaction, wherein, this method includes:Obtain the text obtained by speech recognition;The text is entered Row cutting word, to generate multiple text words;And cutting word is carried out to default order word text, to generate order word;It is determined that with each institute Text Word similarity highest order word is stated, and the first corresponding relation is generated with this;Wherein, in first corresponding relation, Identified order word only corresponds to a text word;The text and the order word are determined based on similarity described in each highest The Word similarity of text;Determine the phonetic of the text and each word in the order word text;It is determined that with it is every in the text The phonetic of word in order word described in the phonetic similarity degree highest of individual word, and the second corresponding relation is generated with this;Wherein, exist In second corresponding relation, the phonetic of each word only corresponds to the phonetic of the word in an order word in the text;Base The pinyin similarity of the text and the order word text is determined in each highest pinyin similarity;Based on the Word similarity The similarity of the text and the order word text is determined with the pinyin similarity.By this programme, without labeled data, It is higher to the tolerance of speech recognition errors, and ensure the subsequently matching with order word.
Brief description of the drawings
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below by embodiment it is required use it is attached Figure is briefly described, it will be appreciated that the following drawings illustrate only certain embodiments of the present invention, therefore be not construed as pair The restriction of scope, for those of ordinary skill in the art, on the premise of not paying creative work, can also be according to this A little accompanying drawings obtain other related accompanying drawings.
Fig. 1 is that a kind of speech recognition text that the embodiment of the present invention proposes and the flow of the method for order word text matches are shown It is intended to;
Fig. 2 is that a kind of speech recognition text that the embodiment of the present invention proposes and the structure of the equipment of order word text matches are shown It is intended to.
Embodiment
Hereinafter, the various embodiments of the disclosure will be described more fully.The disclosure can have various embodiments, and It can adjust and change wherein.It should be understood, however, that:It is limited to spy disclosed herein in the absence of by the various embodiments of the disclosure Determine the intention of embodiment, but the disclosure should be interpreted as covering in the spirit and scope for the various embodiments for falling into the disclosure All adjustment, equivalent and/or alternatives.
Embodiment 1
The embodiment of the invention discloses a kind of speech recognition text and the method for order word text matches, applied to man-machine friendship Mutual scene, as shown in figure 1, this method includes:
The text that step 101, acquisition are obtained by speech recognition;
Specifically, speech data is identified by speech recognition, to generate text.
Step 102, cutting word is carried out to the text, to generate multiple text words;And default order word text is carried out Cutting word, to generate order word;
Specifically, carrying out cutting word to text and default order word file, the preferable cutting word can be based on any Similarity between two words can avoid excessive invalid cutting word from handling come what is carried out with this;Specifically, namely based on any two Similarity between word carries out cutting word to the text, and carries out cutting word to default order word text;
And the similarity between specific any two word can be obtained based on following flows:
Obtain the text data that quantity exceedes certain value;
First similarity of word and word in the text data is determined based on word2vec;
Second similarity of word and word in the text data is determined based on hownet;
Based on first similarity and second similarity, the phase between any two word in the text data is obtained Like spending and build the similar vocabulary of near synonym;
Determine all phonetics of all words in the text data;
Pronunciation meets the Pinyin information of default condition of similarity;
Similarity between two phonetic is determined based on acquired phonetic and Pinyin information and builds the bag text The similar table of phonetic of all word phonetics in data;
The cutting word is carried out based on the similarity between any two word;
Specifically, illustrated with specific embodiment:
It is possible, firstly, to collect nearly 5G text data by the method crawled on the net, word2vec structures are then based on The similarity of word and word;Based on the similar vocabulary of word2vec and hownet structure near synonym, obtain similar between any two word Degree;It is defined as word_sim_fuc (w1, w2) function;
Specifically, it can be based on to two similarities (based on the similarity that word2vec is obtained with being obtained based on hownet Similarity) averagely obtained similarity between any two word.
Then, obtain all words is possible to phonetic (polyphone)
3rd, and the more similar Pinyin information of collection pronunciation (for example f-h is easily mixed, r-l is easily mixed, and rear nasal sound and pre-nasal sound are easy Mix), based on this part of data, the similar pinyin of all word phonetics can be constructed.And according to similar categorization, such as with artificial Mode or other identification methods give certain similar value.Such as, similarity can be 0.9 between ting-tin, It can be 0.7 for the similarity between fu-hu, can be 0.7 for similarity between rou-lou;For two phonetic Similarity can be defined as pinyin_sim_fuc (pinyin1, pinyin2).
Step 103, determination and each text Word similarity highest order word, and the first corresponding relation is generated with this; Wherein, in first corresponding relation, identified order word only corresponds to a text word;
Specifically, " it is determined that with each text Word similarity highest order word " in step 103 includes:
For each text word, determine that the text word is similar to each order word based on the similar vocabulary of near synonym Degree;
Size sequence is carried out to the similarity, to determine and the text Word similarity highest order word;
Step 104, the Word similarity for determining based on similarity described in each highest the text and the order word text;
Specifically, described in step 104 " determines the text and the order word based on similarity described in each highest The Word similarity of text " includes:
Obtain the number of words of the word of similarity described in similarity described in each highest and each highest;
Word length band weight average is determined based on similarity described in the number of words, the number of words of the text, each highest;
The word length is arranged to the Word similarity of the text and the order word text with weight average.
Specifically, after the text of speech recognition is obtained, can be compared with function command word;Define speech recognition text This (x, behind have x expressions), and function command word text (y, behind represented with y), specifically, in one example:With Example illustrates.Assuming that x is:Mutual thwack is opened;And y is:Open annex.
According to the similarity of foregoing obtained any two word;To x, y is successively after cutting word, to each word (w_x_ in x I), the word (w_y_j) of most like (utilizing the word_sim_fuc functions in A) is looked in y, similarity is designated as sim_w_i_j, and And ensure that w_y_j can only correspond to a w_x_i;After each w_x_i similarity of most like word is obtained, word length cum rights is taken X is averagely used as, y Word similarity, is designated as sim_w_x_y;
Specifically, being illustrated still exemplified by above-mentioned, after x cutting words, become mutually-strong-opening;And to y cutting words after, Become unlatching-annex;So opening-unlatching is corresponding here;It is remaining, it is exactly annex-mutual correspondence, does not correspond to word by force;
Assuming that it is 0.9 to open with the similarity opened, annex is 0 with mutual similarity, strong similarity 0.So sim_w_ X_y=0.5*0.9+0.25*0+0.25*0=0.45.
Specifically, implication of the word length with weight average is the band weight average of the length based on word length in the text, specifically, Such as " opening " has 2 words, and " mutual thwack is opened " then has 4 words, therefore the ratio of word length is 0.5, other " mutual " and " strong " is identical with this.
Step 105, the phonetic for determining the text and each word in the order word text;
Step 106, determine and the word in order word described in the phonetic similarity degree highest of each word in the text Phonetic, and the second corresponding relation is generated with this;Wherein, in second corresponding relation, the phonetic of each word in the text The only phonetic of the word in a corresponding order word;
Wherein, it is described " it is determined that with being ordered described in the phonetic similarity degree highest of each word in the text in step 106 Make the phonetic of the word in word " include:
For the word of each text, based on the similar table of the phonetic determine with the phonetic of the word of the text with it is each described The pinyin similarity of word phonetic in order word;
Size sequence is carried out to the pinyin similarity, to determine the pinyin similarity with the phonetic of the word in the text The phonetic of word in highest order word.
Step 107, based on each highest pinyin similarity determine that the text is similar to the phonetic of the order word text Degree;
Specifically, described " determine the phonetic of the text and the order word text based on each highest pinyin similarity Similarity " includes:
Determine the average value of each highest pinyin similarity;
The average value is defined as to the pinyin similarity of the text and the order word text.
With this, illustrated based on one embodiment, according to all phonetics of foregoing obtained word, obtain x, it is every in y The phonetic of individual word.
For each word in x phonetic (pinyin_x_i) looked in y it is most like (utilize B in pinyin_sim_fuc Function) word phonetic (pinyin_y_j), similarity is designated as sim_pinyin_i_j, and ensures that pinyin_x_i only can be right Answer a pinyin_y_j;After each pinyin_x_i most like word phonetic is obtained, take it is average be used as x, y phonetic is similar Degree, is designated as pinyin_sim_x_y;
Specifically, still exemplified by above-mentioned, it is that hu jiang/qiang da kai Y open annex spelling that the mutual thwacks of X, which open phonetic, Sound is kai qi fu jian
Begun stepping through first from x, hu it is most like be fu, similarity 0.7
Jiang/qiang it is most like be jian, similarity 0.9
It is exactly kai that Kai is most like, similarity 1.0
Da and qi is remaining, is matched, similarity 0
Averagely get off, pinyin similarity 2.6/4=0.65
Step 108, the text and the order word text determined based on the Word similarity and the pinyin similarity Similarity.
In one embodiment, it is described " text and institute to be determined with the pinyin similarity based on the Word similarity State the similarity of order word text " include:
Summation is weighted to the Word similarity and the pinyin similarity, to obtain the text and the order word The similarity of text;Wherein, the weighted sum is completed by following equation;
Sim_x_y=0.6*sim_w_x_y+0.4*pinyin_sim_x_y;Wherein, the sim_x_y is the text With the similarity of the order word text;The sim_w_x_y is the Word similarity;The pinyin_sim_x_y is described The similarity of order word text.
Still can be exemplified by above-mentioned, then last similarity is 0.6*0.45+0.4*0.65=0.53.
Embodiment 2
The equipment that the embodiment of the present invention 2 also discloses a kind of speech recognition text and order word text matches, applied to people The scene of machine interaction, as shown in Fig. 2 the equipment includes:
Acquisition module 201, for obtaining the text obtained by speech recognition;
Cutting word module 202, cutting word is carried out to the text, to generate multiple text words;And to default order word text Cutting word is carried out, to generate order word;
First generation module 203, for determine with each text Word similarity highest order word, and generate the with this One corresponding relation;Wherein, in first corresponding relation, identified order word only corresponds to a text word;
Word similarity determining module 204, for determining the text and the order based on similarity described in each highest The Word similarity of word text;
Phonetic determining module 205, for determining the phonetic of the text and each word in the order word text;
Second generation module 206, for determining and life described in the phonetic similarity degree highest of each word in the text The phonetic of the word in word is made, and the second corresponding relation is generated with this;Wherein, in second corresponding relation, in the text The phonetic of each word only corresponds to the phonetic of the word in an order word;
Pinyin similarity determining module 207, for determining the text and the life based on each highest pinyin similarity Make the pinyin similarity of word text;
Processing module 208, for determining the text and the life based on the Word similarity and the pinyin similarity Make the similarity of word text.
In one embodiment, in addition to:Pretreatment module, it is used for
Obtain the text data that quantity exceedes certain value;
First similarity of word and word in the text data is determined based on word2vec;
Second similarity of word and word in the text data is determined based on hownet;
Based on first similarity and second similarity, the phase between any two word in the text data is obtained Like spending and build the similar vocabulary of near synonym;
Determine all phonetics of all words in the text data;
Collect the Pinyin information that pronunciation meets default condition of similarity;
Similarity between two phonetic is determined based on acquired phonetic and Pinyin information and builds the bag text The similar table of phonetic of all word phonetics in data;
The cutting word is carried out based on the similarity between any two word;
First generation module " it is determined that with each text Word similarity highest order word " includes:
For each text word, determine that the text word is similar to each order word based on the similar vocabulary of near synonym Degree;
Size sequence is carried out to the similarity, to determine and the text Word similarity highest order word;
Second generation module " it is determined that with order word described in the phonetic similarity degree highest of each word in the text In word phonetic " include:
For the word of each text, based on the similar table of the phonetic determine with the phonetic of the word of the text with it is each described The pinyin similarity of word phonetic in order word;
Size sequence is carried out to the pinyin similarity, to determine the pinyin similarity with the phonetic of the word in the text The phonetic of word in highest order word.
In one embodiment, the Word similarity determining module, is used for:
Obtain the number of words of the word of similarity described in similarity described in each highest and each highest;
Word length band weight average is determined based on similarity described in the number of words, the number of words of the text, each highest;
The word length is arranged to the Word similarity of the text and the order word text with weight average.
In one embodiment, the pinyin similarity determining module, is used for:
Determine the average value of each highest pinyin similarity;
The average value is defined as to the pinyin similarity of the text and the order word text.
In one embodiment, the processing module, is used for:
Summation is weighted to the Word similarity and the pinyin similarity, to obtain the text and the order word The similarity of text;Wherein, the weighted sum is completed by following equation;
Sim_x_y=0.6*sim_w_x_y+0.4*pinyin_sim_x_y;Wherein, the sim_x_y is the text With the similarity of the order word text;The sim_w_x_y is the Word similarity;The pinyin_sim_x_y is described The similarity of order word text.
With this, the embodiment of the present invention proposes the method and apparatus of a kind of speech recognition text and order word text matches, Applied to the scene of man-machine interaction, wherein, this method includes:Obtain the text obtained by speech recognition;The text is entered Row cutting word, to generate multiple text words;And cutting word is carried out to default order word text, to generate order word;It is determined that with each institute Text Word similarity highest order word is stated, and the first corresponding relation is generated with this;Wherein, in first corresponding relation, Identified order word only corresponds to a text word;The text and the order word are determined based on similarity described in each highest The Word similarity of text;Determine the phonetic of the text and each word in the order word text;It is determined that with it is every in the text The phonetic of word in order word described in the phonetic similarity degree highest of individual word, and the second corresponding relation is generated with this;Wherein, exist In second corresponding relation, the phonetic of each word only corresponds to the phonetic of the word in an order word in the text;Base The pinyin similarity of the text and the order word text is determined in each highest pinyin similarity;Based on the Word similarity The similarity of the text and the order word text is determined with the pinyin similarity.By this programme, without labeled data, It is higher to the tolerance of speech recognition errors, and ensure the subsequently matching with order word.
It will be appreciated by those skilled in the art that accompanying drawing is a schematic diagram for being preferable to carry out scene, module in accompanying drawing or Flow is not necessarily implemented necessary to the present invention.
It will be appreciated by those skilled in the art that the module in device in implement scene can be described according to implement scene into Row is distributed in the device of implement scene, can also carry out one or more dresses that respective change is disposed other than this implement scene In putting.The module of above-mentioned implement scene can be merged into a module, can also be further split into multiple submodule.
The invention described above sequence number is for illustration only, does not represent the quality of implement scene.
Disclosed above is only several specific implementation scenes of the present invention, and still, the present invention is not limited to this, Ren Heben What the technical staff in field can think change should all fall into protection scope of the present invention.

Claims (10)

1. a kind of method of speech recognition text and order word text matches, it is characterised in that applied to the scene of man-machine interaction, This method includes:
Obtain the text obtained by speech recognition;
Cutting word is carried out to the text, to generate multiple text words;And cutting word is carried out to default order word text, to generate life Make word;
It is determined that with each text Word similarity highest order word, and the first corresponding relation is generated with this;Wherein, described In one corresponding relation, identified order word only corresponds to a text word;
The Word similarity of the text and the order word text is determined based on similarity described in each highest;
Determine the phonetic of the text and each word in the order word text;
It is determined that the phonetic with the word in order word described in the phonetic similarity degree highest of each word in the text, and given birth to this Into the second corresponding relation;Wherein, in second corresponding relation, the phonetic of each word is only corresponded to described in one in the text The phonetic of word in order word;
The pinyin similarity of the text and the order word text is determined based on each highest pinyin similarity;
The similarity of the text and the order word text is determined based on the Word similarity and the pinyin similarity.
2. the method as described in claim 1, it is characterised in that also include:
Obtain the text data that quantity exceedes certain value;
First similarity of word and word in the text data is determined based on word2vec;
Second similarity of word and word in the text data is determined based on hownet;
Based on first similarity and second similarity, the similarity between any two word in the text data is obtained And build the similar vocabulary of near synonym;
Determine all phonetics of all words in the text data;
Collect the Pinyin information that pronunciation meets default condition of similarity;
Similarity between two phonetic is determined based on acquired phonetic and Pinyin information and builds the bag text data In all word phonetics the similar table of phonetic;
The cutting word is carried out based on the similarity between any two word;
" it is determined that with each text Word similarity highest order word " includes:
For each text word, the similarity of the text word and each order word is determined based on the similar vocabulary of near synonym;
Size sequence is carried out to the similarity, to determine and the text Word similarity highest order word;
Described " it is determined that phonetic with the word in order word described in the phonetic similarity degree highest of each word in the text " wraps Include:
For the word of each text, determined and the phonetic of the word of the text and each order based on the similar table of the phonetic The pinyin similarity of word phonetic in word;
Size sequence is carried out to the pinyin similarity, to determine the pinyin similarity highest with the phonetic of the word in the text Order word in word phonetic.
3. the method as described in claim 1, it is characterised in that described " text to be determined based on similarity described in each highest Sheet and the Word similarity of the order word text " includes:
Obtain the number of words of the word of similarity described in similarity described in each highest and each highest;
Word length band weight average is determined based on similarity described in the number of words, the number of words of the text, each highest;
The word length is arranged to the Word similarity of the text and the order word text with weight average.
4. the method as described in claim 1, it is characterised in that described " text to be determined based on each highest pinyin similarity Sheet and the pinyin similarity of the order word text " includes:
Determine the average value of each highest pinyin similarity;
The average value is defined as to the pinyin similarity of the text and the order word text.
5. the method as described in claim 1, it is characterised in that described " based on the Word similarity and the pinyin similarity Determine the similarity of the text and the order word text " include:
Summation is weighted to the Word similarity and the pinyin similarity, to obtain the text and the order word text Similarity;Wherein, the weighted sum is completed by following equation;
Sim_x_y=0.6*sim_w_x_y+0.4*pinyin_sim_x_y;Wherein, the sim_x_y is the text and institute State the similarity of order word text;The sim_w_x_y is the Word similarity;The pinyin_sim_x_y is the order The similarity of word text.
A kind of 6. equipment of speech recognition text and order word text matches, it is characterised in that applied to the scene of man-machine interaction, The equipment includes:
Acquisition module, for obtaining the text obtained by speech recognition;
Cutting word module, cutting word is carried out to the text, to generate multiple text words;And default order word text is cut Word, to generate order word;
First generation module, for determination and each text Word similarity highest order word, and the first correspondence is generated with this Relation;Wherein, in first corresponding relation, identified order word only corresponds to a text word;
Word similarity determining module, for determining the text and the order word text based on similarity described in each highest Word similarity;
Phonetic determining module, for determining the phonetic of the text and each word in the order word text;
Second generation module, in order word described in the phonetic similarity degree highest of each word in determination and the text The phonetic of word, and the second corresponding relation is generated with this;Wherein, in second corresponding relation, each word in the text Phonetic only corresponds to the phonetic of the word in an order word;
Pinyin similarity determining module, for determining the text and the order word text based on each highest pinyin similarity Pinyin similarity;
Processing module, for determining the text and the order word text based on the Word similarity and the pinyin similarity Similarity.
7. equipment as claimed in claim 6, it is characterised in that also include:Pretreatment module, exceed necessarily for obtaining quantity The text data of value;
First similarity of word and word in the text data is determined based on word2vec;
Second similarity of word and word in the text data is determined based on hownet;
Based on first similarity and second similarity, the similarity between any two word in the text data is obtained And build the similar vocabulary of near synonym;
Determine all phonetics of all words in the text data;
Collect the Pinyin information that pronunciation meets default condition of similarity;
Similarity between two phonetic is determined based on acquired phonetic and Pinyin information and builds the bag text data In all word phonetics the similar table of phonetic;
The cutting word is carried out based on the similarity between any two word;
First generation module " it is determined that with each text Word similarity highest order word " includes:
For each text word, the similarity of the text word and each order word is determined based on the similar vocabulary of near synonym;
Size sequence is carried out to the similarity, to determine and the text Word similarity highest order word;
Second generation module " it is determined that with order word described in the phonetic similarity degree highest of each word in the text The phonetic of word " includes:
For the word of each text, determined and the phonetic of the word of the text and each order based on the similar table of the phonetic The pinyin similarity of word phonetic in word;
Size sequence is carried out to the pinyin similarity, to determine the pinyin similarity highest with the phonetic of the word in the text Order word in word phonetic.
8. equipment as claimed in claim 6, it is characterised in that the Word similarity determining module, be used for:
Obtain the number of words of the word of similarity described in similarity described in each highest and each highest;
Word length band weight average is determined based on similarity described in the number of words, the number of words of the text, each highest;
The word length is arranged to the Word similarity of the text and the order word text with weight average.
9. equipment as claimed in claim 6, it is characterised in that the pinyin similarity determining module, be used for:
Determine the average value of each highest pinyin similarity;
The average value is defined as to the pinyin similarity of the text and the order word text.
10. equipment as claimed in claim 6, it is characterised in that the processing module, be used for:
Summation is weighted to the Word similarity and the pinyin similarity, to obtain the text and the order word text Similarity;Wherein, the weighted sum is completed by following equation;
Sim_x_y=0.6*sim_w_x_y+0.4*pinyin_sim_x_y;Wherein, the sim_x_y is the text and institute State the similarity of order word text;The sim_w_x_y is the Word similarity;The pinyin_sim_x_y is the order The similarity of word text.
CN201710849743.XA 2017-09-20 2017-09-20 Method and equipment for matching voice recognition text with command word text Active CN107564528B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710849743.XA CN107564528B (en) 2017-09-20 2017-09-20 Method and equipment for matching voice recognition text with command word text

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710849743.XA CN107564528B (en) 2017-09-20 2017-09-20 Method and equipment for matching voice recognition text with command word text

Publications (2)

Publication Number Publication Date
CN107564528A true CN107564528A (en) 2018-01-09
CN107564528B CN107564528B (en) 2020-12-15

Family

ID=60981694

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710849743.XA Active CN107564528B (en) 2017-09-20 2017-09-20 Method and equipment for matching voice recognition text with command word text

Country Status (1)

Country Link
CN (1) CN107564528B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109684643A (en) * 2018-12-26 2019-04-26 湖北亿咖通科技有限公司 Text recognition method, electronic equipment and computer-readable medium based on sentence vector
CN109710929A (en) * 2018-12-18 2019-05-03 金蝶软件(中国)有限公司 A kind of bearing calibration, device, computer equipment and the storage medium of speech recognition text
CN110503958A (en) * 2019-08-30 2019-11-26 厦门快商通科技股份有限公司 Audio recognition method, system, mobile terminal and storage medium
CN111524504A (en) * 2020-05-11 2020-08-11 中国商用飞机有限责任公司北京民用飞机技术研究中心 Airborne voice control method and device
CN112185371A (en) * 2019-07-05 2021-01-05 百度在线网络技术(北京)有限公司 Voice interaction method, device, equipment and computer storage medium
CN112259100A (en) * 2020-09-15 2021-01-22 科大讯飞华南人工智能研究院(广州)有限公司 Speech recognition method, training method of related model, related equipment and device
CN116798417A (en) * 2023-07-31 2023-09-22 成都赛力斯科技有限公司 Voice intention recognition method, device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090240498A1 (en) * 2008-03-19 2009-09-24 Microsoft Corporation Similiarity measures for short segments of text
EP2362333A1 (en) * 2010-02-19 2011-08-31 Accenture Global Services Limited System for requirement identification and analysis based on capability model structure
CN102184169A (en) * 2011-04-20 2011-09-14 北京百度网讯科技有限公司 Method, device and equipment used for determining similarity information among character string information
CN106095865A (en) * 2016-06-03 2016-11-09 中细软移动互联科技有限公司 A kind of trade mark text similarity reviewing method
CN106598939A (en) * 2016-10-21 2017-04-26 北京三快在线科技有限公司 Method and device for text error correction, server and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090240498A1 (en) * 2008-03-19 2009-09-24 Microsoft Corporation Similiarity measures for short segments of text
EP2362333A1 (en) * 2010-02-19 2011-08-31 Accenture Global Services Limited System for requirement identification and analysis based on capability model structure
CN102184169A (en) * 2011-04-20 2011-09-14 北京百度网讯科技有限公司 Method, device and equipment used for determining similarity information among character string information
CN106095865A (en) * 2016-06-03 2016-11-09 中细软移动互联科技有限公司 A kind of trade mark text similarity reviewing method
CN106598939A (en) * 2016-10-21 2017-04-26 北京三快在线科技有限公司 Method and device for text error correction, server and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
李红莲等: "一种文本相似度及其在语音识别中的应用", 《中文信息学报》 *
石安磊: "基于文本相似度评分的中医案例分析系统研究与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109710929A (en) * 2018-12-18 2019-05-03 金蝶软件(中国)有限公司 A kind of bearing calibration, device, computer equipment and the storage medium of speech recognition text
CN109684643A (en) * 2018-12-26 2019-04-26 湖北亿咖通科技有限公司 Text recognition method, electronic equipment and computer-readable medium based on sentence vector
CN109684643B (en) * 2018-12-26 2021-03-12 湖北亿咖通科技有限公司 Sentence vector-based text recognition method, electronic device and computer-readable medium
CN112185371A (en) * 2019-07-05 2021-01-05 百度在线网络技术(北京)有限公司 Voice interaction method, device, equipment and computer storage medium
CN110503958A (en) * 2019-08-30 2019-11-26 厦门快商通科技股份有限公司 Audio recognition method, system, mobile terminal and storage medium
CN111524504A (en) * 2020-05-11 2020-08-11 中国商用飞机有限责任公司北京民用飞机技术研究中心 Airborne voice control method and device
CN112259100A (en) * 2020-09-15 2021-01-22 科大讯飞华南人工智能研究院(广州)有限公司 Speech recognition method, training method of related model, related equipment and device
CN112259100B (en) * 2020-09-15 2024-04-09 科大讯飞华南人工智能研究院(广州)有限公司 Speech recognition method, training method of related model, related equipment and device
CN116798417A (en) * 2023-07-31 2023-09-22 成都赛力斯科技有限公司 Voice intention recognition method, device, electronic equipment and storage medium
CN116798417B (en) * 2023-07-31 2023-11-10 成都赛力斯科技有限公司 Voice intention recognition method, device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN107564528B (en) 2020-12-15

Similar Documents

Publication Publication Date Title
CN107564528A (en) A kind of speech recognition text and the method and apparatus of order word text matches
US10665226B2 (en) System and method for data-driven socially customized models for language generation
CN109146610B (en) Intelligent insurance recommendation method and device and intelligent insurance robot equipment
EP3529717B1 (en) Multi domain real-time question answering system
KR102668530B1 (en) Speech recognition methods, devices and devices, and storage media
CN111241237B (en) Intelligent question-answer data processing method and device based on operation and maintenance service
CN111310440B (en) Text error correction method, device and system
CN109637537B (en) Method for automatically acquiring annotated data to optimize user-defined awakening model
KR101581816B1 (en) Voice recognition method using machine learning
WO2017127296A1 (en) Analyzing textual data
CN110276071B (en) Text matching method and device, computer equipment and storage medium
CN108304445A (en) A kind of text snippet generation method and device
TWI734085B (en) Dialogue system using intention detection ensemble learning and method thereof
CN110717021B (en) Input text acquisition and related device in artificial intelligence interview
CN112364132A (en) Similarity calculation model and system based on dependency syntax and method for building system
CN113761887A (en) Matching method and device based on text processing, computer equipment and storage medium
Saunders et al. Towards using prosody to scaffold lexical meaning in robots
CN116955559A (en) Question-answer matching method and device, electronic equipment and storage medium
Asadi et al. Real-Time Presentation Tracking Using Semantic Keyword Spotting.
CN112084766A (en) Text processing method and device, storage medium and processor
CN111966808B (en) Dialogue question-answering method, device, equipment and storage medium combining RPA and AI
CN115062127A (en) Self-learning intelligent interaction system and method based on natural language processing
Sicilia et al. ISABEL: An Inclusive and Collaborative Task-Oriented Dialogue System
CN113724738A (en) Voice processing method, decision tree model training method, device, equipment and storage medium
Lin et al. Domain Independent Key Term Extraction from Spoken Content Based on Context and Term Location Information in the Utterances

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20180126

Address after: Nanshan District Guangdong streets in Shenzhen city of Guangdong province 518000 Xuefu Road No. 2388 Yi financial science and technology building 15 building 05 unit

Applicant after: Guangdong Hui He science and Technology Development Co., Ltd.

Address before: 518000 Guangdong city of Shenzhen province Nanshan District Guangdong streets mangocity.com building 601 units

Applicant before: Shenzhen city artificial intelligence technology Co., secluded orchid in a deserted Valley

GR01 Patent grant
GR01 Patent grant