
CN111883134A - Voice input method and device, electronic equipment and storage medium - Google Patents

Info

Publication number
CN111883134A
Authority
CN
China
Prior art keywords
voice
character string
information
recognized
matching
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010723238.2A
Other languages
Chinese (zh)
Other versions
CN111883134B (en)
Inventor
郭毓伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Fotoable Technology Ltd
Original Assignee
Beijing Fotoable Technology Ltd
Application filed by Beijing Fotoable Technology Ltd
Priority to CN202010723238.2A
Publication of CN111883134A
Application granted
Publication of CN111883134B
Legal status: Active

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02 Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023 Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233 Character input methods
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Telephonic Communication Services (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The application provides a voice input method, a voice input device, electronic equipment and a storage medium, applied to voice input in a voice input system. The method comprises the following steps: acquiring voice information to be recognized, wherein the voice information to be recognized comprises at least one character string information; inputting the voice information to be recognized into a preset voice recognition model for voice recognition to obtain a character string text corresponding to the voice information to be recognized, wherein the algorithm used to construct the preset voice recognition model comprises a convolutional neural network and a long short-term memory network; matching the character string text with character strings in a preset database; and if the matching is successful, inputting the character string text into the voice input system to realize voice input. In a mobile game, the application performs voice recognition on the player's spoken input and checks the recognized text against the answers, replacing manual entry of answers so that the player can complete game levels by voice. This helps the player play, improves the user experience and meets higher user requirements.

Description

Voice input method and device, electronic equipment and storage medium
Technical Field
The present application relates to the field of speech recognition technologies, and in particular, to a speech input method and apparatus, an electronic device, and a storage medium.
Background
With the progress of data processing technology and the rapid spread of the mobile internet, computer technology is widely applied across society, and speech recognition technology is expected to enter fields such as industry, household appliances, communication, automotive electronics, medical treatment, home services and consumer electronics. One example is the currently popular mobile game market: in widely played word games, a player working through the levels must enter the corresponding answer in each level to complete it.
However, when answers are entered manually, the player needs both hands to type them. A level cannot be completed when the player's hands are occupied or when the player does not know how to spell a word, which degrades the user experience and fails to meet higher user requirements.
Disclosure of Invention
The application provides a voice input method, a voice input device, electronic equipment and a storage medium, which are used for improving the user experience of game players and meeting higher user requirements.
A voice input method is applied to voice input of a voice input system, and comprises the following steps:
acquiring voice information to be recognized, wherein the voice information to be recognized comprises at least one character string information;
inputting the voice information to be recognized into a preset voice recognition model for voice recognition to obtain a character string text corresponding to the voice information to be recognized, wherein an algorithm for constructing the preset voice recognition model comprises a convolutional neural network and a long-short term memory network;
matching the character string text with character strings in a preset database;
and if the matching is successful, inputting the character string text into the voice input system to realize voice input.
Further, the method also comprises the following steps:
and if the matching is unsuccessful, sending message reminding information corresponding to the unsuccessful matching to the voice input system, and deleting the character string text.
Further, after inputting the character string text into the voice input system, the method further comprises:
and sending message reminding information corresponding to successful matching to the voice input system, and deleting the character string text.
Further, before acquiring the voice information to be recognized, the method further includes: selecting position information corresponding to a character string text to be input;
inputting the character string text into the voice input system, specifically:
and inputting the character string text to a position corresponding to the position information corresponding to the character string text to be input.
Further, the process of constructing the preset speech recognition model specifically includes:
acquiring voice information sample data, wherein the voice information sample data comprises voice information of various character string information;
and training a voice recognition model by adopting a convolutional neural network algorithm and a long-short term memory network algorithm based on the voice information sample data to obtain the preset voice recognition model.
A voice input device applied to voice input of a voice input system comprises:
an acquisition unit, configured to acquire voice information to be recognized, wherein the voice information to be recognized comprises at least one character string information;
the recognition unit is used for inputting the voice information to be recognized into a preset voice recognition model for voice recognition so as to obtain a character string text corresponding to the voice information to be recognized, wherein an algorithm for constructing the preset voice recognition model comprises a convolutional neural network and a long-short term memory network;
the matching unit is used for matching the character string text with character strings in a preset database;
and the input unit is used for inputting the character string text into the voice input system to realize voice input if the matching is successful.
Further, the device also comprises:
a first sending unit, configured to send message reminding information corresponding to unsuccessful matching to the voice input system and delete the character string text if the matching is unsuccessful.
Further, the device also comprises:
a second sending unit, configured to send message reminding information corresponding to successful matching to the voice input system and delete the character string text.
An electronic device, comprising:
a processor; and
a memory having executable code stored thereon, which when executed by the processor, causes the processor to perform the speech input method as described above.
A non-transitory machine-readable storage medium having stored thereon executable code, which when executed by a processor of an electronic device, causes the processor to perform a speech input method as described above.
Compared with the prior art, the voice input method, the voice input device, the electronic equipment and the storage medium provided by the application are applied to voice input in a voice input system. The method comprises the following steps: acquiring voice information to be recognized, wherein the voice information to be recognized comprises at least one character string information; inputting the voice information to be recognized into a preset voice recognition model for voice recognition to obtain a character string text corresponding to the voice information to be recognized, wherein the algorithm used to construct the preset voice recognition model comprises a convolutional neural network and a long short-term memory network; matching the character string text with character strings in a preset database; and if the matching is successful, inputting the character string text into the voice input system to realize voice input. In a mobile game, the application performs voice recognition on the player's spoken input and checks the recognized text against the answers, replacing manual entry of answers so that the player can complete game levels by voice. This helps the player play, improves the user experience and meets higher user requirements.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present application, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic structural diagram of a voice input system according to an embodiment of the present application;
Fig. 2 is a schematic flow chart of a voice input method according to an embodiment of the present application;
Figs. 3 to 8 are schematic display diagrams of a game interface provided in an embodiment of the present application in various states;
Fig. 9 is a schematic structural diagram of a voice input device according to an embodiment of the present application;
Fig. 10 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
Detailed Description
The present application provides a voice input method, apparatus, electronic device and storage medium, which are applied to the voice input system shown in fig. 1 (for example, widely played word games in English or Chinese, such as filling in words, idioms or xiehouyu, i.e. two-part allegorical sayings). The voice input system includes a voice input device 10, a voice recognition device 20 and a voice output device 30. The voice input device 10 receives voice information input by a user; the voice recognition device 20 performs voice recognition on that voice information to obtain a voice recognition result; and the voice recognition result is sent to the word game page through the voice output device 30, so that the game level is completed.
The problem addressed by the present application is how to improve the user experience of game players and meet higher user requirements.
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
Referring to fig. 2, a schematic flow chart of a voice input method according to an embodiment of the present application is shown. As shown in fig. 2, a voice input method provided in the embodiment of the present application is applied to voice input of a voice input system, and specifically includes the following steps:
S200: Select the position information corresponding to the character string text to be input.
In practical applications, such as word-filling games on mobile devices, the game player first selects the position information corresponding to the character string text to be input. The position information identifies where on the game interface the character string will be entered, as shown in fig. 3.
S201: Acquire voice information to be recognized, where the voice information to be recognized includes at least one character string information.
In practical applications, such as word-filling games on mobile devices, the game player enters the voice input mode through a mode switch key on the game interface; as shown in fig. 3, the mode switch key is located at the lower left corner. After the game player taps the mode switch key, the game interface appears as shown in fig. 4, which includes a button for returning to the keyboard interface, a microphone button with its prompt text, and a countdown display (60 s). In the embodiment of the present application, when the game player presses the microphone button in fig. 4, a voice input prompt tone is played and a microphone volume animation is shown. Further, when the game player releases the microphone button, a release prompt tone may be played.
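The press-and-release behaviour of the microphone button can be sketched as a small state holder. This is only an illustrative sketch: the class name `PushToTalk`, the event names, and the assumption that the 60 s countdown starts when voice mode is entered are not taken from the source.

```python
from dataclasses import dataclass, field
from typing import List, Optional

COUNTDOWN_SECONDS = 60  # countdown shown on the voice-mode interface (fig. 4)


@dataclass
class PushToTalk:
    """Hypothetical controller; times are seconds since voice mode was entered."""
    events: List[str] = field(default_factory=list)
    pressed_at: Optional[float] = None

    def remaining(self, now: float) -> float:
        """Seconds left on the countdown (assumed to start at time 0)."""
        return max(0.0, COUNTDOWN_SECONDS - now)

    def press(self, now: float) -> bool:
        """Start recording; refused if time is up or the button is already held."""
        if self.remaining(now) == 0.0 or self.pressed_at is not None:
            return False
        self.pressed_at = now
        self.events += ["play_press_tone", "start_volume_animation"]
        return True

    def release(self, now: float) -> Optional[float]:
        """Stop recording and return how long the button was held, if it was."""
        if self.pressed_at is None:
            return None
        duration = now - self.pressed_at
        self.pressed_at = None
        self.events += ["stop_volume_animation", "play_release_tone"]
        return duration
```

A caller would pass the audio captured between `press` and `release` on to the recognition step.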
It should be noted that the voice information to be recognized includes at least one character string information; that is, a single utterance may contain several character strings, and the game player may speak whichever words their own vocabulary suggests.
S202: Input the voice information to be recognized into a preset voice recognition model for voice recognition, so as to obtain a character string text corresponding to the voice information to be recognized, where the algorithm used to construct the preset voice recognition model includes a convolutional neural network and a long short-term memory network.
It should be noted that speech recognition technology, also known as Automatic Speech Recognition (ASR), aims to convert the vocabulary content in human speech into computer-readable input, such as keystrokes, binary codes or character sequences. In the embodiment of the application, the speech recognition technology belongs to technologies known to those skilled in the art, and therefore, details of the speech recognition are not described again, and specific contents may refer to related technologies.
In an embodiment of the present application, the process of constructing the preset speech recognition model specifically includes the following steps:
Acquire voice information sample data, where the voice information sample data includes voice information for various character string information.
Based on the voice information sample data, train a voice recognition model using a convolutional neural network algorithm and a long short-term memory network algorithm to obtain the preset voice recognition model.
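The source does not specify how the two networks are combined. One common arrangement, given here purely as an assumption, applies a one-dimensional convolution to the acoustic feature frames $x_t$ and feeds the result $c_t$ into a standard LSTM cell with hidden state $h_t$ and cell state $s_t$. All symbols below are illustrative and do not appear in the original.

```latex
\begin{aligned}
c_t &= \mathrm{ReLU}\Big(\sum_{k=0}^{K-1} W_k\, x_{t+k} + b\Big) \\
i_t &= \sigma(W_i c_t + U_i h_{t-1} + b_i) \\
f_t &= \sigma(W_f c_t + U_f h_{t-1} + b_f) \\
o_t &= \sigma(W_o c_t + U_o h_{t-1} + b_o) \\
\tilde{s}_t &= \tanh(W_s c_t + U_s h_{t-1} + b_s) \\
s_t &= f_t \odot s_{t-1} + i_t \odot \tilde{s}_t \\
h_t &= o_t \odot \tanh(s_t) \\
p(y_t \mid x) &= \mathrm{softmax}(W_y h_t + b_y)
\end{aligned}
```

Here $K$ is the convolution width, $\sigma$ is the logistic sigmoid, $\odot$ is element-wise multiplication, and $y_t$ ranges over the output symbols (for example characters); training would fit the weights to the voice information sample data.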
In the embodiment of the application, because the recognition efficiency of voice recognition technology keeps improving, a game player can complete game levels by voice: the voice recognition system recognizes the player's spoken input and enters it on the player's behalf. Here the technology is applied mainly to mobile games, where levels are completed through voice input.
S203: Match the character string text against the character strings in a preset database.
It should be noted that the character strings corresponding to the correct answers in the word game are pre-stored in the preset database. After voice recognition is performed on the voice information input by the game player, the resulting character string text is matched against the character strings in the preset database to determine whether the recognized character string is a correct answer.
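The matching step can be sketched as a normalised set lookup. This is a minimal sketch under stated assumptions: the function names are hypothetical, the preset database is modelled as an in-memory set, and the case and width folding is an added convenience that the source does not describe.

```python
import unicodedata
from typing import Iterable, List, Set


def normalize(text: str) -> str:
    """Fold compatibility forms and case, and trim, so "Apple " equals "apple"."""
    return unicodedata.normalize("NFKC", text).casefold().strip()


def match_candidates(recognized: Iterable[str], answers: Set[str]) -> List[str]:
    """Return the recognized strings (normalised) that are stored answers, in spoken order."""
    normalized_answers = {normalize(a) for a in answers}
    return [n for n in map(normalize, recognized) if n in normalized_answers]
```

An empty result corresponds to an unsuccessful match; any non-empty result means at least one spoken string is a correct answer.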
S204: Determine whether the matching is successful. If so, execute step S205; otherwise, execute step S207.
S205: Input the character string text at the position corresponding to the position information selected for the character string text to be input, so as to realize voice input.
S206: Send message reminding information corresponding to successful matching to the voice input system, and delete the character string text.
S207: Send message reminding information corresponding to unsuccessful matching to the voice input system, and delete the character string text.
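Steps S200 to S207 can be sketched end to end as follows. The recognizer is passed in as a callable so that the sketch runs without a trained model; the function name, the `board` dictionary standing in for the game interface, and the notification strings are all hypothetical.

```python
from typing import Callable, Dict, List, Set, Tuple

Position = Tuple[int, int]


def voice_input_flow(
    position: Position,                 # S200: slot chosen by the player beforehand
    audio: bytes,                       # S201: voice information to be recognized
    recognize: Callable[[bytes], str],  # S202: stand-in for the preset model
    answers: Set[str],                  # preset database of correct answers
    board: Dict[Position, str],         # stand-in for the game interface
    notifications: List[str],           # messages sent to the voice input system
) -> bool:
    text = recognize(audio)             # S202: obtain the character string text
    matched = text in answers           # S203 + S204: match and branch
    if matched:
        board[position] = text                  # S205: enter text at the position
        notifications.append("match_success")   # S206: success reminder
    else:
        notifications.append("match_failure")   # S207: failure reminder
    del text                            # S206/S207: discard the recognized text
    return matched
```

In a real system `recognize` would wrap the trained model, and the notifications would trigger the correct-answer or incorrect-answer animation.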
In the embodiment of the application, the recognized character strings are matched one by one against the character strings in the preset database. When a successfully matched character string exists, the character string text is input at the position corresponding to the selected position information, so as to realize voice input. Fig. 5 shows the process from the game player pressing the microphone button, through voice recognition, to the word being entered successfully.
It should be noted that, after the game player speaks a word, if the matching is successful, the character string text is input at the selected position to realize voice input, as shown in fig. 5. After the input succeeds, message reminding information corresponding to successful matching is sent to the voice input system; that is, when the game player submits a correct answer, a prompt message for the "correct answer" animation is sent to the voice input system and the recognized character string text is deleted. If the matching is unsuccessful, message reminding information corresponding to unsuccessful matching is sent to the voice input system; that is, when the submitted answer is incorrect, a prompt message for the "incorrect answer" animation is sent to the voice input system and the character string text is deleted.
Furthermore, a prompt tone may be played when the game player answers correctly, and after a correct answer, a correct-answer animation is shown when the button is released.
To further illustrate the application scenario of the embodiment, a Party mode can be provided. In Party mode, the question is switched automatically and voice input starts automatically; when the game player answers correctly, the voice prompt "CORRECT" is played; when voice input times out, a countdown prompt tone is played, or the next round of voice input starts directly. In addition, when Party-mode speech recognition is tapped or the next question starts automatically, the question is read aloud to the user: X letters, … … (query).
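The Party mode round structure can be sketched as a loop over questions. This is an illustrative sketch only: the function name and event strings are hypothetical, a timeout is modelled as `None`, and the handling of a wrong answer is an assumption, since the source does not describe it for this mode.

```python
from typing import List, Optional, Sequence, Tuple


def run_party_mode(
    questions: Sequence[Tuple[str, str]],   # (spoken prompt, correct answer)
    spoken: Sequence[Optional[str]],        # recognized text per round; None = timeout
) -> List[str]:
    """Return the sequence of UI/audio events for the given rounds."""
    events: List[str] = []
    for (prompt, answer), heard in zip(questions, spoken):
        events.append(f"ask:{prompt}")       # question is switched automatically
        events.append("start_listening")     # voice input starts automatically
        if heard is None:
            events.append("play_countdown_tone")  # voice input timed out
        elif heard.strip().casefold() == answer.casefold():
            events.append("say:CORRECT")          # correct answer
        else:
            events.append("wrong_answer")         # assumption: not specified in the source
    return events
```

Each iteration moves straight on to the next question, which mirrors the automatic switching described above.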
It should be noted that, in the embodiment of the present application, the game interface in the no-network state is displayed as shown in fig. 6. Whenever a network connection failure is detected, the original microphone interface is replaced with the no-network prompt interface shown in fig. 6.
Regarding the game version, in the Beta version voice recognition cannot continue once the limit on the number of uses is reached, and the game interface is then displayed as shown in fig. 7.
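The choice between the microphone interface, the network-failure prompt of fig. 6 and the Beta usage-limit screen of fig. 7 can be sketched as a single selection function. The names `Panel` and `select_panel`, and the decision to check the network before the Beta limit, are assumptions made for illustration.

```python
from enum import Enum


class Panel(Enum):
    MICROPHONE = "microphone"   # normal voice-input interface (fig. 4)
    NO_NETWORK = "no_network"   # shown when the network link fails (fig. 6)
    BETA_LIMIT = "beta_limit"   # shown when Beta usage is exhausted (fig. 7)


def select_panel(network_ok: bool, is_beta: bool, uses_left: int) -> Panel:
    """Pick the interface to display; network failure is checked first (assumption)."""
    if not network_ok:
        return Panel.NO_NETWORK
    if is_beta and uses_left <= 0:
        return Panel.BETA_LIMIT
    return Panel.MICROPHONE
```

Calling `select_panel` whenever the connection state or the remaining use count changes would give the "at any time" replacement behaviour described above.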
It should further be noted that the game includes a novice guide flow. For a new user, the guide starts at level 2; for an existing user who has already passed level 2, the guide starts after level 6. Before the guide starts, the game player cannot see the entrance to the voice mode. Each user is shown the novice guide only once, and the number of guides can be configured locally as required. As shown in fig. 8, when the player taps speech recognition for the first time, two permission requests pop up; if either permission request fails, picture C appears, accompanied by a diagram showing the flow for enabling the permission. If the player does not enable the permission, tapping anywhere ends the guide.
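The guide rules can be sketched as a small predicate. This follows one reading of the passage above and is an assumption throughout: new users are guided from level 2, existing users once they are past level 6, and the locally configurable guide count defaults to one. `GuideConfig`, `should_start_guide` and `voice_entry_visible` are hypothetical names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GuideConfig:
    new_user_level: int = 2   # new users: guide starts at this level
    old_user_level: int = 6   # existing users: guide starts after this level
    max_guides: int = 1       # locally configurable number of guides per user


def should_start_guide(is_new_user: bool, level: int, guides_shown: int,
                       cfg: GuideConfig = GuideConfig()) -> bool:
    """Decide whether the novice guide should start at the given level."""
    if guides_shown >= cfg.max_guides:
        return False
    if is_new_user:
        return level >= cfg.new_user_level
    return level > cfg.old_user_level


def voice_entry_visible(guides_shown: int) -> bool:
    """The voice-mode entrance stays hidden until the guide has started."""
    return guides_shown > 0
```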
In the embodiment of the present application, the voice recognition data need to be logged in a timely manner through "dotting" (the recording of tracking events), which mainly includes BQ dotting and BI dotting. BQ dotting uses a newly created voice input dotting table, with a record written on each successful detection, as shown in the table below. For BI dotting, one column is added to each of the LevelPass and FlashCrazePass tables so that the number of questions answered using voice input can be compared.
BQ dotting table
[Table provided as image BDA0002600769240000081 in the original publication; its contents are not reproduced here.]
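Because the table itself is not available, the following sketch only illustrates the general shape of such dotting (event logging). Every field name in `VoiceInputEvent` is hypothetical, and the per-level counter merely stands in for the extra column described for the LevelPass table.

```python
from collections import Counter
from dataclasses import asdict, dataclass
from typing import Dict, List


@dataclass
class VoiceInputEvent:
    """One row of a hypothetical voice-input table; field names are assumptions."""
    user_id: str
    level: int
    recognized_text: str
    matched: bool


class Tracker:
    def __init__(self) -> None:
        self.rows: List[Dict[str, object]] = []             # stand-in for the BQ table
        self.voice_answers_per_level: Counter = Counter()   # stand-in for the added BI column

    def log_detection(self, event: VoiceInputEvent) -> None:
        """Record one detection; count it per level when the answer matched."""
        self.rows.append(asdict(event))
        if event.matched:
            self.voice_answers_per_level[event.level] += 1
```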
The voice input method provided by the embodiment of the application is applied to voice input in a voice input system and comprises the following steps: acquiring voice information to be recognized, wherein the voice information to be recognized comprises at least one character string information; inputting the voice information to be recognized into a preset voice recognition model for voice recognition to obtain a character string text corresponding to the voice information to be recognized, wherein the algorithm used to construct the preset voice recognition model comprises a convolutional neural network and a long short-term memory network; matching the character string text with character strings in a preset database; and if the matching is successful, inputting the character string text into the voice input system to realize voice input. In a mobile game, the embodiment performs voice recognition on the player's spoken input and checks the recognized text against the answers, replacing manual entry of answers so that the player can complete game levels by voice. This helps the player play, improves the user experience and meets higher user requirements.
Referring to fig. 9, based on the voice input method disclosed in the foregoing embodiment, the present embodiment correspondingly discloses a voice input device, which is applied to voice input of a voice input system, and the device specifically includes:
An acquisition unit 901, configured to acquire voice information to be recognized, where the voice information to be recognized includes at least one character string information.
A recognition unit 902, configured to input the voice information to be recognized into a preset voice recognition model for voice recognition, so as to obtain a character string text corresponding to the voice information to be recognized, where the algorithm used to construct the preset voice recognition model includes a convolutional neural network and a long short-term memory network.
A matching unit 903, configured to match the character string text with the character strings in a preset database.
An input unit 904, configured to input the character string text into the voice input system if the matching is successful, so as to implement voice input.
A first sending unit 905, configured to send message reminding information corresponding to unsuccessful matching to the voice input system and delete the character string text if the matching is unsuccessful.
A second sending unit 906, configured to send message reminding information corresponding to successful matching to the voice input system and delete the character string text.
The device comprises a processor and a memory. The acquisition unit, the recognition unit, the matching unit, the first sending unit, the second sending unit and the like are stored in the memory as program units, and the processor executes the program units stored in the memory to realize the corresponding functions.
The processor comprises a kernel, and the kernel calls the corresponding program unit from the memory. One or more kernels may be provided. By performing voice recognition in a mobile game and checking the recognized voice input against the answers, manual entry of answers by the player is replaced, so that game levels are completed, the player is helped to play, the user experience is improved and higher user requirements are met.
An embodiment of the present invention provides a storage medium on which a program is stored, the program implementing the voice input method when executed by a processor.
The embodiment of the invention provides a processor, which is used for running a program, wherein the voice input method is executed when the program runs.
An embodiment of the present invention provides an electronic device, as shown in fig. 10, the electronic device 100 includes at least one processor 1001, and at least one memory 1002 and a bus 1003 connected to the processor; the processor 1001 and the memory 1002 complete communication with each other through the bus 1003; the processor 1001 is used for calling the program instructions in the memory 1002 to execute the voice input method described above.
The electronic device herein may be a server, a PC, a PAD, a mobile phone, etc.
The present application further provides a computer program product which, when executed on a data processing device, is adapted to run a program initialized with the following method steps:
acquiring voice information to be recognized, wherein the voice information to be recognized comprises at least one character string information;
inputting the voice information to be recognized into a preset voice recognition model for voice recognition to obtain a character string text corresponding to the voice information to be recognized, wherein an algorithm for constructing the preset voice recognition model comprises a convolutional neural network and a long-short term memory network;
matching the character string text with character strings in a preset database;
and if the matching is successful, inputting the character string text into the voice input system to realize voice input.
Preferably, the method further comprises the following steps:
and if the matching is unsuccessful, sending message reminding information corresponding to the unsuccessful matching to the voice input system, and deleting the character string text.
Preferably, after the character string text is input to the voice input system, the method further includes:
and sending message reminding information corresponding to successful matching to the voice input system, and deleting the character string text.
Preferably, before acquiring the voice information to be recognized, the method further includes: selecting position information corresponding to a character string text to be input;
inputting the character string text into the voice input system, specifically:
and inputting the character string text to a position corresponding to the position information corresponding to the character string text to be input.
Preferably, the process of constructing the preset speech recognition model specifically includes:
acquiring voice information sample data, wherein the voice information sample data comprises voice information of various character string information;
and training a voice recognition model by adopting a convolutional neural network algorithm and a long-short term memory network algorithm based on the voice information sample data to obtain the preset voice recognition model.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a device includes one or more processors (CPUs), memory, and a bus. The device may also include input/output interfaces, network interfaces, and the like.
The memory may include volatile memory in a computer-readable medium, such as Random Access Memory (RAM), and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM), and the memory includes at least one memory chip. The memory is an example of a computer-readable medium.
Computer-readable media include both permanent and non-permanent, removable and non-removable media, and may implement information storage by any method or technology. The information may be computer-readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape or magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media, such as modulated data signals and carrier waves.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

Claims (10)

1. A speech input method applied to speech input of a speech input system, the method comprising:
acquiring voice information to be recognized, wherein the voice information to be recognized comprises at least one character string information;
inputting the voice information to be recognized into a preset voice recognition model for voice recognition to obtain a character string text corresponding to the voice information to be recognized, wherein an algorithm for constructing the preset voice recognition model comprises a convolutional neural network and a long-short term memory network;
matching the character string text with character strings in a preset database;
and if the matching is successful, inputting the character string text into the voice input system to realize voice input.
2. The method of claim 1, further comprising:
and if the matching is unsuccessful, sending message reminding information corresponding to the unsuccessful matching to the voice input system, and deleting the character string text.
3. The method of claim 1, further comprising, after entering the string of characters into the speech input system:
and sending message reminding information corresponding to successful matching to the voice input system, and deleting the character string text.
4. The method according to any one of claims 1-3, wherein before the obtaining the speech to be recognized, further comprising: selecting position information corresponding to a text of a character string to be input;
inputting the character string text into the voice input system, specifically:
and inputting the character string text to a position corresponding to the position information corresponding to the character string text to be input.
5. The method according to claim 1, wherein the process of constructing the preset speech recognition model specifically comprises:
acquiring voice information sample data, wherein the voice information sample data comprises voice information of various character string information;
and training a voice recognition model by adopting a convolutional neural network algorithm and a long-short term memory network algorithm based on the voice information sample data to obtain the preset voice recognition model.
6. A speech input device for speech input in a speech input system, the device comprising:
the acquisition unit is used for acquiring voice information to be recognized, wherein the voice information to be recognized comprises at least one character string information;
the recognition unit is used for inputting the voice information to be recognized into a preset voice recognition model for voice recognition so as to obtain a character string text corresponding to the voice information to be recognized, wherein an algorithm for constructing the preset voice recognition model comprises a convolutional neural network and a long-short term memory network;
the matching unit is used for matching the character string text with character strings in a preset database;
and the input unit is used for inputting the character string text into the voice input system to realize voice input if the matching is successful.
7. The apparatus of claim 6, further comprising:
and the first sending unit is used for sending message reminding information corresponding to unsuccessful matching to the voice input system and deleting the character string text if the matching is unsuccessful.
8. The apparatus of claim 6, further comprising:
and the second sending unit is used for sending the message reminding information corresponding to the successful matching to the voice input system and deleting the character string text.
9. An electronic device, comprising:
a processor; and
a memory having executable code stored thereon, which when executed by the processor, causes the processor to perform the speech input method of any of claims 1-5.
10. A non-transitory machine-readable storage medium having stored thereon executable code, which when executed by a processor of an electronic device, causes the processor to perform the speech input method of any of claims 1-5.
CN202010723238.2A 2020-07-24 2020-07-24 Voice input method and device, electronic equipment and storage medium Active CN111883134B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010723238.2A CN111883134B (en) 2020-07-24 2020-07-24 Voice input method and device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010723238.2A CN111883134B (en) 2020-07-24 2020-07-24 Voice input method and device, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN111883134A (en) 2020-11-03
CN111883134B (en) 2024-06-04

Family

ID=73200500

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010723238.2A Active CN111883134B (en) 2020-07-24 2020-07-24 Voice input method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111883134B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114461170A (en) * 2022-01-27 2022-05-10 山东省城市商业银行合作联盟有限公司 Page reading method and system for mobile banking application program
CN114511366A (en) * 2020-11-16 2022-05-17 深圳Tcl新技术有限公司 Purchasing link obtaining method, terminal and computer storage medium

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002159741A (en) * 2000-11-24 2002-06-04 Namco Ltd Game device and information storage medium
US20020099539A1 (en) * 2000-12-28 2002-07-25 Manabu Nishizawa Method for outputting voice of object and device used therefor
KR20130086971A (en) * 2012-01-26 2013-08-05 진삼순 Question answering system using speech recognition and its application method thereof
CN103546790A (en) * 2013-09-18 2014-01-29 深圳市掌世界网络科技有限公司 Language interaction method and language interaction system on basis of mobile terminal and interactive television
CN103699530A (en) * 2012-09-27 2014-04-02 百度在线网络技术(北京)有限公司 Method and equipment for inputting texts in target application according to voice input information
CN106297784A (en) * 2016-08-05 2017-01-04 Intelligent terminal plays the method and system of quick voice responsive identification
CN107115668A (en) * 2017-04-25 2017-09-01 合肥泽诺信息科技有限公司 Online game online interaction system based on speech recognition
CN107785021A (en) * 2017-08-02 2018-03-09 上海壹账通金融科技有限公司 Pronunciation inputting method, device, computer equipment and medium
CN107952239A (en) * 2017-12-19 2018-04-24 佛山市幻云科技有限公司 A kind of numbers game system and method
CN109331470A (en) * 2018-08-21 2019-02-15 平安科技(深圳)有限公司 Quiz game processing method, device, equipment and medium based on speech recognition
JP2019063620A (en) * 2019-01-31 2019-04-25 パイオニア株式会社 Communication terminal, communication method, communication program, recording medium with communication program stored and information processing device
CN111001156A (en) * 2019-11-27 2020-04-14 南京创维信息技术研究院有限公司 Voice processing method and device applied to guessing idiom game
CN111429913A (en) * 2020-03-26 2020-07-17 厦门快商通科技股份有限公司 Digit string voice recognition method, identity verification device and computer readable storage medium

Also Published As

Publication number Publication date
CN111883134B (en) 2024-06-04

Similar Documents

Publication Publication Date Title
US10832686B2 (en) Method and apparatus for pushing information
CN109428719B (en) Identity verification method, device and equipment
CN108962233B (en) Voice conversation processing method and system for voice conversation platform
JP6394709B2 (en) SPEAKER IDENTIFYING DEVICE AND FEATURE REGISTRATION METHOD FOR REGISTERED SPEECH
US10777207B2 (en) Method and apparatus for verifying information
CN109979450B (en) Information processing method and device and electronic equipment
US5752231A (en) Method and system for performing speaker verification on a spoken utterance
CN106796787A (en) The linguistic context carried out using preceding dialog behavior in natural language processing is explained
CN110784768B (en) Multimedia resource playing method, storage medium and electronic equipment
CN109460503B (en) Answer input method, answer input device, storage medium and electronic equipment
CN111883134B (en) Voice input method and device, electronic equipment and storage medium
CN109062944A (en) New word consolidation method based on voice search and electronic equipment
CN113128228A (en) Voice instruction recognition method and device, electronic equipment and storage medium
CN108960875A (en) A kind of processing method of access information
CN110047473B (en) Man-machine cooperative interaction method and system
CN111414764A (en) Method and system for determining skill field of dialog text
CN115174748A (en) Voice call-out method, device, equipment and medium based on semantic recognition
CN105488370A (en) Children mode unlock method and device
CN113204626A (en) Interaction method, device, terminal and storage medium based on knowledge graph
CN109800550B (en) Dialogue exchange type account identity authentication method and device
CN109273004B (en) Predictive speech recognition method and device based on big data
CN113656566B (en) Intelligent dialogue processing method, intelligent dialogue processing device, computer equipment and storage medium
CN115114404A (en) Question and answer method and device for intelligent customer service, electronic equipment and computer storage medium
CN111785280B (en) Identity authentication method and device, storage medium and electronic equipment
CN114141250A (en) Lyric recognition method and device, electronic equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100192 rooms c402 and 403, 4 / F, building C, building B-6, Dongsheng Science Park, Zhongguancun, No. 66, xixiaokou Road, Haidian District, Beijing

Applicant after: Beijing beta Technology Co.,Ltd.

Address before: 100192 rooms c402 and 403, 4 / F, building C, building B-6, Dongsheng Science Park, Zhongguancun, No. 66, xixiaokou Road, Haidian District, Beijing

Applicant before: BEIJING FOTOABLE TECHNOLOGY LTD.

GR01 Patent grant