CN102404278A - Song requesting system based on voiceprint recognition and application method thereof - Google Patents

Song requesting system based on voiceprint recognition and application method thereof

Publication number
CN102404278A
Authority
CN
China
Prior art keywords
user
song
sound
voice signal
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102734656A
Other languages
Chinese (zh)
Inventor
袁斯华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shengle Information Technology Shanghai Co ltd
Original Assignee
Shengle Information Technology Shanghai Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2010-09-08
Filing date
2010-09-08
Publication date
2012-04-04
Application filed by Shengle Information Technology Shanghai Co ltd
Priority to CN2010102734656A
Publication of CN102404278A
Legal status: Pending

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a song requesting system based on voiceprint recognition, comprising a song database, a recording module, a voice signal processing module, a voiceprint feature extraction module, and a voiceprint feature comparison module. The invention also discloses an application method of the song requesting system, comprising the following steps: recording the voice of a user; preprocessing the recorded voice signal; extracting the user's voiceprint features contained in the voice signal and establishing a user voiceprint model; and performing similarity matching between the user voiceprint model and the voiceprint models of the songs in the database, and returning a list of matching songs. The song requesting system and its application method can improve the user's singing experience. When the user requests songs, the system automatically retrieves songs suited to the user's voice according to the user's voiceprint features and offers them for selection, which helps the user sing well.

Description

A song requesting system based on voiceprint recognition and an application method thereof
Technical field
The present invention relates to a song requesting system, in particular a song requesting system based on voiceprint recognition technology. The invention also relates to an application method of the song requesting system.
Background art
As people pursue a higher quality of life, their leisure time has become increasingly rich, and singing is a very popular form of entertainment. People often request songs they like, or songs that are currently popular, at home or in a karaoke hall. However, because individuals differ in vocal cord structure, pitch, and other respects, some songs are not suitable for the person who requests them. Current song requesting systems can only provide a generic song list and cannot select songs suited to a user according to the characteristics of that user's voice. If the requested song does not suit the user, the performance will be poor, which not only defeats the purpose of entertainment but also disappoints both the singer and the audience.
Summary of the invention
The technical problem to be solved by the present invention is to provide a song requesting system based on voiceprint recognition that can improve the user's singing experience.
To solve the above technical problem, the song requesting system based on voiceprint recognition of the present invention comprises:
A song database, which stores songs and their corresponding song voiceprint models, each song voiceprint model being established by extracting the voiceprint features of the song;
A recording module, used to record the user's voice;
A voice signal processing module, connected to the recording module and used to preprocess the voice signal recorded by the recording module;
A voiceprint feature extraction module, connected to the voice signal processing module and used to analyze the voice signal, extract voiceprint features from it, and establish a voiceprint model;
A voiceprint feature comparison module, connected to the voiceprint feature extraction module and used to perform similarity matching between the user voiceprint model established by the voiceprint feature extraction module and the song voiceprint models in the song database, and to return a list of the songs whose voiceprint models match the user voiceprint model.
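As an illustration only, the song database described above could be held in memory as a mapping from song title to a stored voiceprint model. The sketch below is not part of the patent text: the names `SongRecord`, `SongDatabase`, and `mean_and_energy`, and the use of a plain list of floats as the model, are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Sequence

# Hypothetical representation: a voiceprint model is a fixed-length feature vector.
VoiceprintModel = List[float]


@dataclass
class SongRecord:
    """One database entry: a song and the voiceprint model built from its audio."""
    title: str
    model: VoiceprintModel


@dataclass
class SongDatabase:
    """Stores each song together with its corresponding voiceprint model."""
    extract_features: Callable[[Sequence[float]], VoiceprintModel]
    records: Dict[str, SongRecord] = field(default_factory=dict)

    def add_song(self, title: str, audio: Sequence[float]) -> None:
        # The model is established by extracting features from the song audio.
        self.records[title] = SongRecord(title, self.extract_features(audio))

    def models(self) -> List[SongRecord]:
        return list(self.records.values())


def mean_and_energy(audio: Sequence[float]) -> VoiceprintModel:
    """Placeholder extractor (an assumption): mean and mean energy of the samples."""
    n = len(audio)
    return [sum(audio) / n, sum(x * x for x in audio) / n]


db = SongDatabase(extract_features=mean_and_energy)
db.add_song("song_a", [0.0, 1.0, 0.0, -1.0])
db.add_song("song_b", [0.5, 0.5, 0.5, 0.5])
```

Passing the extractor in as a function keeps the storage layout independent of whichever voiceprint features are actually used, which the text leaves open.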
Another technical problem to be solved by the present invention is to provide an application method of the above song requesting system.
To solve this technical problem, the application method of the song requesting system of the present invention comprises the following steps:
(1) prompting the user to utter a segment of speech;
(2) recording the user's voice;
(3) preprocessing the recorded voice signal;
(4) analyzing the voice signal obtained in step (3), extracting the user's voiceprint features contained in the voice signal, and establishing a user voiceprint model;
(5) performing similarity matching between the user voiceprint model obtained in step (4) and the voiceprint models of the songs in the song database, calculating the similarity between them, and, according to the distribution of the similarity values, choosing a suitable value as a threshold so as to control the number of retrieved songs whose voiceprint models match the user voiceprint model;
(6) providing the list of songs retrieved in step (5) to the user for selection.
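The six steps can be read as a simple pipeline. The following sketch wires them together with caller-supplied functions. Every name in it (`request_songs`, `record`, `preprocess`, `build_model`, `similarity`) is hypothetical, and the toy functions in the usage lines merely stand in for real recording and voiceprint analysis.

```python
from typing import Callable, Dict, List, Sequence

Signal = Sequence[float]
Model = Sequence[float]


def request_songs(
    record: Callable[[], Signal],                 # steps (1)-(2): prompt and record
    preprocess: Callable[[Signal], Signal],       # step (3)
    build_model: Callable[[Signal], Model],       # step (4)
    similarity: Callable[[Model, Model], float],  # step (5): higher means more alike
    song_models: Dict[str, Model],
    threshold: float,
) -> List[str]:
    """Return the titles whose voiceprint model matches the user's, best first."""
    signal = preprocess(record())
    user_model = build_model(signal)
    scored = [(similarity(user_model, m), title) for title, m in song_models.items()]
    matched = [(s, t) for s, t in scored if s >= threshold]
    matched.sort(reverse=True)
    return [title for _, title in matched]        # step (6): list offered to the user


# Toy stand-ins (assumptions) so the sketch runs end to end.
songs = {"low_song": [1.0], "mid_song": [2.0], "high_song": [5.0]}
result = request_songs(
    record=lambda: [2.0, 2.5, 1.5],
    preprocess=lambda s: s,
    build_model=lambda s: [sum(s) / len(s)],
    similarity=lambda a, b: -abs(a[0] - b[0]),
    song_models=songs,
    threshold=-1.5,
)
```

With these stand-ins the user model is `[2.0]`, so `mid_song` and `low_song` pass the threshold and `high_song` is excluded.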
The song requesting system of the present invention and its application method can retrieve songs suited to a user's voice according to the user's voiceprint features and offer them to the user for selection, thereby improving the user's singing and giving the user a good singing experience.
Description of the drawings
The present invention is described in further detail below with reference to the accompanying drawings and an embodiment:
Fig. 1 is a module diagram of the song requesting system of the present invention;
Fig. 2 is a flow chart of the application method of the song requesting system of the present invention.
Detailed description of the embodiment
For a more specific understanding of the technical content, features, and effects of the present invention, the illustrated embodiment is described in detail as follows:
A voiceprint is the sound-wave spectrum, carrying speech information, that is displayed by an electroacoustic instrument. Human speech is produced by a complex physiological and physical process between the language centers of the body and the vocal organs, and the organs used in speaking (tongue, teeth, larynx, lungs, nasal cavity) vary widely from person to person in size and shape. The voiceprint spectrograms of any two people are therefore not identical, so the features of a voiceprint spectrogram can be used to automatically identify the speaker to whom the voiceprint belongs.
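To make the notion of a spectrum concrete, the sketch below computes the magnitude spectrum of one short frame of samples with a direct discrete Fourier transform. It illustrates spectral analysis in general, not the feature extraction used by the invention, which the text does not specify. The function name `magnitude_spectrum` and the test tone are assumptions.

```python
import cmath
import math
from typing import List, Sequence


def magnitude_spectrum(frame: Sequence[float]) -> List[float]:
    """Magnitudes of DFT bins 0..N/2 of a real-valued frame (direct O(N^2) form)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n) for i, x in enumerate(frame))
        spectrum.append(abs(acc))
    return spectrum


# A pure tone that completes exactly 4 cycles in a 32-sample frame.
frame = [math.sin(2 * math.pi * 4 * i / 32) for i in range(32)]
spectrum = magnitude_spectrum(frame)
peak_bin = max(range(len(spectrum)), key=spectrum.__getitem__)
```

The energy of the tone concentrates in bin 4. A sequence of such spectra over successive frames is what a spectrogram displays.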
At present, voiceprint recognition is a very mature technology and is widely used in fields such as criminal investigation. As the technology develops, its accuracy continues to improve. The present invention uses voiceprint recognition to make the song requesting system more intelligent, so that it can retrieve songs suited to the user's voice for the user to request.
The song requesting system based on voiceprint recognition of the present invention first collects songs and analyzes the audio signal of each song. Voiceprint recognition technology is used to extract the voiceprint features of the song from the audio signal (in practice, the voiceprint features of the song's singer) and to establish a voiceprint model of the song; the voiceprint models are then stored in a song database in one-to-one correspondence with the songs. In addition, as shown in Fig. 1, the song requesting system of the present invention also includes the following modules:
Recording module 1, used to record the user's voice; this recording module 1 may be at least one microphone;
Voice signal processing module 2, connected to recording module 1 and used to preprocess the voice signal recorded by recording module 1, for example by quantization, pre-emphasis, windowing, and filtering;
Voiceprint feature extraction module 3, connected to voice signal processing module 2 and used to analyze the voice signal, extract voiceprint features from it, and establish a voiceprint model;
Voiceprint feature comparison module 4, connected to voiceprint feature extraction module 3 and used to perform similarity matching between the user voiceprint model established by voiceprint feature extraction module 3 and the song voiceprint models in the song database, and to return a list of the songs whose voiceprint models match the user voiceprint model.
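Two of the preprocessing operations named for voice signal processing module 2, pre-emphasis and windowing, can be sketched as follows. The text gives no parameters, so the coefficient 0.97, the choice of a Hamming window, and the frame and hop sizes are assumptions taken from common speech-processing practice, and the function names are hypothetical.

```python
import math
from typing import List, Sequence


def pre_emphasis(signal: Sequence[float], alpha: float = 0.97) -> List[float]:
    """First-order high-pass filter: y[n] = x[n] - alpha * x[n-1]."""
    if not signal:
        return []
    return [signal[0]] + [signal[i] - alpha * signal[i - 1] for i in range(1, len(signal))]


def hamming(length: int) -> List[float]:
    """Hamming window coefficients for a frame of the given length (length >= 2)."""
    return [0.54 - 0.46 * math.cos(2 * math.pi * i / (length - 1)) for i in range(length)]


def windowed_frames(signal: Sequence[float], size: int, hop: int) -> List[List[float]]:
    """Split the signal into overlapping frames and apply the window to each."""
    window = hamming(size)
    return [
        [signal[start + i] * window[i] for i in range(size)]
        for start in range(0, len(signal) - size + 1, hop)
    ]


samples = [1.0] * 10                      # stand-in for recorded samples
emphasized = pre_emphasis(samples)
frames = windowed_frames(emphasized, size=4, hop=2)
```

Pre-emphasis boosts the high-frequency part of the signal, and windowing tapers each frame at its edges before any spectral analysis is applied.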
When a user enters the above song requesting system and prepares to request a song, the system selects songs for the user according to the following steps:
(1) the user is prompted to utter a segment of speech, which may be a spoken passage or a passage of singing;
(2) when the user utters the speech, recording module 1 records the user's voice;
(3) voice signal processing module 2 preprocesses the voice signal recorded in step (2), for example by filtering to remove noise from the voice signal, or by quantization, pre-emphasis, windowing, and the like;
(4) voiceprint feature extraction module 3 analyzes the voice signal processed in step (3), uses voiceprint recognition technology to extract the user's voiceprint features contained in the voice signal, and establishes a user voiceprint model;
(5) voiceprint feature comparison module 4 performs similarity matching between the user voiceprint model obtained in step (4) and the song voiceprint models in the song database, that is, it calculates the similarity between them; then, according to the distribution of the similarity values, it chooses a suitable similarity value as a threshold (adjusting the threshold controls the number of songs retrieved), retrieves the song voiceprint models in the song database that match the user voiceprint model, and returns a list of the songs corresponding to those song voiceprint models;
(6) the retrieved song list is provided to the user for selection.
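Step (5) chooses the threshold from the distribution of similarity values but does not say how. One simple possibility, shown here purely as an assumption, is to take the score of the k-th best song as the threshold so that roughly k songs are returned. The names `choose_threshold` and `retrieve` and the scores are hypothetical.

```python
from typing import Dict, List, Tuple


def choose_threshold(similarities: List[float], max_songs: int) -> float:
    """Pick the threshold from the score distribution: the max_songs-th highest score.

    Assumes at least one score is present.
    """
    ranked = sorted(similarities, reverse=True)
    return ranked[min(max_songs, len(ranked)) - 1]


def retrieve(scores: Dict[str, float], max_songs: int) -> Tuple[float, List[str]]:
    """Return the chosen threshold and the titles scoring at or above it, best first."""
    threshold = choose_threshold(list(scores.values()), max_songs)
    titles = [t for t, s in sorted(scores.items(), key=lambda kv: -kv[1]) if s >= threshold]
    return threshold, titles


# Hypothetical similarity scores between one user model and five song models.
scores = {"song_a": 0.91, "song_b": 0.42, "song_c": 0.77, "song_d": 0.15, "song_e": 0.80}
threshold, titles = retrieve(scores, max_songs=3)
```

Raising `max_songs` lowers the threshold and returns more songs, and lowering it does the opposite. If several songs tie at the threshold score, slightly more than `max_songs` titles can be returned.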
The similarity matching in step (5) above may use a vector quantization model, a stochastic model, or a neural network model. The hidden Markov model (HMM) is a stochastic model based on transition probabilities and emission probabilities. It treats sound as a random process composed of an observable symbol sequence (that is, the output of the state sequence of the sound-producing system). Because it requires no time normalization, it saves computation time and storage during recognition, and it is therefore widely used in the field of speech recognition. The preferred embodiment of the present invention also uses a hidden Markov model for the similarity matching calculation, and controls the number of songs whose voiceprint models are retrieved as similar to the user voiceprint model by adjusting the maximum probability (that is, the threshold). When too many songs are retrieved, the maximum probability value is raised, tightening the similarity requirement so that the retrieved songs suit the user better; conversely, when very few songs are retrieved, the maximum probability value is lowered, relaxing the similarity requirement so that more songs are retrieved for the user to choose from.
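The text does not spell out how the hidden Markov model produces a similarity value. One common reading, given here as an assumption, is to score the user's observation sequence against each song's HMM with the forward algorithm and to use the resulting log-probability as the similarity. The sketch below handles a discrete HMM whose observations are symbol indices (for example, codebook indices); the function name and all probabilities are illustrative.

```python
import math
from typing import Sequence


def hmm_log_likelihood(
    obs: Sequence[int],
    start: Sequence[float],            # start[i]    = P(first state is i)
    trans: Sequence[Sequence[float]],  # trans[i][j] = P(next state j | state i)
    emit: Sequence[Sequence[float]],   # emit[i][k]  = P(symbol k | state i)
) -> float:
    """Log P(obs | model) by the scaled forward algorithm (obs must be non-empty)."""
    n_states = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n_states)]
    log_prob = 0.0
    for t in range(len(obs)):
        if t > 0:
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n_states)) * emit[j][obs[t]]
                for j in range(n_states)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")       # the model cannot produce this sequence
        log_prob += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid numerical underflow
    return log_prob


# Hypothetical two-state song model and a two-symbol user sequence.
song_model = dict(
    start=[0.6, 0.4],
    trans=[[0.7, 0.3], [0.4, 0.6]],
    emit=[[0.9, 0.1], [0.2, 0.8]],
)
user_symbols = [0, 1]
score = hmm_log_likelihood(user_symbols, **song_model)
```

Under this reading, a song is retrieved when its score is at or above the threshold, so raising the threshold shortens the list and lowering it lengthens the list, as the paragraph above describes.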
With the song requesting system of the present invention, the user can choose songs suited to his or her own voice, rather than blindly requesting songs that are popular or personally liked but not suited to his or her voice. This not only improves the user's singing and achieves the purpose of entertainment, but also helps the singer win the audience's approval, giving the user a good singing experience.

Claims (6)

1. A song requesting system based on voiceprint recognition, characterized by comprising:
A song database, which stores songs and their corresponding song voiceprint models, each song voiceprint model being established by extracting the voiceprint features of the song;
A recording module, used to record the user's voice;
A voice signal processing module, connected to the recording module and used to preprocess the voice signal recorded by the recording module;
A voiceprint feature extraction module, connected to the voice signal processing module and used to analyze the voice signal, extract voiceprint features from it, and establish a voiceprint model;
A voiceprint feature comparison module, connected to the voiceprint feature extraction module and used to perform similarity matching between the user voiceprint model established by the voiceprint feature extraction module and the song voiceprint models in the song database, and to return a list of the songs whose voiceprint models match the user voiceprint model.
2. The song requesting system as claimed in claim 1, characterized in that said recording module is at least one microphone.
3. An application method of the song requesting system of claim 1, characterized by comprising the following steps:
(1) prompting the user to utter a segment of speech;
(2) recording the user's voice;
(3) preprocessing the recorded voice signal;
(4) analyzing the voice signal obtained in step (3), extracting the user's voiceprint features contained in the voice signal, and establishing a user voiceprint model;
(5) performing similarity matching between the user voiceprint model obtained in step (4) and the voiceprint models of the songs in the song database, calculating the similarity between them, and, according to the distribution of the similarity values, choosing a suitable value as a threshold so as to control the number of retrieved songs whose voiceprint models match the user voiceprint model;
(6) providing the list of songs retrieved in step (5) to the user for selection.
4. The application method of the song requesting system as claimed in claim 3, characterized in that, in said step (3), the preprocessing of the voice signal comprises filtering to remove noise from the voice signal.
5. The song requesting method as claimed in claim 3, characterized in that, in said step (5), a vector quantization model, a stochastic model, or a neural network model is used to perform the similarity matching calculation.
6. The song requesting method as claimed in claim 5, characterized in that said stochastic model is a hidden Markov model.
CN2010102734656A 2010-09-08 2010-09-08 Song requesting system based on voiceprint recognition and application method thereof Pending CN102404278A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102734656A CN102404278A (en) 2010-09-08 2010-09-08 Song requesting system based on voiceprint recognition and application method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102734656A CN102404278A (en) 2010-09-08 2010-09-08 Song requesting system based on voiceprint recognition and application method thereof

Publications (1)

Publication Number Publication Date
CN102404278A true CN102404278A (en) 2012-04-04

Family

ID=45886074

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102734656A Pending CN102404278A (en) 2010-09-08 2010-09-08 Song requesting system based on voiceprint recognition and application method thereof

Country Status (1)

Country Link
CN (1) CN102404278A (en)

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120404