CN101605187A - The method of control voice quality in Conference server, user terminal and the voice conferencing - Google Patents
The method of control voice quality in Conference server, user terminal and the voice conferencing Download PDFInfo
- Publication number
- CN101605187A CN101605187A CNA2009100887512A CN200910088751A CN101605187A CN 101605187 A CN101605187 A CN 101605187A CN A2009100887512 A CNA2009100887512 A CN A2009100887512A CN 200910088751 A CN200910088751 A CN 200910088751A CN 101605187 A CN101605187 A CN 101605187A
- Authority
- CN
- China
- Prior art keywords
- user terminal
- voice signal
- voice
- quality
- low
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 28
- 230000009467 reduction Effects 0.000 claims abstract description 30
- 238000012545 processing Methods 0.000 claims abstract description 27
- 238000001514 detection method Methods 0.000 claims description 74
- 238000001228 spectrum Methods 0.000 claims description 29
- 230000008569 process Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000009131 signaling function Effects 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Images
Landscapes
- Telephonic Communication Services (AREA)
Abstract
The invention provides the method for control voice quality in a kind of Conference server, user terminal and the voice conferencing, described Conference server comprises: judge module, whether the voice signal that is used to judge first user terminal input that inserts voice conferencing is low-quality voice signal, obtains a judged result; First processing module, be used for when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, described first user terminal carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.The present invention can effectively control the low quality phonetic problem in the voice conferencing.
Description
Technical field
The present invention relates to field of multimedia communication, relate in particular to the method for control voice quality in a kind of Conference server, user terminal and the voice conferencing.
Background technology
Audio conference service be a kind of practical value very high, very the multimedia communication mode of development potentiality is arranged, voice conferencing can carry out speech exchange simultaneously so that be positioned at a plurality of participants of diverse geographic location.
The problem that often runs in the voice conferencing is the low quality phonetic problem, for example, if the volume of speech side is less, other participants then possibly can't catch, and therefore, other participants may interrupt spokesman's speech, allow him make a speech again, wasted the time of meeting; If there is the jamming pattern sound in spokesman's one end, perhaps the spokesman is away from keyboard and misplug into maintenance sound or music etc., and then voice conferencing will seriously be disturbed, and has hindered other participants' speech exchange, has influenced the definition of voice conferencing.
Summary of the invention
In view of this, the embodiment of the invention provides the method for control voice quality in a kind of Conference server, user terminal and the voice conferencing, can effectively control the low quality phonetic problem in the voice conferencing.
For addressing the above problem, the embodiment of the invention provides a kind of Conference server, comprising:
Judge module is used to judge whether the voice signal of first user terminal input that inserts voice conferencing is low-quality voice signal, obtains a judged result;
First processing module, be used for when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, described first user terminal carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.
Described judge module comprises:
First detection module is used for the default speech parameter by the voice signal that detects described first user terminal input, judges whether the voice signal of described first user terminal input is low-quality voice signal; And/or
Second detection module is used for whether receiving the low quality phonic warning information that described first user terminal sends by detecting, and judges whether the voice signal of described first user terminal input is low-quality voice signal.
Described Conference server also comprises:
Voice channel is set up module, be used for when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, set up the independent voice channel between second user terminal and described first user terminal, to be used for described second user terminal voice signal of described first user terminal input is carried out secondary detection, described second user terminal is for inserting the arbitrary user terminal except that described first user terminal of voice conferencing;
Receiver module, be used to receive the secondary detection result of described second user terminal, whether the voice signal of described first user terminal input of indication is low-quality voice signal among the described secondary detection result, described secondary detection result as new judged result, is sent to described first processing module.
Described first detection module comprises:
First Executive Module, whether the volume of voice signal that is used to detect first user terminal input that inserts voice conferencing is less than predetermined threshold value, in described volume during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described volume; And/or
Second Executive Module, be used to detect the spectrum distribution of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described spectrum distribution; And/or
The 3rd Executive Module, be used to detect the tonequality or the speech energy rank of the voice signal of first user terminal input that inserts voice conferencing, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described tonequality or speech energy rank; And/or
The 4th Executive Module, be used to detect the speech energy rank or the overall noise rank of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described speech energy rank or overall noise rank.
Described first processing module comprises:
Reminding module is used for described first user terminal being pointed out, and generating the timing trigger message when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal;
Timer is used for according to described timing trigger message, picks up counting from the moment that described first user terminal is pointed out;
Trigger module, be used for surpassing Preset Time in the timing of described timer, described judged result is still indicated the voice signal of described first user terminal input when being low-quality voice signal, described first user terminal is carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.
Described Conference server also comprises:
Second processing module, be used for receive described first user terminal insert the request of described voice conferencing again the time, voice channel between foundation and described first user terminal, and trigger described judge module and judge whether the voice signal of described first user terminal input is low-quality voice signal, obtains described judged result;
Access module is used for when described judged result indicates the voice signal of described first user terminal input to be not low-quality voice signal described first user terminal being inserted in the described voice conferencing again.
The embodiment of the invention also provides a kind of user terminal, comprising:
Detection module is used at voice conferencing, by detecting the default speech parameter of the voice signal of exporting, judges whether the voice signal of described output is low-quality voice signal, and obtains a judged result;
Processing module, be used for when described judged result indicates the voice signal of described output to be low-quality voice signal, carry out being connected or generating low quality phonic warning information and sending to Conference server in the described voice conferencing of noise reduction processing, disconnection and described voice conferencing, by described Conference server described user terminal is carried out noise reduction and handle or disconnect being connected of described user terminal and described voice conferencing.
Described detection module comprises:
First Executive Module, whether the volume of voice signal that is used to detect output is less than predetermined threshold value, during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described output in described volume, and described default speech parameter is described volume; And/or
Second Executive Module, be used to detect the spectrum distribution of the voice signal of output, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described spectrum distribution; And/or
The 3rd Executive Module, be used to detect the tonequality or the speech energy rank of the voice signal of output, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described tonequality or speech energy rank; And/or
The 4th Executive Module, be used to detect the speech energy rank or the overall noise rank of the voice signal of output, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described speech energy rank or overall noise rank.
The embodiment of the invention also provides the method for control voice quality in a kind of voice conferencing, may further comprise the steps:
Whether the voice signal of judging first user terminal input that inserts voice conferencing is low-quality voice signal, obtains a judged result;
When described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, described first user terminal is carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.
Whether the voice signal that described judgement inserts first user terminal input of voice conferencing is that low-quality voice signal is specially:
The default speech parameter of the voice signal by detecting the input of described first user terminal judges whether the voice signal of described first user terminal input is low-quality voice signal; Or
Whether receive the low quality phonic warning information that described first user terminal sends by detecting, judge whether the voice signal of described first user terminal input is low-quality voice signal.
Whether the voice signal that described judgement inserts first user terminal input of voice conferencing is low-quality voice signal, obtains also comprising after the judged result:
When described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, set up the independent voice channel between second user terminal and described first user terminal, to be used for described second user terminal voice signal of described first user terminal input is carried out secondary detection, described second user terminal is for inserting the arbitrary user terminal except that described first user terminal of voice conferencing;
Receive the secondary detection result of described second user terminal, whether the voice signal of described first user terminal of indication input is low-quality voice signal among the described secondary detection result, with described secondary detection result as new judged result.
The described default speech parameter that passes through the voice signal of described first user terminal input of detection, judge whether the voice signal of described first user terminal input is that low-quality voice signal is specially:
Whether the volume of the voice signal of first user terminal input of detection access voice conferencing is less than predetermined threshold value, in described volume during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described volume; Or
Detect the spectrum distribution of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described spectrum distribution; Or
Detect the tonequality or the speech energy rank of the voice signal of first user terminal input that inserts voice conferencing, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described tonequality or speech energy rank; Or
Detect the speech energy rank or the overall noise rank of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described speech energy rank or overall noise rank.
When described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, described first user terminal is carried out the noise reduction processing or disconnects described first user terminal being specially with being connected of described voice conferencing:
Described first user terminal is pointed out, and generate the timing trigger message;
According to described timing trigger message, pick up counting from the moment that described first user terminal is pointed out;
Timing at described timer surpasses Preset Time, described judged result is still indicated the voice signal of described first user terminal input when being low-quality voice signal, described first user terminal is carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.
Described first user terminal of described disconnection also comprises with being connected of described voice conferencing afterwards:
Receive described first user terminal insert the request of described voice conferencing again the time, set up and described first user terminal between voice channel;
Whether the voice signal that rejudges described first user terminal input is low-quality voice signal, obtains described judged result;
When described judged result indicates the voice signal of described first user terminal input to be not low-quality voice signal, described first user terminal is inserted in the described voice conferencing again.
Embodiments of the invention have following beneficial effect:
In the voice conferencing that a plurality of user terminals participate in, whether the voice signal of judging user terminal is low-quality voice signal, when the voice signal of first user terminal is low-quality voice signal, described first user terminal is carried out noise reduction to be handled, or disconnect described first user terminal and operations such as being connected of voice conferencing, thereby make voice conferencing clearer, smooth, do not disturbed by the user terminal of low voice quality, and normal voice conferencing is still carrying out in whole process, and other user terminals needn't leave voice conferencing;
In addition, when the voice signal that detects for the first time described first user terminal is low-quality voice signal, can also be the independent voice channel that is used for secondary detection of setting up of second user terminal in described first user terminal and the voice conferencing, carry out secondary detection with voice signal to described first user terminal input, and according to the result of this secondary detection, determine whether described first user terminal is the user terminal of low voice quality, thereby improved reliability.
Description of drawings
Fig. 1 is the structural representation of the Conference server of the embodiment of the invention;
Fig. 2 is another structural representation of the Conference server of the embodiment of the invention;
Fig. 3 is the another structural representation of the Conference server of the embodiment of the invention;
Fig. 4 is a structural representation again of the Conference server of the embodiment of the invention;
Fig. 5 is the another structural representation of the Conference server of the embodiment of the invention;
Fig. 6 is the schematic flow sheet of the method for control voice quality in the voice conferencing of the embodiment of the invention;
Fig. 7 is another schematic flow sheet of the method for control voice quality in the voice conferencing of the embodiment of the invention;
Fig. 8 is the another schematic flow sheet of the method for control voice quality in the voice conferencing of the embodiment of the invention.
Embodiment
Below in conjunction with drawings and Examples, the specific embodiment of the present invention is described in further detail.
Be illustrated in figure 1 as the structural representation of the Conference server of the embodiment of the invention, described Conference server comprises:
Described Conference server can insert a plurality of user terminals voice conferencing and keep described voice conferencing;
Described user terminal can be for having the terminal of collecting the voice signal function or portable terminal etc., described user terminal is by IP (Internet Protocol, Internet Protocol) voice conferencing kept of network, mobile communications network or the described Conference server of other network insertions, the participant of described voice conferencing sends voice signal by the user terminal of its use to described Conference server, described user terminal receives the voice signal of other user terminals inputs that described Conference server sends, and plays to the participant and listen to.
The Conference server that provides by the foregoing description, in the voice conferencing that a plurality of user terminals participate in, whether the voice signal of judging the user terminal input is low-quality voice signal, when the voice signal of user terminal input is low-quality voice signal, described user terminal is handled accordingly, thereby can effectively control the low quality phonetic problem in the voice conferencing, make voice conferencing clearer, smooth, do not disturbed by the user terminal of low voice quality, and normal voice conferencing is still carrying out in whole control process, and other user terminals needn't leave voice conferencing.
Conference server in the foregoing description can judge in several ways whether the voice signal of first user terminal input that inserts voice conferencing is low-quality voice signal, for example, the default speech parameter of the voice signal by detecting the input of described first user terminal, whether the voice signal of judging described first user terminal input is low-quality voice signal, perhaps, by detecting the low quality phonic warning information that described first user terminal sends that whether receives, whether the voice signal of judging described first user terminal input is low-quality voice signal, or the like.
Be illustrated in figure 2 as another structural representation of the Conference server of the embodiment of the invention, on the basis of embodiment shown in Figure 1, described Conference server also comprises:
First detection module 111 is used for the default speech parameter by the voice signal that detects described first user terminal input, judges whether the voice signal of described first user terminal input is low-quality voice signal; The detection mode of described first detection module 111 can be for detecting after detecting in real time, periodically detect or being triggered by user terminal etc.The default speech parameter of described voice signal can for; Speech parameters such as the volume of voice signal, tonequality, spectrum distribution or speech energy rank.
Described judge module 11 can comprise any one in described first detection module 111 and described second detection module 112, also can comprise described first detection module 111 and described second detection module 112 simultaneously.
The judged result of above-mentioned judge module 11 can not entirely accurate, therefore, can carry out secondary detection to the voice signal of described first user terminal input, to improve the reliability of judged result, be illustrated in figure 3 as the another structural representation of the Conference server of the embodiment of the invention, on the basis of embodiment shown in Figure 1, described Conference server comprises:
Voice channel is set up module 13, be used for when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, set up the independent voice channel between second user terminal and described first user terminal, to be used for described second user terminal voice signal of described first user terminal input is carried out secondary detection, described second user terminal is for inserting the arbitrary user terminal except that described first user terminal of described voice conferencing;
Independent voice channel between above-mentioned second user terminal and described first user terminal can be unidirectional voice channel, when described voice channel is unidirectional voice channel, described second user terminal can obtain the voice signal of described first user terminal input, and described first user terminal can't obtain the voice signal of described second user terminal input; Described voice channel also can be two-way voice channel, and when described voice channel was two-way voice channel, described second user terminal and described first user terminal can obtain the voice signal of the other side's input mutually;
Behind the independent voice channel of setting up between described second user terminal and described first user terminal, whether the voice signal that described second user terminal can adopt dual mode to detect described first user terminal input is low-quality voice signal:
First kind of detection mode is the automatic detection mode of user terminal, be the voice channel of described second user terminal by setting up, obtain the voice signal of described first user terminal input, described voice signal is analyzed, judge whether described voice signal is low-quality voice signal;
Second kind of detection mode is artificial detection mode, it is described second user terminal obtains described first user terminal input by described voice channel voice signal, participant by described second user terminal, one side listens to described voice signal, thereby judge whether described voice signal is low-quality voice signal, after participant's judgement of described second user terminal, one side finishes, can notify described Conference server with described judged result by described second user terminal, generally, the artificial mode that detects is more more accurate than the mode that user terminal detects;
In concrete implementation procedure, usually with the initiation terminal of this voice conferencing as described second user terminal, listen to the voice signal of described first user terminal input by participant's (being the promoter of voice conferencing) of initiation terminal one side of described voice conferencing, and provide the secondary detection result.
In addition, before described Conference server is set up independent voice channel between described second user terminal and described first user terminal, preferably at first send and carry out the request of secondary detection to described second user terminal, to avoid described second user terminal because of busy or other reasons, can't carry out described secondary detection, receiving described second user terminal when accepting replying of described request, set up the independent voice channel between described second user terminal and described first user terminal again.If described second user terminal is because busy or other reasons, can't carry out described secondary detection, just do not need to send replying of described request, described Conference server through Preset Time do not receive described second user terminal for the replying of described request the time, described Conference server can also be reselected another user terminal in the voice conferencing as second user terminal, sends the request of described secondary detection.
The Conference server that provides by the foregoing description, when the voice signal that detects for the first time described first user terminal is low-quality voice signal, can between second user terminal in described first user terminal and the voice conferencing, set up an independent voice channel that is used for secondary detection, carry out secondary detection with voice signal to described first user terminal input, and according to the result of this secondary detection, determine whether described first user terminal is the user terminal of low voice quality, because the signal to described first user terminal input has carried out repeated detection, thereby can effectively improve the reliability of detection.
Above-mentioned first detection module 111 can be analyzed the one or more default speech parameter of the voice signal of first user terminal input that inserts voice conferencing, judges whether the voice signal of described first user terminal input is low-quality voice signal.Be illustrated in figure 4 as a structural representation again of the Conference server of the embodiment of the invention, on the basis of above-mentioned embodiment shown in Figure 2, described first detection module 111 comprises:
First Executive Module 1111, whether the volume of voice signal that is used to detect first user terminal input that inserts voice conferencing is less than predetermined threshold value, in described volume during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described volume; And/or
Second Executive Module 1112, be used to detect the spectrum distribution of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described spectrum distribution; And/or
The 3rd Executive Module 1113, be used to detect the tonequality or the speech energy rank of the voice signal of first user terminal input that inserts voice conferencing, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described tonequality or speech energy rank; And/or
The 4th Executive Module 1114, be used to detect the speech energy rank or the overall noise rank of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described speech energy rank or overall noise rank.
Mention in the above-described embodiments, when the voice signal of described first user terminal input was low-quality voice signal, described first processing module 12 can be carried out noise reduction to described first user terminal and be handled or disconnect operations such as described first user terminal and being connected of voice conferencing.Described Conference server to the described first user terminal noise reduction after, described first user terminal still can continue to receive the content of described voice conferencing, but can't be to described Conference server input speech signal.In addition, before described Conference server is carried out operations such as noise reduction processing or disconnection connect to described first user terminal, preferably at first described first user terminal is pointed out, the voice signal of pointing out its input is low-quality voice signal, so that described first user terminal is in time adjusted, behind the process Preset Time, when if the voice signal of described first user terminal input still is low-quality voice signal, again described first user terminal is carried out noise reduction, disconnect operations such as connection, therefore, as shown in Figure 5, on the basis of embodiment shown in Figure 1, described first processing module 12 comprises:
Reminding module 121 is used for described first user terminal being pointed out, and generating the timing trigger message when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal;
When described reminding module 121 can be low-quality voice signal at the voice signal of described first user terminal input of prompting, notify described first user terminal with the reason of detected low voice quality, for example, when the volume of the voice signal that detects described first user terminal input is hanged down, can directly notify described first user terminal by described reminding module 121, so that the described first user terminal rapid adjustment voice quality with this reason.
Described Conference server with described first user terminal and described voice conferencing disconnect be connected after, described first user terminal may be eliminated the reason that has low voice quality, after described first user terminal has been eliminated the reason of low voice quality, can send the request that inserts described voice conferencing to described Conference server again, described Conference server then can insert described first user terminal in the described voice conferencing again, therefore, as shown in Figure 5, described Conference server also comprises:
Second processing module 15, be used for receive described first user terminal insert the request of described voice conferencing again the time, voice channel between foundation and described first user terminal, and trigger described judge module 11 and judge whether the voice signal of described first user terminal input is low-quality voice signal, obtains described judged result;
The embodiment of the invention also provides a kind of user terminal, and described user terminal comprises:
Detection module is used at voice conferencing, by detecting the default speech parameter of the voice signal of exporting, judges whether the voice signal of described output is low-quality voice signal, and obtains a judged result; The default speech parameter of described voice signal can for; Speech parameters such as the volume of voice signal, tonequality, spectrum distribution or speech energy rank.
Processing module, be used for when described judged result indicates the voice signal of described output to be low-quality voice signal, carry out being connected or generating low quality phonic warning information and sending to Conference server in the described voice conferencing of noise reduction processing, disconnection and described voice conferencing, by described Conference server described user terminal is carried out noise reduction and handle or disconnect being connected of described user terminal and described voice conferencing.
Above-mentioned detection module can be analyzed the one or more default speech parameter of the voice signal of first user terminal input that inserts voice conferencing, judges whether the voice signal of described first user terminal input is low-quality voice signal.On the basis of the foregoing description, described detection module comprises:
First Executive Module, whether the volume of voice signal that is used to detect output is less than predetermined threshold value, during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described output in described volume, and described default speech parameter is described volume; And/or
Second Executive Module, be used to detect the spectrum distribution of the voice signal of output, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described spectrum distribution; And/or
The 3rd Executive Module, be used to detect the tonequality or the speech energy rank of the voice signal of output, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described tonequality or speech energy rank; And/or
The 4th Executive Module, be used to detect the speech energy rank or the overall noise rank of the voice signal of output, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described speech energy rank or overall noise rank.
The user terminal that provides by the foregoing description, can be in voice conferencing, detect whether the voice signal of self exporting is low-quality voice signal, when the voice signal that detects output is low-quality voice signal, handle accordingly, thereby can effectively control the low quality phonetic problem in the voice conferencing, make voice conferencing clearer, smooth.
Be illustrated in figure 6 as the method for control voice quality in the voice conferencing of the embodiment of the invention, said method comprising the steps of:
Whether the voice signal that can judge first user terminal input that inserts voice conferencing in the above-mentioned steps 61 in several ways is low-quality voice signal, for example, the default speech parameter of the voice signal by detecting the input of described first user terminal, whether the voice signal of judging described first user terminal input is low-quality voice signal, perhaps, by detecting the low quality phonic warning information that described first user terminal sends that whether receives, whether the voice signal of judging described first user terminal input is low-quality voice signal, or the like.
Therefore, described step 61 can be specially:
The default speech parameter of the voice signal by detecting the input of described first user terminal judges whether the voice signal of described first user terminal input is low-quality voice signal; Or
Whether receive the low quality phonic warning information that described first user terminal sends by detecting, judge whether the voice signal of described first user terminal input is low-quality voice signal.
In the foregoing description, can analyze, judge whether the voice signal of described first user terminal input is low-quality voice signal the one or more default speech parameter of the voice signal of first user terminal input that inserts voice conferencing.Therefore, the default speech parameter of the voice signal by detecting the input of described first user terminal, judge whether the voice signal of described first user terminal input is that low-quality voice signal can be specially:
Whether the volume of the voice signal of first user terminal input of detection access voice conferencing is less than predetermined threshold value, in described volume during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described volume; Or
Detect the spectrum distribution of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described spectrum distribution; Or
Detect the tonequality or the speech energy rank of the voice signal of first user terminal input that inserts voice conferencing, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described tonequality or speech energy rank; Or
Detect the speech energy rank or the overall noise rank of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described speech energy rank or overall noise rank.
The method that provides by the foregoing description, in the voice conferencing that a plurality of user terminals participate in, whether the voice signal of judging the user terminal input is low-quality voice signal, when the voice signal of user terminal input is low-quality voice signal, described user terminal is handled accordingly, thereby can effectively control the low quality phonetic problem in the voice conferencing, make voice conferencing clearer, smooth, do not disturbed by the user terminal of low voice quality, and normal voice conferencing is still carrying out in whole control process, and other participants needn't leave voice conferencing.
Judged result in the foregoing description can not entirely accurate, therefore, can carry out secondary detection to the voice signal of described first user terminal input, thereby improve the reliability of judged result, be illustrated in figure 7 as another schematic flow sheet of the method for control voice quality in the voice conferencing of the embodiment of the invention, said method comprising the steps of:
The method that provides by the foregoing description, when the voice signal that detects for the first time described first user terminal is low-quality voice signal, can between second user terminal in described first user terminal and the voice conferencing, set up an independent voice channel, carry out secondary detection with voice signal to described first user terminal input, and according to the result of this secondary detection, determine whether described first user terminal is the user terminal of low voice quality, owing to carried out repeated detection, thereby can effectively improve the reliability of judged result.
In addition, before described first user terminal is carried out operations such as noise reduction processing or disconnection connect, preferably at first described first user terminal is pointed out, the voice signal of pointing out its input is low-quality voice signal, so that described first user terminal is in time adjusted, through behind the Preset Time,, more described first user terminal is carried out operations such as noise reduction, disconnection connection if when the voice signal of described first user terminal input still be low-quality voice signal.
Be illustrated in figure 8 as the another schematic flow sheet of the method for control voice quality in the voice conferencing of the embodiment of the invention, said method comprising the steps of:
Certainly, in step 84, when described judged result still indicates the voice signal of described first user terminal input to be low-quality voice signal, also can not disconnect being connected of first user terminal and described voice conferencing, but described first user terminal is carried out the noise reduction processing.
The above only is a preferred implementation of the present invention; should be pointed out that for those skilled in the art, under the prerequisite that does not break away from the principle of the invention; can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.
Claims (14)
1. a Conference server is characterized in that, comprising:
Judge module is used to judge whether the voice signal of first user terminal input that inserts voice conferencing is low-quality voice signal, obtains a judged result;
First processing module, be used for when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, described first user terminal carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.
2. Conference server according to claim 1 is characterized in that, described judge module comprises:
First detection module is used for the default speech parameter by the voice signal that detects described first user terminal input, judges whether the voice signal of described first user terminal input is low-quality voice signal; And/or
Second detection module is used for whether receiving the low quality phonic warning information that described first user terminal sends by detecting, and judges whether the voice signal of described first user terminal input is low-quality voice signal.
3. Conference server according to claim 1 and 2 is characterized in that, also comprises:
Voice channel is set up module, be used for when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, set up the independent voice channel between second user terminal and described first user terminal, to be used for described second user terminal voice signal of described first user terminal input is carried out secondary detection, described second user terminal is for inserting the arbitrary user terminal except that described first user terminal of voice conferencing;
Receiver module, be used to receive the secondary detection result of described second user terminal, whether the voice signal of described first user terminal input of indication is low-quality voice signal among the described secondary detection result, described secondary detection result as new judged result, is sent to described first processing module.
4. Conference server according to claim 2 is characterized in that, described first detection module comprises:
First Executive Module, whether the volume of voice signal that is used to detect first user terminal input that inserts voice conferencing is less than predetermined threshold value, in described volume during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described volume; And/or
Second Executive Module, be used to detect the spectrum distribution of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described spectrum distribution; And/or
The 3rd Executive Module, be used to detect the tonequality or the speech energy rank of the voice signal of first user terminal input that inserts voice conferencing, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described tonequality or speech energy rank; And/or
The 4th Executive Module, be used to detect the speech energy rank or the overall noise rank of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described speech energy rank or overall noise rank.
5. Conference server according to claim 1 is characterized in that, described first processing module comprises:
Reminding module is used for described first user terminal being pointed out, and generating the timing trigger message when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal;
Timer is used for according to described timing trigger message, picks up counting from the moment that described first user terminal is pointed out;
Trigger module, be used for surpassing Preset Time in the timing of described timer, described judged result is still indicated the voice signal of described first user terminal input when being low-quality voice signal, described first user terminal is carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.
6. Conference server according to claim 1 or 5 is characterized in that, also comprises:
Second processing module, be used for receive described first user terminal insert the request of described voice conferencing again the time, voice channel between foundation and described first user terminal, and trigger described judge module and judge whether the voice signal of described first user terminal input is low-quality voice signal, obtains described judged result;
Access module is used for when described judged result indicates the voice signal of described first user terminal input to be not low-quality voice signal described first user terminal being inserted in the described voice conferencing again.
7. a user terminal is characterized in that, comprising:
Detection module is used at voice conferencing, by detecting the default speech parameter of the voice signal of exporting, judges whether the voice signal of described output is low-quality voice signal, and obtains a judged result;
Processing module, be used for when described judged result indicates the voice signal of described output to be low-quality voice signal, carry out being connected or generating low quality phonic warning information and sending to Conference server in the described voice conferencing of noise reduction processing, disconnection and described voice conferencing, by described Conference server described user terminal is carried out noise reduction and handle or disconnect being connected of described user terminal and described voice conferencing.
8. user terminal according to claim 7 is characterized in that, described detection module comprises:
First Executive Module, whether the volume of voice signal that is used to detect output is less than predetermined threshold value, during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described output in described volume, and described default speech parameter is described volume; And/or
Second Executive Module, be used to detect the spectrum distribution of the voice signal of output, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described spectrum distribution; And/or
The 3rd Executive Module, be used to detect the tonequality or the speech energy rank of the voice signal of output, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described tonequality or speech energy rank; And/or
The 4th Executive Module, be used to detect the speech energy rank or the overall noise rank of the voice signal of output, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described output, and described default speech parameter is described speech energy rank or overall noise rank.
9. the method for control voice quality in the voice conferencing is characterized in that, may further comprise the steps:
Whether the voice signal of judging first user terminal input that inserts voice conferencing is low-quality voice signal, obtains a judged result;
When described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, described first user terminal is carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.
10. the method for control voice quality is characterized in that in the voice conferencing according to claim 9, and whether the voice signal that described judgement inserts first user terminal input of voice conferencing is that low-quality voice signal is specially:
The default speech parameter of the voice signal by detecting the input of described first user terminal judges whether the voice signal of described first user terminal input is low-quality voice signal; Or
Whether receive the low quality phonic warning information that described first user terminal sends by detecting, judge whether the voice signal of described first user terminal input is low-quality voice signal.
11. method according to control voice quality in claim 9 or the 10 described voice conferencings, it is characterized in that, whether the voice signal that described judgement inserts first user terminal input of voice conferencing is low-quality voice signal, obtains also comprising after the judged result:
When described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, set up the independent voice channel between second user terminal and described first user terminal, to be used for described second user terminal voice signal of described first user terminal input is carried out secondary detection, described second user terminal is for inserting the arbitrary user terminal except that described first user terminal of voice conferencing;
Receive the secondary detection result of described second user terminal, whether the voice signal of described first user terminal of indication input is low-quality voice signal among the described secondary detection result, with described secondary detection result as new judged result.
12. the method for control voice quality in the voice conferencing according to claim 10, it is characterized in that, the described default speech parameter that passes through the voice signal of described first user terminal input of detection, judge whether the voice signal of described first user terminal input is that low-quality voice signal is specially:
Whether the volume of the voice signal of first user terminal input of detection access voice conferencing is less than predetermined threshold value, in described volume during less than described predetermined threshold value, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described volume; Or
Detect the spectrum distribution of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising the circulation sound in the described voice signal or keeping sound in described spectrum distribution, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described spectrum distribution; Or
Detect the tonequality or the speech energy rank of the voice signal of first user terminal input that inserts voice conferencing, indicate in described tonequality or speech energy rank and to comprise in the described voice signal when keeping music, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described tonequality or speech energy rank; Or
Detect the speech energy rank or the overall noise rank of the voice signal of first user terminal input that inserts voice conferencing, indicate when comprising background noise in the described voice signal in described speech energy rank or overall noise rank, it is low-quality voice signal that described judged result is indicated the voice signal of described first user terminal input, and described default speech parameter is described speech energy rank or overall noise rank.
13. the method for control voice quality in the voice conferencing according to claim 9, it is characterized in that, when described judged result indicates the voice signal of described first user terminal input to be low-quality voice signal, described first user terminal is carried out the noise reduction processing or disconnects described first user terminal being specially with being connected of described voice conferencing:
Described first user terminal is pointed out, and generate the timing trigger message;
According to described timing trigger message, pick up counting from the moment that described first user terminal is pointed out;
Timing at described timer surpasses Preset Time, described judged result is still indicated the voice signal of described first user terminal input when being low-quality voice signal, described first user terminal is carried out noise reduction handle or disconnect being connected of described first user terminal and described voice conferencing.
14. the method according to control voice quality in claim 9 or the 13 described voice conferencings is characterized in that, described first user terminal of described disconnection also comprises with being connected of described voice conferencing afterwards:
Receive described first user terminal insert the request of described voice conferencing again the time, set up and described first user terminal between voice channel;
Whether the voice signal that rejudges described first user terminal input is low-quality voice signal, obtains described judged result;
When described judged result indicates the voice signal of described first user terminal input to be not low-quality voice signal, described first user terminal is inserted in the described voice conferencing again.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2009100887512A CN101605187A (en) | 2009-07-10 | 2009-07-10 | The method of control voice quality in Conference server, user terminal and the voice conferencing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2009100887512A CN101605187A (en) | 2009-07-10 | 2009-07-10 | The method of control voice quality in Conference server, user terminal and the voice conferencing |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101605187A true CN101605187A (en) | 2009-12-16 |
Family
ID=41470729
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2009100887512A Pending CN101605187A (en) | 2009-07-10 | 2009-07-10 | The method of control voice quality in Conference server, user terminal and the voice conferencing |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101605187A (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102572929A (en) * | 2011-12-21 | 2012-07-11 | 华为技术有限公司 | Voice detection method and equipment |
CN102745606A (en) * | 2012-07-12 | 2012-10-24 | 中联重科股份有限公司 | Control equipment, method and system for super lifting device and engineering machinery |
CN102811386A (en) * | 2011-06-01 | 2012-12-05 | 中兴通讯股份有限公司 | Recording device, media server, recording method and system |
CN103455514A (en) * | 2012-06-01 | 2013-12-18 | 腾讯科技(深圳)有限公司 | Updating method and updating device for audio file |
CN103500580A (en) * | 2013-09-23 | 2014-01-08 | 广东威创视讯科技股份有限公司 | Audio mixing processing method and system |
CN103731567A (en) * | 2012-10-11 | 2014-04-16 | 国际商业机器公司 | Method and system for reducing noise in a shared media session |
CN104580776A (en) * | 2015-01-16 | 2015-04-29 | 四川联友电讯技术有限公司 | Telephone conference system and method capable of intelligently shielding strong noise participant based on noise detection |
CN104618613A (en) * | 2015-01-16 | 2015-05-13 | 四川联友电讯技术有限公司 | Method for telephone conference system presenter to shield high-noise participants |
CN106170977A (en) * | 2014-05-08 | 2016-11-30 | 优倍快网络公司 | Telephone system and communication means |
CN107302640A (en) * | 2017-06-08 | 2017-10-27 | 携程旅游信息技术(上海)有限公司 | Videoconference control system and its control method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1549035A1 (en) * | 2003-08-06 | 2005-06-29 | Polycom, Inc. | Method and apparatus for improving nuisance signals in adio/video conference |
CN1798214A (en) * | 2004-12-14 | 2006-07-05 | 阿尔卡特公司 | Enhanced ip-voice conferencing |
CN101119533A (en) * | 2006-08-02 | 2008-02-06 | 中兴通讯股份有限公司 | Method for organizing telephone conference |
-
2009
- 2009-07-10 CN CNA2009100887512A patent/CN101605187A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1549035A1 (en) * | 2003-08-06 | 2005-06-29 | Polycom, Inc. | Method and apparatus for improving nuisance signals in adio/video conference |
CN1798214A (en) * | 2004-12-14 | 2006-07-05 | 阿尔卡特公司 | Enhanced ip-voice conferencing |
CN101119533A (en) * | 2006-08-02 | 2008-02-06 | 中兴通讯股份有限公司 | Method for organizing telephone conference |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102811386A (en) * | 2011-06-01 | 2012-12-05 | 中兴通讯股份有限公司 | Recording device, media server, recording method and system |
CN102572929A (en) * | 2011-12-21 | 2012-07-11 | 华为技术有限公司 | Voice detection method and equipment |
CN102572929B (en) * | 2011-12-21 | 2014-11-05 | 华为技术有限公司 | Voice detection method and equipment |
CN103455514A (en) * | 2012-06-01 | 2013-12-18 | 腾讯科技(深圳)有限公司 | Updating method and updating device for audio file |
CN102745606A (en) * | 2012-07-12 | 2012-10-24 | 中联重科股份有限公司 | Control equipment, method and system for super lifting device and engineering machinery |
CN102745606B (en) * | 2012-07-12 | 2014-12-24 | 中联重科股份有限公司 | Control equipment, method and system for super lifting device and engineering machinery |
CN103731567A (en) * | 2012-10-11 | 2014-04-16 | 国际商业机器公司 | Method and system for reducing noise in a shared media session |
CN103500580A (en) * | 2013-09-23 | 2014-01-08 | 广东威创视讯科技股份有限公司 | Audio mixing processing method and system |
CN106170977A (en) * | 2014-05-08 | 2016-11-30 | 优倍快网络公司 | Telephone system and communication means |
CN104580776A (en) * | 2015-01-16 | 2015-04-29 | 四川联友电讯技术有限公司 | Telephone conference system and method capable of intelligently shielding strong noise participant based on noise detection |
CN104618613A (en) * | 2015-01-16 | 2015-05-13 | 四川联友电讯技术有限公司 | Method for telephone conference system presenter to shield high-noise participants |
CN107302640A (en) * | 2017-06-08 | 2017-10-27 | 携程旅游信息技术(上海)有限公司 | Videoconference control system and its control method |
CN107302640B (en) * | 2017-06-08 | 2019-10-01 | 携程旅游信息技术(上海)有限公司 | Videoconference control system and its control method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101605187A (en) | The method of control voice quality in Conference server, user terminal and the voice conferencing | |
WO2016184118A1 (en) | Method and device for realizing multimedia conference | |
US8218534B2 (en) | VoIP anomaly traffic detection method with flow-level data | |
CN103067217B (en) | A kind of indication mechanism of communications network service quality and method | |
WO2005067277A3 (en) | Speaker identification during telephone conferencing | |
CN104410974B (en) | A kind of method and system that prompting message is sent to fraudulent call | |
WO2008137373A1 (en) | Media detection and packet distribution in a multipoint conference | |
CN109413721A (en) | Configuration, detection method, the network equipment and the terminal of wake-up signal detection time | |
US20080189108A1 (en) | Text messaging in a telephony network | |
CN103179270B (en) | Mobile phone power-off or exceed the reminding method of service area in call | |
WO2006133337A2 (en) | Call logging and call logging notification at telecommunications service provider gateway | |
US7991919B2 (en) | Device, method and system for detecting unwanted conversational media session | |
CN103188411A (en) | VOIP telephone real-time monitoring system and monitoring method based on recording | |
CN107846520B (en) | Single-pass detection method and device | |
CN101674382B (en) | Notification of dropped audio in a teleconference call | |
CN104579710A (en) | Method for conference member to issue voice information in fragmentation asynchronous conference system | |
CN101488870B (en) | Method, system and equipment for implementing sound mixing | |
WO2007130995A2 (en) | Methods and apparatuses for processing audio streams for use with multiple devices | |
CN101287029A (en) | Method and apparatus for automatically respond to detection | |
CN100569003C (en) | Abnormal pull-off network detecting method | |
CN101998426A (en) | Handshaking signal processing method of voice assessment algorithm in voice test system | |
CN108712407A (en) | A kind of audio/video live broadcasting method and its system based on browser | |
CN100484175C (en) | Method and system of implementing report of current speaker during conference | |
CN108419124A (en) | A kind of audio-frequency processing method | |
CN104427287B (en) | Data processing method and equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20091216 |