
WO2020125522A1 - Automatic calibration method, device and apparatus for microphone array and storage medium - Google Patents

Info

Publication number
WO2020125522A1
WO2020125522A1 PCT/CN2019/124639 CN2019124639W WO2020125522A1 WO 2020125522 A1 WO2020125522 A1 WO 2020125522A1 CN 2019124639 W CN2019124639 W CN 2019124639W WO 2020125522 A1 WO2020125522 A1 WO 2020125522A1
Authority
WO
WIPO (PCT)
Prior art keywords
channel
audio signal
preset
value
energy value
Prior art date
Application number
PCT/CN2019/124639
Other languages
French (fr)
Chinese (zh)
Inventor
孙铭
Original Assignee
深圳Tcl新技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳Tcl新技术有限公司
Publication of WO2020125522A1

Classifications

    • H: ELECTRICITY
    • H04: ELECTRIC COMMUNICATION TECHNIQUE
    • H04R: LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00: Monitoring arrangements; Testing arrangements
    • H04R29/004: Monitoring arrangements; Testing arrangements for microphones

Definitions

  • The present application relates to the technical field of microphones, and in particular to a method, device, apparatus and storage medium for automatic calibration of a microphone array.
  • Picking up sound with a microphone array is currently the most important far-field sound pickup method: multi-channel synchronous voice data is collected, processed by noise reduction, sound source localization, beamforming and other measures, and then sent to the back-end voice recognition module. The pickup performance of the microphone array is therefore a prerequisite for the quality of the entire far-field speech recognition system.
  • The main purpose of the present application is to provide a method, device and computer-readable storage medium for automatic calibration of a microphone array, aiming to solve the technical problem that, because of individual differences in microphone array hardware, the sound pickup consistency of the microphone channels cannot meet the technical requirements.
  • the present application provides an automatic calibration method of a microphone array, which includes the steps of:
  • the gain value of each channel is adjusted so that the energy value of the audio signal corresponding to each channel satisfies the preset consistency condition.
  • the step of acquiring digital audio data after pre-processing the picked up reference audio signal by each channel of the microphone array includes:
  • Acquire digital audio data after each channel of the microphone performs gain amplification processing and analog-to-digital conversion processing on the picked-up reference audio signal.
  • the step of calculating the energy value of the audio signal corresponding to each channel according to the digital audio data includes:
  • the step of detecting whether the energy value of the audio signal corresponding to each channel meets a preset consistency condition includes:
  • the step of adjusting the gain value of each channel includes:
  • the gain value of the channel to be adjusted is adjusted according to a preset adjustment mode.
  • the step of determining the channel to be adjusted in each channel includes:
  • the step of adjusting the gain value of the channel to be adjusted according to a preset adjustment method includes:
  • the gain value of the channel to be adjusted is reduced by a preset value;
  • the gain value of the channel to be adjusted is increased by a preset value.
  • the method further includes:
  • before the step of adjusting the gain value of each channel, the method further includes:
  • the step of adjusting the gain value of each channel includes:
  • the microphone array automatic calibration device includes:
  • an acquisition module configured to acquire the digital audio data produced after each channel of the microphone array pre-processes the picked-up reference audio signal;
  • a calculation module configured to calculate the energy value of the audio signal corresponding to each channel according to the digital audio data;
  • a detection module configured to detect whether the energy value of the audio signal corresponding to each channel meets a preset consistency condition;
  • an adjustment module configured to adjust the gain value of each channel, if the energy value of the audio signal corresponding to each channel does not satisfy the preset consistency condition, so that the energy value of the audio signal corresponding to each channel meets the preset consistency condition.
  • the present application also provides a microphone array automatic calibration apparatus.
  • the microphone array automatic calibration apparatus includes a memory, a processor, and a microphone array automatic calibration program stored on the memory and executable on the processor; when the program is executed by the processor, the steps of the automatic calibration method for the microphone array described above are implemented.
  • the present application also provides a computer-readable storage medium on which the microphone array automatic calibration program is stored; when the program is executed by a processor, the steps of the microphone array automatic calibration method described above are implemented.
  • This application obtains the digital audio data produced after each channel of the microphone array pre-processes the picked-up reference audio signal; calculates the energy value of the audio signal corresponding to each channel according to the digital audio data; detects whether the energy value of the audio signal corresponding to each channel meets a preset consistency condition; and, if it does not, adjusts the gain value of each channel so that the energy value of the audio signal corresponding to each channel meets the preset consistency condition.
  • In this way, each microphone channel of the microphone array is calibrated automatically, which compensates for the hardware differences between the microphone channels so that the sound pickup of each channel meets the consistency requirements.
  • FIG. 1 is a schematic structural diagram of a hardware operating environment involved in an embodiment of the present application
  • FIG. 2 is a schematic flowchart of a first embodiment of a microphone array automatic proofreading method of the present application.
  • FIG. 3 is a detailed flowchart of step S3 in the embodiment of the automatic calibration method of the microphone array of the present application.
  • To address this, this application provides a solution: acquiring the digital audio data produced after each channel of the microphone array pre-processes the picked-up reference audio signal; calculating the energy value of the audio signal corresponding to each channel according to the digital audio data; detecting whether the energy value of the audio signal corresponding to each channel meets the preset consistency condition; and, if it does not, adjusting the gain value of each channel so that the energy value of the audio signal corresponding to each channel meets the preset consistency condition.
  • Each microphone channel of the microphone array is thereby calibrated automatically, compensating for the hardware differences between the microphone channels so that the sound pickup of each channel meets the consistency requirements.
  • FIG. 1 is a schematic structural diagram of a hardware operating environment involved in a solution of an embodiment of the present application.
  • FIG. 1 is a schematic diagram of the hardware operating environment of the microphone array automatic proofreading device.
  • the microphone array automatic proofreading device may include: a processor 1001, such as a CPU, a network interface 1004, a user interface 1003, a memory 1005, and a communication bus 1002.
  • the communication bus 1002 is used to implement connection communication between these components.
  • the user interface 1003 may include a display and an input unit such as a keyboard, and may optionally also include a standard wired interface and a wireless interface.
  • the network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface).
  • the memory 1005 may be a high-speed RAM memory, or may be a non-volatile memory, such as a disk memory.
  • the memory 1005 may optionally be a storage device independent of the foregoing processor 1001.
  • the microphone array automatic proofreading device may further include a camera, an RF (Radio Frequency) circuit, a sensor, an audio circuit, a WiFi module, and so on.
  • the structure of the automatic microphone array calibration device shown in FIG. 1 does not constitute a limitation on the device, which may include more or fewer components than illustrated, a combination of certain components, or a different arrangement of components.
  • the memory 1005 as a computer storage medium may include an operating system, a network communication module, a user interface module, and a microphone array automatic proofreading program.
  • the network interface 1004 is mainly used to connect to the microphone array device and perform data communication with it; the calibration operator can trigger a calibration instruction through the user interface 1003, so that the microphone array automatic calibration device automatically calibrates the microphone array device according to the calibration instruction.
  • the user interface 1003 can also be used to display voice data and calibration results; and the processor 1001 can be used to call the microphone array automatic calibration program stored in the memory 1005 and perform the following operations:
  • the gain value of each channel is adjusted so that the energy value of the audio signal corresponding to each channel satisfies the preset consistency condition.
  • the step of acquiring digital audio data after pre-processing the picked up reference audio signal by each channel of the microphone array includes:
  • Acquire digital audio data after each channel of the microphone performs gain amplification processing and analog-to-digital conversion processing on the picked-up reference audio signal.
  • the step of calculating the energy value of the audio signal corresponding to each channel according to the digital audio data includes:
  • the step of detecting whether the energy value of the audio signal corresponding to each channel meets a preset consistency condition includes:
  • the step of adjusting the gain value of each channel includes:
  • the gain value of the channel to be adjusted is adjusted according to a preset adjustment mode.
  • the step of determining the channel to be adjusted in each channel includes:
  • the step of adjusting the gain value of the channel to be adjusted according to a preset adjustment method includes:
  • the gain value of the channel to be adjusted is reduced by a preset value;
  • the gain value of the channel to be adjusted is increased by a preset value.
  • the processor 1001 may call the microphone array automatic calibration program stored in the memory 1005, and also perform the following operations:
  • Detecting whether distortion occurs after the channel to be adjusted performs gain amplification on the reference audio signal according to the adjusted gain value;
  • if distortion occurs, outputting a prompt message to modify the preset adjustment mode.
  • the microphone array automatic proofreading method of the present application is mainly applied to the above-mentioned microphone array automatic proofreading device.
  • the proofreading device is used as an executive body to explain the various embodiments.
  • the first embodiment of the automatic calibration method of the microphone array of the present application provides an automatic calibration method of the microphone array. It should be noted that although a logical sequence is shown in the flowchart, in some cases the steps shown or described may be performed in an order different from the one presented here.
  • the automatic calibration method of the microphone array includes:
  • Step S1: Obtain the digital audio data produced after each channel of the microphone array pre-processes the picked-up reference audio signal;
  • the calibration device and the microphone array device are connected by a wired or wireless link for data transmission.
  • the calibration device may include a sound source device.
  • the calibration device controls the sound source device to play a reference audio stream.
  • the reference audio stream is a segment of audio whose energy value is stable enough to meet the stability requirements.
  • the calibration environment is preferably noise-free, so that the calibration result is as good as possible.
  • Each microphone channel of the microphone array converts the reference audio stream into a reference audio signal respectively. This process is similar to the process in which existing microphones convert sound into electrical signals. After picking up the reference audio signal, each channel preprocesses the reference audio signal to obtain digital audio data.
  • the preprocessing may be to perform gain amplification processing on the reference audio signal first, and then perform analog-to-digital conversion processing, or may be other processing procedures capable of converting the reference audio signal into digital audio data.
  • the proofreading device obtains digital audio data after preprocessing the reference audio signal by each channel of the microphone array.
  • step S1 includes:
  • Step S11: Acquire the digital audio data produced after each channel of the microphone array performs gain amplification processing and analog-to-digital conversion processing on the picked-up reference audio signal.
  • After picking up the reference audio signal, each channel of the microphone array performs gain amplification processing on it. At this point, each channel amplifies the reference audio signal with an initial gain value.
  • The initial gain value of each channel can be preset, and the initial gain values of all channels should be the same. Ideally, microphone channels that amplify the same reference audio signal with the same initial gain value would produce audio signals with the same gain amplification characteristics. In practice, the microphone channels of the array differ in hardware, and such differences are difficult to avoid in production. The initial gain value of each channel therefore needs to be adjusted so that the gain amplification characteristics of the channels are consistent.
  • After gain amplification, each channel obtains an amplified audio signal and performs analog-to-digital conversion on it to obtain digital audio data.
  • The analog-to-digital conversion samples and quantizes the signal according to the sampling specifications preset in the microphone array device, such as the sampling rate and sample size.
  • The digital audio data obtained after analog-to-digital conversion consists of the sample values of the sampling points, recorded in sampling order.
  • the proofreading device acquires digital audio data corresponding to each channel from the microphone array device.
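  • The pre-processing of step S11 can be pictured with a minimal sketch. The function name `preprocess`, the normalized input range, the 16-bit sample size and the 1 kHz test tone are all assumptions made for illustration; the application does not fix any of them.

```python
import math

def preprocess(analog_samples, gain, bits=16):
    """Amplify samples (assumed normalized to [-1.0, 1.0]) by `gain`,
    then quantize them to signed integers of the given bit depth."""
    full_scale = 2 ** (bits - 1) - 1
    digital = []
    for s in analog_samples:
        amplified = s * gain
        # Model converter saturation by clamping to the valid input range.
        clamped = max(-1.0, min(1.0, amplified))
        digital.append(round(clamped * full_scale))
    return digital

# Stand-in reference signal: a 1 kHz tone sampled at 16 kHz (assumed values).
reference = [0.1 * math.sin(2 * math.pi * 1000 * n / 16000) for n in range(160)]
channel_data = preprocess(reference, gain=2.0)
```

  • In a real array each channel has its own amplifier and converter, so the same `gain` setting would yield slightly different `channel_data` per channel; that spread is what the later steps measure.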
  • Step S2: Calculate the energy value of the audio signal corresponding to each channel according to the digital audio data;
  • After acquiring the digital audio data corresponding to each channel, the calibration device calculates the energy value of the audio signal corresponding to each channel from that data. Because the reference audio stream has a stable energy value, the amplified audio signals that the channels obtain from the picked-up reference audio signal should also have equal energy values. However, the hardware differences between the microphone channels of the array may cause the energy value of one channel's amplified audio signal to differ too much from those of the other channels. The gain value of each channel therefore needs to be adjusted so that the energy values of the amplified audio signals meet the consistency requirements. The calibration device calculates, from the digital audio data acquired from each channel, the energy value of the audio signal collected by each channel at the same time, which gives the energy value of the audio signal corresponding to each channel.
  • step S2 includes:
  • Step S21: the digital audio data corresponding to each channel is separately framed, where:
  • N is the window length;
  • m is the sampling point on the window;
  • x(m) is the sample value of the sampling point;
  • ω(n-m) is the window function;
  • the short-term average energy is the sum of squares of the sample values in one frame.
  • the proofreading device first performs windowing and framing processing on the digital audio data corresponding to each channel, the window function is a rectangular window, and the frame length can be set in advance according to specific needs.
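  • The formulas referenced in this step did not survive extraction. Under the standard definition of short-time energy, and using the symbols defined above, they would read as follows. The summation limits and the labels (1) and (2) are assumptions of this reconstruction, not text recovered from the filing; formula (2) is the special case of a rectangular window, where ω(n-m) = 1 for 0 ≤ n-m ≤ N-1 and 0 otherwise.

```latex
E_n = \sum_{m=n-N+1}^{n} \bigl[ x(m)\,\omega(n-m) \bigr]^{2} \tag{1}

E_n = \sum_{m=n-N+1}^{n} x(m)^{2} \tag{2}
```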
  • Step S22: Acquire the audio data of a preset frame in the digital audio data corresponding to each channel;
  • After performing windowing and framing processing on the digital audio data corresponding to each channel, the calibration device obtains the audio data of a preset frame from the processed digital audio data of each channel.
  • the preset frame may be the Nth frame of the digital audio data after the framing processing, and N may be set in advance according to specific needs.
  • Step S23: Calculate the energy value of the audio signal corresponding to each channel according to the audio data of the preset frame.
  • After acquiring the audio data of the preset frame of each channel, the calibration device substitutes the audio data of the preset frame into the above formula 2 to obtain the energy value of the audio signal corresponding to each channel.
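  • Steps S21 to S23 can be sketched as follows. Non-overlapping frames and the helper names `frame_signal`, `short_time_energy` and `channel_energy` are assumptions of this sketch; the text only states that the window is rectangular and that the frame length is configurable.

```python
def frame_signal(samples, frame_length):
    """Split samples into consecutive, non-overlapping frames
    (a rectangular window leaves the sample values unchanged)."""
    count = len(samples) // frame_length
    return [samples[i * frame_length:(i + 1) * frame_length] for i in range(count)]

def short_time_energy(frame):
    """Sum of squares of the sample values in one frame."""
    return sum(x * x for x in frame)

def channel_energy(samples, frame_length, frame_index):
    """Energy of the preset frame (zero-based frame_index) of one channel."""
    frames = frame_signal(samples, frame_length)
    return short_time_energy(frames[frame_index])

# Frames of length 2 are [1, 2], [3, 4], [5, 6]; the frame at index 1 is [3, 4].
energy = channel_energy([1, 2, 3, 4, 5, 6], frame_length=2, frame_index=1)
```

  • Using the same `frame_index` for every channel matches the requirement that the compared energy values come from audio collected at the same time.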
  • Step S3: Detect whether the energy value of the audio signal corresponding to each channel meets the preset consistency condition;
  • After the calibration device calculates the energy value of the audio signal corresponding to each channel, it determines whether these energy values meet the preset consistency condition.
  • Ideally, the energy values of the audio signals corresponding to the microphone channels should be exactly the same, so that a far-field speech recognition system that uses the microphone array for pickup achieves the best recognition results. However, due to the individual hardware differences between the microphone channels of the array, the energy value of one channel's amplified audio signal may differ too much from those of the other channels.
  • the preset consistency condition may be that the difference between the energy values of the audio signals corresponding to any two channels is less than a preset energy difference, or that the difference between the energy value of the audio signal corresponding to each channel and the average energy value is less than a preset energy difference.
  • The preset energy difference can be set according to specific needs. The smaller the preset energy difference, the smaller the remaining difference between the energy values of the microphone channels and the better the calibration result, but the amount of computation performed by the calibration device may increase. A reasonable value should therefore be chosen so that calibration does not take too long while the calibration result is still ensured.
  • Step S4: If the energy value of the audio signal corresponding to each channel does not satisfy the preset consistency condition, adjust the gain value of each channel so that the energy value of the audio signal corresponding to each channel satisfies the preset consistency condition.
  • If the calibration device detects that the energy value of the audio signal corresponding to each channel does not satisfy the preset consistency condition, it can adjust the gain value of each channel.
  • Each microphone channel then performs gain amplification processing on the picked-up reference audio signal according to the adjusted gain value, followed by analog-to-digital conversion.
  • The calibration device obtains the digital audio data corresponding to each channel again, calculates and analyzes the energy value of the audio signal corresponding to each channel, and detects whether these energy values meet the preset consistency condition. If they do not, it adjusts again, repeating the loop until the energy values meet the preset consistency condition. Once they do, the gain value of each channel is no longer changed and the calibration is complete.
  • In this embodiment, the digital audio data produced after each channel of the microphone array pre-processes the picked-up reference audio signal is obtained; the energy value of the audio signal corresponding to each channel is calculated according to the digital audio data; it is detected whether the energy value of the audio signal corresponding to each channel satisfies the preset consistency condition; and, if it does not, the gain value of each channel is adjusted so that the energy value of the audio signal corresponding to each channel meets the preset consistency condition. This achieves automatic calibration of each microphone channel of the microphone array and compensates for the hardware differences between the channels, so that the sound pickup of each microphone channel meets the consistency requirements.
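  • The measure, check and adjust loop of steps S1 to S4 can be sketched as below. The callbacks `measure`, `consistent` and `adjust`, the `max_rounds` safety cap, the sensitivity figures and the 0.01 step are hypothetical stand-ins chosen for this toy model; none of them comes from the application.

```python
def calibrate(gains, measure, consistent, adjust, max_rounds=1000):
    """Repeat measure -> check -> adjust until the consistency check passes.

    measure, consistent and adjust are caller-supplied callbacks;
    max_rounds is a safety cap added for this sketch.
    """
    gains = list(gains)
    for _ in range(max_rounds):
        energies = measure(gains)
        if consistent(energies):
            return gains, True  # gains are frozen; calibration is complete
        gains = adjust(gains, energies)
    return gains, False

# Toy model: each channel has a fixed hardware sensitivity (assumed values).
sensitivities = [1.0, 1.2, 0.9]

def measure(gains):
    return [(s * g) ** 2 for s, g in zip(sensitivities, gains)]

def consistent(energies):
    return max(energies) - min(energies) < 0.05

def adjust(gains, energies):
    # Lower the gain of the loudest channel by a small fixed step.
    loudest = energies.index(max(energies))
    new_gains = list(gains)
    new_gains[loudest] -= 0.01
    return new_gains

final_gains, converged = calibrate([1.0, 1.0, 1.0], measure, consistent, adjust)
```

  • In this toy model the quietest channel is never touched, so the other two gains step down until all three energies lie within the tolerance.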
  • step S3 includes:
  • Step S31: Calculate the average of the energy values of the audio signals corresponding to the channels;
  • Step S32: For each channel, calculate the absolute value of the energy difference between the energy value of its audio signal and the average energy value;
  • After calculating the energy value of the audio signal corresponding to each channel, the calibration device calculates the average of these energy values. It then calculates the absolute value of the difference between the energy value of the audio signal corresponding to each channel and the average energy value, that is, the absolute value of the energy difference of each channel.
  • Step S33: Detect whether the absolute value of the energy difference of each channel is less than a preset difference;
  • After calculating the absolute value of the energy difference of each channel, the calibration device detects whether the absolute value of the energy difference of each channel is less than the preset difference.
  • The preset difference can be set according to specific needs. When a high degree of consistency between the channels of the microphone array is required, the preset difference can be set smaller, so that the energy values of the audio signals corresponding to the channels are closer together.
  • Step S34: If the absolute value of the energy difference of each channel is less than the preset difference, it is determined that the energy value of the audio signal corresponding to each channel satisfies the preset consistency condition.
  • If the absolute value of the energy difference of every channel is less than the preset difference, the calibration device determines that the energy value of the audio signal corresponding to each channel meets the preset consistency condition. If the absolute value of the energy difference of at least one channel is not less than the preset difference, the energy value of that channel's audio signal differs too much from those of the other channels, and the calibration device determines that the energy value of the audio signal corresponding to each channel does not meet the preset consistency condition.
  • In this embodiment, the average of the energy values of the audio signals corresponding to the channels is calculated, and it is then detected whether the absolute value of the energy difference between each channel's energy value and the average is less than the preset difference. If so, it is determined that the energy value of the audio signal corresponding to each channel satisfies the preset consistency condition. This allows a quick determination of whether the energy values satisfy the condition.
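  • Steps S31 to S34 amount to a mean-deviation test, sketched here with a hypothetical function name `is_consistent` and made-up energy values.

```python
def is_consistent(energies, preset_difference):
    """True when every channel's energy lies within preset_difference
    of the average energy across channels (steps S31 to S34)."""
    average = sum(energies) / len(energies)                 # step S31
    deviations = [abs(e - average) for e in energies]       # step S32
    return all(d < preset_difference for d in deviations)   # steps S33, S34

close = is_consistent([100, 102, 98, 100], preset_difference=5)
spread = is_consistent([100, 120, 98, 100], preset_difference=5)
```

  • In the second call the average is 104.5, so the channel at 120 deviates by 15.5 and the check fails.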
  • the third embodiment of the microphone array automatic proofreading method of the present application provides a microphone array automatic proofreading method.
  • the step of adjusting the gain value of each channel includes:
  • Step S41: Determine the channel to be adjusted among the channels;
  • The channel to be adjusted, that is, the channel whose gain value is to be adjusted, needs to be determined first.
  • One way is to select the channel with the lowest audio signal energy value as the reference channel, determine the other channels as the channels to be adjusted, and adjust the gain values of those channels. Another is to select the channel with the highest audio signal energy value as the reference channel and determine the other channels as the channels to be adjusted. There may also be a single channel to be adjusted, for example the channel whose energy value has the largest absolute difference from the average energy value.
  • Step S42: Adjust the gain value of the channel to be adjusted according to a preset adjustment mode.
  • The gain value of the channel to be adjusted is adjusted according to a preset adjustment mode, which corresponds to the way the channel to be adjusted was determined. For example, when the channel with the lowest audio signal energy value is used as the reference channel and the other channels are determined as the channels to be adjusted, the gain values of the other channels are reduced by a preset value, which can be set according to specific needs. The preset value should not be too large: otherwise each adjustment changes the energy value of the channel's audio signal by too much, and the energy values of the channels may never satisfy the preset consistency condition.
  • The preset value can also be set to correspond to the magnitude of the energy difference: the larger the difference between the energy value of the audio signal of the channel to be adjusted and that of the reference channel, the larger the corresponding preset value; the smaller the difference, the smaller the corresponding preset value.
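  • The lowest-energy reference variant, with a step that grows with the energy difference, can be sketched as follows. The linear mapping `scale * difference` and the `max_step` cap are assumptions of this sketch; the text only requires that a larger difference corresponds to a larger preset value.

```python
def adjust_toward_lowest(gains, energies, scale=0.05, max_step=0.1):
    """Use the lowest-energy channel as the reference and reduce the gain
    of every other channel by a step proportional to its energy excess."""
    reference = min(range(len(energies)), key=energies.__getitem__)
    new_gains = list(gains)
    for channel, energy in enumerate(energies):
        if channel == reference:
            continue  # the reference channel keeps its gain
        difference = energy - energies[reference]
        step = min(max_step, scale * difference)  # hypothetical mapping
        new_gains[channel] = gains[channel] - step
    return new_gains

new_gains = adjust_toward_lowest([1.0, 1.0, 1.0], [1.0, 1.44, 0.81])
```

  • Here the third channel is the reference, and the second channel, which is furthest above it, receives the largest reduction.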
  • step S41 includes:
  • Step S411: Determine the channel with the largest absolute value of the energy difference among all channels as the channel to be adjusted;
  • The channel to be adjusted may be determined as follows: after the calibration device calculates, for each channel, the absolute value of the energy difference between the energy value of its audio signal and the average energy value, it compares these absolute values and determines the channel with the largest one as the channel to be adjusted. If several channels share the same largest absolute value, either all of them are determined as channels to be adjusted, or one of them is selected as the channel to be adjusted.
  • Step S42 includes:
  • Step S421: Determine whether the energy value of the audio signal of the channel to be adjusted is greater than the average energy value;
  • Step S422: If the energy value of the audio signal of the channel to be adjusted is greater than the average energy value, reduce the gain value of the channel to be adjusted by a preset value;
  • Step S423: If the energy value of the audio signal of the channel to be adjusted is less than the average energy value, increase the gain value of the channel to be adjusted by a preset value.
  • After the calibration device determines the channel with the largest absolute value of the energy difference as the channel to be adjusted, it determines whether the energy value of the audio signal of that channel is greater than the average energy value. If it is, the gain value of the channel to be adjusted is reduced by a preset value.
  • The preset value is as described above and can be set according to specific needs. If the energy value of the audio signal of the channel to be adjusted is less than the average energy value, the gain value of the channel to be adjusted is increased by a preset value.
  • In this way, the audio signal energy value of each channel can quickly reach the consistency requirement, which speeds up calibration by the calibration device.
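  • Steps S411 and S421 to S423 can be sketched as a single adjustment pass. The function name `adjust_largest_deviation` is hypothetical, and ties are resolved by taking the first such channel, which is one of the two options the text allows.

```python
def adjust_largest_deviation(gains, energies, preset_value):
    """Pick the channel whose energy is furthest from the average (S411)
    and move its gain toward the average by preset_value (S421 to S423)."""
    average = sum(energies) / len(energies)
    target = max(range(len(energies)), key=lambda ch: abs(energies[ch] - average))
    new_gains = list(gains)
    if energies[target] > average:
        new_gains[target] -= preset_value   # step S422: too loud, reduce gain
    elif energies[target] < average:
        new_gains[target] += preset_value   # step S423: too quiet, raise gain
    return new_gains, target

gains_after, adjusted_channel = adjust_largest_deviation(
    [1.0, 1.0, 1.0], [1.0, 1.44, 0.81], preset_value=0.01
)
```

  • With these made-up energies the second channel deviates most from the average and lies above it, so its gain is the one reduced.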
  • After step S42, the method further includes:
  • Step S43: Detect whether distortion occurs after the channel to be adjusted performs gain amplification processing on the reference audio signal according to the adjusted gain value;
  • After the calibration device adjusts the gain value of the channel to be adjusted according to the preset adjustment mode, the channel to be adjusted performs gain amplification processing and analog-to-digital conversion on the picked-up reference audio signal with the adjusted gain value.
  • At this point, the calibration device may first detect whether distortion occurs after the channel to be adjusted performs gain amplification processing on the picked-up reference audio signal according to the adjusted gain value. Because the gain value is adjusted according to the preset adjustment mode, the preset value in that mode may have been set too large, making the adjusted gain value too large and distorting the audio signal after gain amplification. A distorted audio signal cannot serve as the basis for recognition by the downstream speech recognition module, so an undistorted audio signal is a necessary condition.
  • Step S44: If it is detected that distortion occurs after the channel to be adjusted performs gain amplification processing on the reference audio signal according to the adjusted gain value, output a prompt message to modify the preset adjustment mode.
  • If distortion is detected, the calibration device may output a prompt message prompting the operation and maintenance personnel to modify the preset adjustment mode, that is, to modify the preset value in the preset adjustment mode so that the adjusted gain value is not so large that it distorts the audio signal.
  • If the calibration device detects that no distortion occurs after the channel to be adjusted performs gain amplification on the picked-up reference audio signal according to the adjusted gain value, it acquires the digital audio data corresponding to each channel again, calculates and analyzes the energy value of the audio signal corresponding to each channel, and detects whether these energy values meet the preset consistency condition. If they do not, it adjusts again, repeating the loop until the energy values of the audio signals corresponding to all channels meet the preset consistency condition.
  • the distortion of the audio signal during the calibration process can be avoided to ensure the accuracy of the microphone array calibration.
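  • The text does not say how distortion is detected. One common proxy, shown here purely as an assumption, is to treat hard clipping as distortion: if more than a small fraction of samples sit at the converter's full-scale value, the gain is probably too high. The function name `is_clipped`, the 16-bit full-scale default, and the ratio threshold are hypothetical choices for this sketch.

```python
def is_clipped(samples, full_scale=32767, max_clipped_ratio=0.001):
    """Return True if the share of full-scale samples exceeds max_clipped_ratio.

    Assumes signed integer PCM samples (16-bit by default). Clipping is used
    here as a stand-in for "distortion"; the source does not define the test.
    """
    if not samples:
        return False
    clipped = sum(1 for s in samples if abs(s) >= full_scale)
    return clipped / len(samples) > max_clipped_ratio
```

  • A caller could run this check on the digital audio data of the adjusted channel and, when it returns True, output the prompt to modify the preset adjustment mode instead of continuing the loop.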
  • A fourth embodiment of the automatic microphone array calibration method of the present application is provided.
  • Before the step of adjusting the gain value of each channel, the method further includes:
  • Step S51: detect whether the current number of adjustments made to the channel gain values is greater than a preset number;
  • The calibration device can first detect whether the number of adjustments made to the gain values of the channels is greater than the preset number.
  • The calibration device can maintain a counter that records the number of gain adjustments; the counter is incremented by one each time the gain value of a channel is adjusted.
  • The preset number can be set according to specific needs.
  • Step S52: if the current number of adjustments made to the channel gain values is greater than the preset number, the step of adjusting the gain value of each channel includes: performing a preset error reporting operation.
  • If the calibration device detects that the current number of adjustments made to the channel gain values is greater than the preset number, it no longer adjusts the channel gain values, suspends calibration, and performs the preset error reporting operation.
  • The preset error reporting operation may be outputting prompt information that tells the operation and maintenance personnel that calibration has been attempted too many times, or emitting an error tone. Through the error reporting operation performed by the calibration device, the operation and maintenance personnel can be prompted to inspect or repair faults in the microphone array device.
  • When the number of adjustments exceeds the preset number, performing the preset error reporting operation exposes hardware problems in the microphone array device so that the operation and maintenance personnel can inspect or repair it.
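  • The counter and the error reporting step fit naturally into a bounded calibration loop. The sketch below is an illustration under stated assumptions: `measure` is a hypothetical callback that returns the per-channel energy values for a given list of gains, `CalibrationError` is a hypothetical exception standing in for the preset error reporting operation, and `max_diff` and `max_adjustments` stand in for the preset difference and preset number.

```python
class CalibrationError(RuntimeError):
    """Stands in for the preset error reporting operation (hypothetical)."""


def calibrate(measure, gains, step, max_diff, max_adjustments):
    """Adjust gains until all channel energies lie within max_diff of their mean.

    Raises CalibrationError once the adjustment counter exceeds max_adjustments.
    """
    gains = list(gains)
    adjustments = 0  # counter incremented on every gain adjustment
    while True:
        energies = measure(gains)
        mean_energy = sum(energies) / len(energies)
        diffs = [abs(e - mean_energy) for e in energies]
        if all(d < max_diff for d in diffs):
            return gains  # consistency condition satisfied
        # Checked before adjusting, as in step S51.
        if adjustments > max_adjustments:
            raise CalibrationError("too many calibration attempts")
        target = diffs.index(max(diffs))
        gains[target] += -step if energies[target] > mean_energy else step
        adjustments += 1
```

  • Raising an exception keeps the loop from running forever on faulty hardware; a real device might instead show a message or emit an error tone, as described above.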
  • An embodiment of the present application also provides an automatic calibration device for a microphone array.
  • The automatic calibration device for a microphone array includes:
  • an acquisition module configured to acquire digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal;
  • a calculation module configured to calculate the audio signal energy value corresponding to each channel according to the digital audio data;
  • a detection module configured to detect whether the audio signal energy value corresponding to each channel satisfies a preset consistency condition;
  • an adjustment module configured to, if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, adjust the gain value of each channel so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
  • The acquisition module is further configured to acquire digital audio data obtained after each microphone channel performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
  • The calculation module includes:
  • a framing unit configured to divide the digital audio data corresponding to each channel into frames;
  • an obtaining unit configured to obtain the audio data of a preset frame from the digital audio data corresponding to each channel;
  • a first calculation unit configured to calculate the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
  • The detection module includes:
  • a second calculation unit configured to calculate the average energy value of the audio signal energy values corresponding to the channels, and further configured to calculate, for each channel, the absolute value of the energy difference between that channel's audio signal energy value and the average energy value;
  • a detection unit configured to detect whether the absolute value of the energy difference of each channel is less than a preset difference;
  • a first determining unit configured to determine that the audio signal energy values corresponding to the channels satisfy the preset consistency condition if the absolute value of the energy difference of every channel is less than the preset difference.
  • The adjustment module includes:
  • a second determining unit configured to determine the channel to be adjusted among the channels;
  • an adjustment unit configured to adjust the gain value of the channel to be adjusted according to a preset adjustment mode.
  • The second determining unit is further configured to determine the channel with the largest absolute value of the energy difference among all channels as the channel to be adjusted;
  • the adjustment unit is further configured to determine whether the audio signal energy value of the channel to be adjusted is greater than the average energy value; if it is greater than the average energy value, to reduce the gain value of the channel to be adjusted by a preset value; and if it is less than the average energy value, to increase the gain value of the channel to be adjusted by a preset value.
  • The detection module is further configured to detect whether distortion occurs after the channel to be adjusted performs gain amplification on the reference audio signal according to the adjusted gain value;
  • the automatic calibration device for the microphone array further includes:
  • an output module configured to output a prompt message to modify the preset adjustment mode if distortion occurs after the channel to be adjusted performs gain amplification on the reference audio signal according to the adjusted gain value.
  • The detection module is further configured to detect whether the current number of adjustments made to the channel gain values is greater than a preset number;
  • the automatic calibration device for the microphone array further includes:
  • an error reporting module configured to perform a preset error reporting operation if the current number of adjustments made to the channel gain values is greater than the preset number.
  • The embodiments of the microphone array automatic calibration device are basically the same as the embodiments of the microphone array automatic calibration method described above, and details are not repeated here.
  • An embodiment of the present application also provides a computer-readable storage medium storing a microphone array automatic calibration program which, when executed by a processor, implements the steps of the microphone array automatic calibration method described above.
  • The specific implementations of the microphone array automatic calibration equipment and the storage medium (i.e., the computer-readable storage medium) of the present application are basically the same as the above embodiments of the microphone array automatic calibration method, and details are not repeated here.
  • The methods in the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and can of course also be implemented by hardware, but in many cases the former is the better implementation.
  • The technical solution of the present application, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium as described above (such as ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal device (which may be a mobile phone, computer, server, air conditioner, network device, etc.) to perform the methods described in the embodiments of the present application.

Landscapes

  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The present application discloses an automatic calibration method, device and apparatus for a microphone array, and a storage medium. The method comprises the following steps: obtaining digital audio data after each channel of a microphone array preprocesses a picked-up reference audio signal; calculating the audio signal energy value corresponding to each channel according to the digital audio data; detecting whether the audio signal energy values satisfy a preset consistency condition; and, if the audio signal energy values do not satisfy the preset consistency condition, adjusting the gain values of the channels so that the audio signal energy values satisfy the preset consistency condition.

Description

Microphone array automatic calibration method, device, equipment and storage medium
This application claims priority to the Chinese patent application filed with the Chinese Patent Office on December 17, 2018, with application number 201811542125.1 and titled "Microphone array automatic calibration method, device, equipment and storage medium", the entire content of which is incorporated herein by reference.
Technical field
The present application relates to the technical field of microphones, and in particular to a microphone array automatic calibration method, device, equipment and storage medium.
Background
With the development of artificial intelligence technology, intelligent voice interaction is used in more and more scenarios, and the requirements for far-field speech recognition are becoming higher and higher. Microphone array pickup is currently the main far-field pickup method: voice data is collected synchronously over multiple channels, processed with noise reduction, sound source localization, beamforming and other measures, and then sent to the back-end speech recognition module. The pickup performance of the microphone array is therefore a prerequisite for the quality of the entire far-field speech recognition system. Microphone arrays currently come in different spatial layouts, such as linear and circular, but in every case the pickup performance of all microphones, chiefly the pickup gain, must be consistent. Because of individual differences in microphone array hardware, however, the pickup consistency of the microphone channels does not meet the technical requirements, which biases the algorithms of the back-end speech recognition module and degrades far-field pickup and speech recognition.
Summary of the invention
The main purpose of the present application is to provide a microphone array automatic calibration method, equipment, and computer-readable storage medium, aiming to solve the technical problem that the pickup consistency of the microphone channels cannot meet the technical requirements because of individual differences in microphone array hardware.
To achieve the above purpose, the present application provides a microphone array automatic calibration method, which includes the steps of:
acquiring digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal;
calculating the audio signal energy value corresponding to each channel according to the digital audio data;
detecting whether the audio signal energy value corresponding to each channel satisfies a preset consistency condition;
if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, adjusting the gain value of each channel so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
Optionally, the step of acquiring digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal includes:
acquiring digital audio data obtained after each microphone channel performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
Optionally, the step of calculating the audio signal energy value corresponding to each channel according to the digital audio data includes:
dividing the digital audio data corresponding to each channel into frames;
obtaining the audio data of a preset frame from the digital audio data corresponding to each channel;
calculating the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
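The framing and energy steps above can be sketched as follows. This passage does not define the energy formula, so the sum of squared samples is used here as an assumption; the function names, the non-overlapping framing, and the parameters `frame_len` and `frame_index` (the preset frame) are likewise hypothetical.

```python
def split_frames(samples, frame_len):
    """Split samples into consecutive, non-overlapping frames, dropping a short tail."""
    last_start = len(samples) - frame_len
    return [samples[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]


def frame_energy(frame):
    """Energy of one frame, assumed here to be the sum of squared samples."""
    return sum(s * s for s in frame)


def channel_energies(channels, frame_len, frame_index):
    """Energy of the same preset frame for every channel, in channel order."""
    return [frame_energy(split_frames(ch, frame_len)[frame_index]) for ch in channels]
```

Using the same frame index for every channel means the energies are compared over the same stretch of the reference signal.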
Optionally, the step of detecting whether the audio signal energy value corresponding to each channel satisfies a preset consistency condition includes:
calculating the average energy value of the audio signal energy values corresponding to the channels;
calculating, for each channel, the absolute value of the energy difference between that channel's audio signal energy value and the average energy value;
detecting whether the absolute value of the energy difference of each channel is less than a preset difference;
if the absolute value of the energy difference of every channel is less than the preset difference, determining that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
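The consistency condition above reduces to a short check. The function name `is_consistent` and the parameter `max_diff` (the preset difference) are names chosen for this sketch.

```python
def is_consistent(energies, max_diff):
    """True if every channel's energy is within max_diff of the average energy."""
    mean_energy = sum(energies) / len(energies)
    return all(abs(e - mean_energy) < max_diff for e in energies)
```

For example, with energies `[10.0, 10.0, 13.0]` the average is 11.0 and the third channel deviates by 2.0, so a preset difference of 1.0 is not met and the gains would be adjusted.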
Optionally, the step of adjusting the gain value of each channel includes:
determining the channel to be adjusted among the channels;
adjusting the gain value of the channel to be adjusted according to a preset adjustment mode.
Optionally, the step of determining the channel to be adjusted among the channels includes:
determining the channel with the largest absolute value of the energy difference among all channels as the channel to be adjusted;
the step of adjusting the gain value of the channel to be adjusted according to a preset adjustment mode includes:
determining whether the audio signal energy value of the channel to be adjusted is greater than the average energy value;
if the audio signal energy value of the channel to be adjusted is greater than the average energy value, reducing the gain value of the channel to be adjusted by a preset value;
if the audio signal energy value of the channel to be adjusted is less than the average energy value, increasing the gain value of the channel to be adjusted by a preset value.
Optionally, after the step of adjusting the gain value of the channel to be adjusted according to a preset adjustment mode, the method further includes:
detecting whether distortion occurs after the channel to be adjusted performs gain amplification on the reference audio signal according to the adjusted gain value;
if it is detected that distortion occurs after the channel to be adjusted performs gain amplification on the reference audio signal according to the adjusted gain value, outputting a prompt message to modify the preset adjustment mode.
Optionally, before the step of adjusting the gain value of each channel, the method further includes:
detecting whether the current number of adjustments made to the channel gain values is greater than a preset number;
if the current number of adjustments made to the channel gain values is greater than the preset number, the step of adjusting the gain value of each channel includes:
performing a preset error reporting operation.
In addition, to achieve the above purpose, the present application also provides a microphone array automatic calibration device, which includes:
an acquisition module configured to acquire digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal;
a calculation module configured to calculate the audio signal energy value corresponding to each channel according to the digital audio data;
a detection module configured to detect whether the audio signal energy value corresponding to each channel satisfies a preset consistency condition;
an adjustment module configured to, if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, adjust the gain value of each channel so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
In addition, to achieve the above purpose, the present application also provides microphone array automatic calibration equipment, which includes a memory, a processor, and a microphone array automatic calibration program stored in the memory and executable on the processor. When executed by the processor, the microphone array automatic calibration program implements the steps of the microphone array automatic calibration method described above.
In addition, to achieve the above purpose, the present application also provides a computer-readable storage medium storing a microphone array automatic calibration program which, when executed by a processor, implements the steps of the microphone array automatic calibration method described above.
The present application acquires the digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal; calculates the audio signal energy value corresponding to each channel according to the digital audio data; detects whether the audio signal energy values corresponding to the channels satisfy a preset consistency condition; and, if they do not, adjusts the gain value of each channel so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition. This achieves automatic calibration of each microphone channel of the microphone array and compensates for hardware differences between the microphone channels, so that the pickup of each microphone channel meets the consistency requirement.
Brief description of the drawings
FIG. 1 is a schematic structural diagram of the hardware operating environment involved in the embodiments of the present application;
FIG. 2 is a schematic flowchart of the first embodiment of the microphone array automatic calibration method of the present application;
FIG. 3 is a detailed flowchart of step S3 in an embodiment of the microphone array automatic calibration method of the present application.
The realization of the purpose, the functional characteristics, and the advantages of the present application will be further described with reference to the drawings in conjunction with the embodiments.
Detailed description
It should be understood that the specific embodiments described here are only used to explain the present application and are not intended to limit it.
To address the technical problem that the pickup consistency of the microphone channels cannot meet the technical requirements because of individual differences in microphone array hardware, the present application provides a solution: acquire the digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal; calculate the audio signal energy value corresponding to each channel according to the digital audio data; detect whether the audio signal energy values corresponding to the channels satisfy a preset consistency condition; and, if they do not, adjust the gain value of each channel so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition. This achieves automatic calibration of each microphone channel of the microphone array and compensates for hardware differences between the microphone channels, so that the pickup of each microphone channel meets the consistency requirement.
The present application provides microphone array automatic calibration equipment. Referring to FIG. 1, FIG. 1 is a schematic structural diagram of the hardware operating environment involved in the embodiments of the present application.
It should be noted that FIG. 1 can serve as a schematic structural diagram of the hardware operating environment of the microphone array automatic calibration equipment.
As shown in FIG. 1, the microphone array automatic calibration equipment may include a processor 1001 (for example, a CPU), a network interface 1004, a user interface 1003, a memory 1005, and a communication bus 1002. The communication bus 1002 is used for connection and communication between these components. The user interface 1003 may include a display and an input unit such as a keyboard; optionally, the user interface 1003 may also include a standard wired interface and a wireless interface. The network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface). The memory 1005 may be a high-speed RAM or a non-volatile memory, such as a disk memory. The memory 1005 may optionally also be a storage device independent of the processor 1001.
Optionally, the microphone array automatic calibration equipment may further include a camera, an RF (Radio Frequency) circuit, sensors, an audio circuit, a WiFi module, and so on. Those skilled in the art will understand that the structure shown in FIG. 1 does not limit the microphone array automatic calibration equipment, which may include more or fewer components than shown, combine certain components, or arrange the components differently.
As shown in FIG. 1, the memory 1005, as a computer storage medium, may include an operating system, a network communication module, a user interface module, and a microphone array automatic calibration program.
In the microphone array automatic calibration equipment shown in FIG. 1, the network interface 1004 is mainly used to connect to the microphone array device and exchange data with it. Calibration personnel can trigger a calibration instruction through the user interface 1003 so that the equipment automatically calibrates the microphone array device according to that instruction; the user interface 1003 can also be used to display voice data and calibration results. The processor 1001 can be used to call the microphone array automatic calibration program stored in the memory 1005 and perform the following operations:
acquiring digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal;
calculating the audio signal energy value corresponding to each channel according to the digital audio data;
detecting whether the audio signal energy value corresponding to each channel satisfies a preset consistency condition;
if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, adjusting the gain value of each channel so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
Further, the step of acquiring digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal includes:
acquiring digital audio data obtained after each microphone channel performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
Further, the step of calculating the audio signal energy value corresponding to each channel according to the digital audio data includes:
dividing the digital audio data corresponding to each channel into frames;
obtaining the audio data of a preset frame from the digital audio data corresponding to each channel;
calculating the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
Further, the step of detecting whether the audio signal energy value corresponding to each channel satisfies a preset consistency condition includes:
calculating the average energy value of the audio signal energy values corresponding to the channels;
calculating, for each channel, the absolute value of the energy difference between that channel's audio signal energy value and the average energy value;
detecting whether the absolute value of the energy difference of each channel is less than a preset difference;
if the absolute value of the energy difference of every channel is less than the preset difference, determining that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
Further, the step of adjusting the gain value of each channel includes:
determining the channel to be adjusted among the channels;
adjusting the gain value of the channel to be adjusted according to a preset adjustment mode.
Further, the step of determining the channel to be adjusted among the channels includes:
determining the channel with the largest absolute value of the energy difference among all channels as the channel to be adjusted;
the step of adjusting the gain value of the channel to be adjusted according to a preset adjustment mode includes:
determining whether the audio signal energy value of the channel to be adjusted is greater than the average energy value;
if the audio signal energy value of the channel to be adjusted is greater than the average energy value, reducing the gain value of the channel to be adjusted by a preset value;
if the audio signal energy value of the channel to be adjusted is less than the average energy value, increasing the gain value of the channel to be adjusted by a preset value.
Further, after the step of adjusting the gain value of the channel to be adjusted according to the preset adjustment mode, the processor 1001 may call the automatic microphone array calibration program stored in the memory 1005 and further perform the following operations:
detecting whether distortion occurs after the channel to be adjusted performs gain amplification on the reference audio signal with the adjusted gain value;
if distortion is detected after the channel to be adjusted performs gain amplification on the reference audio signal with the adjusted gain value, outputting a prompt message to modify the preset adjustment mode.
Based on the above hardware structure, the embodiments of the automatic microphone array calibration method of the present application are proposed. The method is mainly applied to the automatic microphone array calibration device described above; for ease of description, the following embodiments are described with the calibration device as the executing entity.
Referring to FIG. 2, a first embodiment of the present application provides an automatic calibration method for a microphone array. It should be noted that although a logical order is shown in the flowchart, in some cases the steps shown or described may be performed in an order different from the one given here. The automatic calibration method for a microphone array includes:
Step S1: acquiring the digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal;
The calibration device and the microphone array device are connected in a wired or wireless manner for data transmission. The calibration device may include a sound source device, which it controls to play a reference audio stream, that is, an audio stream whose energy stability meets a stability requirement. Calibration is preferably performed in a noise-free environment so that the calibration effect is optimal. Each microphone channel of the microphone array converts the reference audio stream into a reference audio signal, in a process similar to that by which existing microphones convert sound into electrical signals. After picking up the reference audio signal, each channel preprocesses it to obtain digital audio data. The preprocessing may consist of gain amplification followed by analog-to-digital conversion, or of any other process capable of converting the reference audio signal into digital audio data. The calibration device acquires the digital audio data that each channel of the microphone array obtains by preprocessing the reference audio signal.
Further, step S1 includes:
Step S11: acquiring the digital audio data obtained after each channel of the microphone array performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
After picking up the reference audio signal, each channel of the microphone array performs gain amplification on it. At this point each channel amplifies the reference audio signal with an initial gain value; the initial gain values can be set in advance and should be the same for every channel. Ideally, microphone channels that amplify the same reference audio signal with the same initial gain value should produce audio signals with the same amplification characteristics. However, the microphone channels of the array differ individually in hardware, and such differences are difficult to avoid when the hardware is manufactured. The initial gain value of each channel therefore needs to be adjusted so that the amplification characteristics of the channels are consistent. After amplifying the reference audio signal, each channel performs analog-to-digital conversion on the amplified audio signal to obtain digital audio data. The analog-to-digital conversion samples and quantizes the signal according to sampling specifications preset in the microphone array device, such as the sampling rate and sample size, and the resulting digital audio data consists of the sample value of each sampling point, recorded in sampling order. The calibration device acquires the digital audio data corresponding to each channel from the microphone array device.
Step S2: calculating the audio signal energy value corresponding to each channel according to the digital audio data;
After acquiring the digital audio data corresponding to each channel, the calibration device calculates the audio signal energy value of each channel from that data. Because the reference audio stream is an audio stream whose energy stability meets the stability requirement, the amplified audio signals that the channels obtain from the picked-up reference audio signal should also have equal energy values. However, owing to the individual hardware differences between the microphone channels, the energy value of the amplified audio signal of one channel may differ too much from that of the other channels. The gain value of each channel therefore needs to be adjusted so that the energy values of the amplified audio signals meet the consistency requirement. Based on the digital audio data acquired from each channel, the calibration device calculates the energy value of the audio signal collected by each channel at the same moment, which gives the audio signal energy value corresponding to each channel.
Further, step S2 includes:
Step S21: performing framing processing on the digital audio data corresponding to each channel;
The short-time average energy $E_n$ of a speech signal at time n is defined as:
Formula 1:
$$E_n = \sum_{m=n-N+1}^{n} \left[ x(m)\,\omega(n-m) \right]^2$$
where N is the window length, m indexes the sampling points within the window, x(m) is the sample value at sampling point m, and ω(n-m) is the window function. The short-time average energy is thus the weighted sum of squares of the sample values of one frame. In particular, when the window function is a rectangular window, Formula 2 holds:
$$E_n = \sum_{m=n-N+1}^{n} x^2(m)$$
In this case the short-time average energy is the sum of squares of the sample values of one frame. The calibration device first applies windowing and framing to the digital audio data corresponding to each channel; the window function is a rectangular window, and the frame length can be set in advance as required.
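Formulas 1 and 2 can be sketched in Python as follows. This is a minimal illustration, assuming Formula 1 has the standard form E_n = Σ [x(m)·ω(n-m)]² taken over the last N samples up to time n, that the samples are held in a list, and that the window is given as a list of N weights. The function names are illustrative and do not come from the application.

```python
def short_time_energy(x, n, window):
    """Short-time energy E_n of signal x at time n (Formula 1).

    window[k] plays the role of w(k) for k = 0..N-1, so the sample x[m]
    is weighted by w(n - m) for m = n-N+1 .. n.
    """
    N = len(window)
    if n - N + 1 < 0 or n >= len(x):
        raise ValueError("window does not fit inside the signal at time n")
    return sum((x[m] * window[n - m]) ** 2 for m in range(n - N + 1, n + 1))


def rectangular(N):
    """Rectangular window: every weight is 1, which reduces Formula 1 to Formula 2."""
    return [1.0] * N
```

With a rectangular window the result is simply the sum of squared samples in the frame, which is the case the method uses.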
Step S22: acquiring the audio data of a preset frame from the digital audio data corresponding to each channel;
After applying windowing and framing to the digital audio data corresponding to each channel, the calibration device acquires the audio data of a preset frame from the framed digital audio data of each channel. The preset frame may be the N-th frame of the framed digital audio data, where the frame index N can be set in advance as required.
Step S23: calculating the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
After acquiring the audio data of the preset frame of each channel, the calibration device substitutes that audio data into Formula 2 above to obtain the audio signal energy value corresponding to each channel.
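Steps S21 to S23 can be sketched as below. The sketch assumes non-overlapping frames (the application only says that a rectangular window and a configurable frame length are used) and a zero-based frame index; `split_into_frames` and `channel_energies` are hypothetical names chosen for illustration.

```python
def split_into_frames(samples, frame_len):
    """Step S21: cut samples into consecutive frames of frame_len samples.

    Assumption: frames do not overlap and a trailing partial frame is dropped.
    """
    count = len(samples) // frame_len
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(count)]


def channel_energies(channels, frame_len, frame_index):
    """Steps S22 and S23: energy of the same preset frame in every channel.

    channels is a list with one list of samples per microphone channel.
    The energy is the sum of squared samples (Formula 2).
    """
    energies = []
    for samples in channels:
        frame = split_into_frames(samples, frame_len)[frame_index]
        energies.append(sum(s * s for s in frame))
    return energies
```

Using the same frame index for every channel means the compared energies cover the same stretch of the reference audio.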
Step S3: detecting whether the audio signal energy values corresponding to the channels satisfy a preset consistency condition;
After calculating the audio signal energy value corresponding to each channel, the calibration device determines whether the energy values satisfy the preset consistency condition. Ideally, the audio signal energy values of all microphone channels should be exactly the same, so that a far-field speech recognition system that uses the microphone array for sound pickup achieves the best recognition performance. However, owing to the individual hardware differences between the microphone channels, the energy value of the amplified audio signal of one channel may differ too much from that of the other channels. It is therefore necessary to detect whether the audio signal energy values satisfy the preset consistency condition, that is, whether such an excessive difference has occurred. The preset consistency condition may be that the difference between the audio signal energy values of any two channels is less than a preset energy difference, or that the difference between each channel's audio signal energy value and the average energy value is less than a preset energy difference. The preset energy difference can be set as required. The smaller it is set, the smaller the differences between the channel energy values after calibration and the better the calibration effect, but the amount of computation performed by the calibration device may increase. The preset energy difference should therefore take a reasonable value, so that calibration does not take too long while the calibration effect is still ensured.
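The first form of the consistency condition, in which every pair of channels must differ by less than the preset energy difference, can be checked without comparing all pairs: it holds exactly when the largest and smallest energy values differ by less than the preset value. A minimal sketch, with an illustrative function name:

```python
def pairwise_consistent(energies, preset_energy_diff):
    """True if every pair of channel energies differs by less than the preset value.

    The largest pairwise difference is max - min, so one comparison suffices.
    """
    return max(energies) - min(energies) < preset_energy_diff
```

A tighter preset value yields better-matched channels but typically requires more adjustment rounds before the check passes.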
Step S4: if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, adjusting the gain values of the channels so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
If the calibration device detects that the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, it adjusts the gain values of the channels. Each microphone channel then amplifies the picked-up reference audio signal with its adjusted gain value and performs analog-to-digital conversion. The calibration device again acquires the digital audio data corresponding to each channel, calculates and analyzes the audio signal energy value of each channel, and detects whether the energy values satisfy the preset consistency condition. If they do not, it adjusts the gains again, and this cycle repeats until the audio signal energy values corresponding to the channels are detected to satisfy the preset consistency condition. Once the condition is satisfied, the gain values of the channels are no longer changed and calibration is complete.
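The measure, check and adjust cycle of steps S1 to S4 can be sketched as a loop. The callbacks `measure`, `is_consistent` and `adjust` are placeholders for the device-specific parts and are assumptions of this sketch, as is the `max_rounds` safety bound, which only prevents the example from looping forever.

```python
def calibrate(gains, measure, is_consistent, adjust, max_rounds=1000):
    """Repeat measure -> check -> adjust until the channel energies are consistent.

    measure(gains)          -> list of energy values, one per channel
    is_consistent(energies) -> True when the preset consistency condition holds
    adjust(gains, energies) -> new list of gains
    Returns the final gains and the number of adjustments that were made.
    """
    for rounds in range(max_rounds):
        energies = measure(gains)
        if is_consistent(energies):
            return gains, rounds
        gains = adjust(gains, energies)
    raise RuntimeError("energies did not become consistent within max_rounds")
```

Keeping the three callbacks separate lets the consistency condition and the adjustment mode be swapped independently, which mirrors how the later embodiments refine each step.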
In this embodiment, the digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal is acquired; the audio signal energy value corresponding to each channel is calculated from the digital audio data; whether the audio signal energy values satisfy the preset consistency condition is detected; and if they do not, the gain values of the channels are adjusted so that the audio signal energy values satisfy the preset consistency condition. The microphone channels of the microphone array are thereby calibrated automatically, which compensates for their hardware differences so that the sound pickup of the channels meets the consistency requirement.
Further, based on the first embodiment above, a second embodiment of the present application provides an automatic calibration method for a microphone array. Referring to FIG. 3, in this embodiment step S3 includes:
Step S31: calculating the average energy value of the audio signal energy values corresponding to the channels;
Step S32: calculating, for each channel, the absolute value of the energy difference between that channel's audio signal energy value and the average energy value;
After calculating the audio signal energy value corresponding to each channel, the calibration device calculates the average of these energy values. It then calculates, for each channel, the absolute value of the difference between that channel's audio signal energy value and the average energy value, which gives the absolute value of the energy difference of each channel.
Step S33: detecting whether the absolute value of the energy difference of each channel is less than a preset difference;
After calculating the absolute value of the energy difference of each channel, the calibration device detects whether each of these values is less than the preset difference. The preset difference can be set as required. When a high degree of consistency between the channels of the microphone array is required, the preset difference can be set smaller so that the audio signal energy values of the channels end up closer together.
Step S34: if the absolute values of the energy differences of all channels are less than the preset difference, determining that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
If the absolute values of the energy differences of all channels are detected to be less than the preset difference, the calibration device determines that the audio signal energy values corresponding to the channels satisfy the preset consistency condition. If the absolute value of the energy difference of at least one channel is not less than the preset difference, the audio signal energy value of that channel differs too much from those of the other channels, and the calibration device determines that the audio signal energy values do not satisfy the preset consistency condition.
In this embodiment, the average of the audio signal energy values corresponding to the channels is calculated, and it is then detected whether the absolute value of the energy difference between each channel's audio signal energy value and the average energy value is less than the preset difference. If every such value is less than the preset difference, the audio signal energy values are determined to satisfy the preset consistency condition. This allows a quick determination of whether the audio signal energy values of the channels satisfy the preset consistency condition.
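Steps S31 to S34 amount to a few lines of arithmetic, sketched below. The function name and the choice to return the mean and the deviations alongside the verdict are illustrative assumptions; they are returned only because a later adjustment step would need them.

```python
def mean_consistent(energies, preset_diff):
    """Check the mean-based consistency condition (steps S31 to S34).

    Returns (ok, mean, deviations), where deviations[i] is the absolute
    difference between channel i's energy and the mean energy.
    """
    mean = sum(energies) / len(energies)               # S31
    deviations = [abs(e - mean) for e in energies]     # S32
    ok = all(d < preset_diff for d in deviations)      # S33, S34
    return ok, mean, deviations
```

For example, energies of 9, 10 and 11 have a mean of 10 and deviations of 1, 0 and 1, so they pass with a preset difference of 1.5 and fail with a preset difference of 1.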
Further, based on the first or second embodiment above, a third embodiment of the present application provides an automatic calibration method for a microphone array. In this embodiment, the step of adjusting the gain values of the channels includes:
Step S41: determining a channel to be adjusted among the channels;
When the calibration device detects that the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, the gain values of the channels need to be adjusted. The channel to be adjusted must first be determined, that is, it must be decided which channel's gain value to adjust. One option is to select the channel with the lowest audio signal energy value as the reference channel, determine the other channels as channels to be adjusted, and adjust their gain values. Another is to select the channel with the highest audio signal energy value as the reference channel and determine the other channels as channels to be adjusted. There may also be a single channel to be adjusted, for example the channel whose energy difference from the average energy value has the largest absolute value.
Step S42: adjusting the gain value of the channel to be adjusted according to a preset adjustment mode.
After the channel to be adjusted is determined, its gain value is adjusted according to a preset adjustment mode, where the preset adjustment mode corresponds to the method used to determine the channel to be adjusted. For example, when the channel with the lowest audio signal energy value is taken as the reference channel and the other channels are determined as channels to be adjusted, the gain values of the other channels are reduced by a preset value. The preset value can be set as required. If it is too large, the audio signal energy value after the gain adjustment differs too much from the value before the adjustment, so that the energy values of the channels may never satisfy the preset consistency condition. If it is too small, the number of adjustments and the amount of computation increase. The preset value should therefore be moderate. Further, the preset value may be made to correspond to the size of the energy difference: the larger the difference between the audio signal energy value of the channel to be adjusted and that of the reference channel, the larger the preset value, and the smaller that difference, the smaller the preset value.
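The reference-channel variant with a step that grows with the energy difference can be sketched as follows. The linear rule `step = step_per_unit_energy * difference` is an assumption made for illustration; the application only requires that a larger difference corresponds to a larger preset value.

```python
def adjust_towards_lowest(gains, energies, step_per_unit_energy):
    """Use the lowest-energy channel as reference and lower every other gain.

    Each other channel's gain is reduced by an amount proportional to how far
    its energy lies above the reference energy (assumed linear rule).
    Returns a new list of gains; the reference channel's gain is unchanged.
    """
    ref = min(range(len(energies)), key=lambda i: energies[i])
    new_gains = list(gains)
    for i, energy in enumerate(energies):
        if i != ref:
            new_gains[i] -= step_per_unit_energy * (energy - energies[ref])
    return new_gains
```

The proportionality factor plays the role of the trade-off described above: a large factor risks overshooting the reference energy, while a small one increases the number of rounds.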
Further, step S41 includes:
Step S411: determining the channel with the largest absolute value of the energy difference among all channels as the channel to be adjusted;
Specifically, the channel to be adjusted may be determined as follows. After calculating the absolute value of the energy difference between each channel's audio signal energy value and the average energy value, the calibration device compares these absolute values and determines the channel with the largest one as the channel to be adjusted. If several channels share the same largest absolute value, either all of them are determined as channels to be adjusted or one of them is selected.
Step S42 includes:
Step S421: determining whether the audio signal energy value of the channel to be adjusted is greater than the average energy value;
Step S422: if the audio signal energy value of the channel to be adjusted is greater than the average energy value, reducing the gain value of the channel to be adjusted by a preset value;
Step S423: if the audio signal energy value of the channel to be adjusted is less than the average energy value, increasing the gain value of the channel to be adjusted by a preset value.
After determining the channel with the largest absolute value of the energy difference as the channel to be adjusted, the calibration device determines whether the audio signal energy value of that channel is greater than the average energy value. If it is greater, the gain value of the channel to be adjusted is reduced by the preset value. As described above, the preset value can be set as required. If the audio signal energy value of the channel to be adjusted is less than the average energy value, the gain value of the channel to be adjusted is increased by the preset value.
In this embodiment, the channel with the largest absolute value of the energy difference among all channels is determined as the channel to be adjusted, and its gain value is then adjusted by the preset value. This adjustment mode quickly brings the audio signal energy values of the channels into line with the consistency requirement and speeds up calibration.
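Steps S411 and S421 to S423 can be sketched as a single function. When several channels tie for the largest deviation, this sketch picks the first one, which is one of the two options the text allows; the function name and return shape are illustrative.

```python
def adjust_worst_channel(gains, energies, preset_step):
    """Nudge the gain of the channel that deviates most from the mean energy.

    Returns (index of the adjusted channel, new list of gains).
    """
    mean = sum(energies) / len(energies)
    # S411: channel with the largest |energy - mean| (first one on a tie)
    worst = max(range(len(energies)), key=lambda i: abs(energies[i] - mean))
    new_gains = list(gains)
    if energies[worst] > mean:        # S422: too loud, lower the gain
        new_gains[worst] -= preset_step
    elif energies[worst] < mean:      # S423: too quiet, raise the gain
        new_gains[worst] += preset_step
    return worst, new_gains
```

Only one gain changes per round, so the mean shifts slowly and each round moves the outlier toward the other channels.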
Further, after step S42, the method also includes:
Step S43: detecting whether distortion occurs after the channel to be adjusted performs gain amplification on the reference audio signal with the adjusted gain value;
After the calibration device has adjusted the gain value of the channel to be adjusted according to the preset adjustment mode, that channel performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal with the adjusted gain value. The calibration device may then first detect whether distortion occurs after the channel amplifies the reference audio signal with the adjusted gain value. Because the gain value is adjusted according to the preset adjustment mode, the preset value in that mode may have been set too large, which makes the adjusted gain value too large and may distort the amplified audio signal. A distorted audio signal cannot serve as the basis for the downstream speech recognition module, so an undistorted audio signal is a necessary condition.
Step S44: if distortion is detected after the channel to be adjusted performs gain amplification on the reference audio signal with the adjusted gain value, outputting a prompt message to modify the preset adjustment mode.
If the calibration device detects distortion after the channel to be adjusted amplifies the picked-up reference audio signal with the adjusted gain value, it may output a message prompting the operation and maintenance personnel to modify the preset adjustment mode, that is, to modify the preset value in the preset adjustment mode so that the adjusted gain value is not large enough to distort the audio signal. If no distortion is detected, the calibration device again acquires the digital audio data corresponding to each channel, calculates and analyzes the audio signal energy value of each channel, and detects whether the energy values satisfy the preset consistency condition. If they do not, it adjusts the gains again, and this cycle repeats until the audio signal energy values corresponding to the channels are detected to satisfy the preset consistency condition.
In this embodiment, the above step of detecting whether the audio signal amplified by the channel to be adjusted is distorted avoids audio signal distortion during calibration and thereby ensures the accuracy of the microphone array calibration.
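The application does not state how distortion is detected. One common proxy, assumed here purely for illustration, is clipping: if more than a small fraction of the samples sit at the converter's full-scale value, the amplified signal has probably saturated. The 16-bit full-scale value and the 1% threshold below are hypothetical defaults, not values from the application.

```python
def looks_clipped(samples, full_scale=32767, max_clipped_ratio=0.01):
    """Heuristic distortion check based on clipping (an assumed criterion).

    Returns True when the share of samples at or beyond full scale exceeds
    max_clipped_ratio. An empty input is treated as not clipped.
    """
    if not samples:
        return False
    clipped = sum(1 for s in samples if abs(s) >= full_scale)
    return clipped / len(samples) > max_clipped_ratio
```

A check of this kind would run on the adjusted channel's digital audio data before the next consistency test, and a positive result would trigger the prompt message of step S44.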
Further, based on the first, second or third embodiment above, a fourth embodiment of the present application provides an automatic calibration method for a microphone array. In this embodiment, before the step of adjusting the gain values of the channels, the method also includes:
Step S51: detecting whether the number of adjustments made so far to the gain values of the channels is greater than a preset number;
If the calibration device detects that the audio signal energy values of the channels do not satisfy the preset consistency condition, it needs to adjust the gain values of the channels and then check the condition again, repeating this cycle until the condition is satisfied. During calibration, however, the channel gain values may be adjusted many times while the audio signal energy values still fail to satisfy the preset consistency condition, in which case there may be a hardware problem in the microphone array device. Therefore, before adjusting the gain values, the calibration device may first detect whether the number of adjustments made so far is greater than the preset number. The calibration device may use a counter to record the number of gain adjustments, incrementing it by one each time the channel gain values are adjusted. The preset number can be set as required.
Step S52: if the number of adjustments made so far to the gain values of the channels is greater than the preset number, the step of adjusting the gain value of each channel includes: performing a preset error-reporting operation.
If the calibration apparatus detects that the number of adjustments made so far is greater than the preset number, it stops adjusting the channel gain values, suspends calibration, and performs the preset error-reporting operation. The preset error-reporting operation may be outputting a prompt message informing the operation and maintenance personnel that too many calibration attempts have been made, or emitting an error tone, and so on. The error-reporting operation performed by the calibration apparatus prompts the operation and maintenance personnel to inspect or repair the microphone array equipment.
In this embodiment, performing the preset error-reporting operation when the number of gain adjustments is detected to be greater than the preset number makes it possible to detect hardware problems in the microphone array equipment, so that the operation and maintenance personnel can inspect or repair it.
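By way of illustration only, the counter check of steps S51 and S52 can be sketched as the loop below. The names `calibrate`, `measure`, `is_consistent`, `adjust`, `preset_count`, and `CalibrationError` are hypothetical and do not appear in the application; raising an exception stands in for the preset error-reporting operation, which the application leaves open (a prompt message, an error tone, and so on).

```python
from typing import Callable, List


class CalibrationError(RuntimeError):
    """Raised when the adjustment count exceeds the preset number."""


def calibrate(
    measure: Callable[[], List[float]],
    is_consistent: Callable[[List[float]], bool],
    adjust: Callable[[List[float]], None],
    preset_count: int,
) -> int:
    """Adjust gains until the energies are consistent.

    Returns the number of adjustments that were made.
    """
    adjustments = 0  # counter incremented once per gain adjustment
    while True:
        energies = measure()
        if is_consistent(energies):
            return adjustments
        # Step S51: check the counter before adjusting again.
        if adjustments > preset_count:
            # Step S52: stop adjusting and report the error instead.
            raise CalibrationError(
                f"{adjustments} adjustments made without reaching consistency; "
                "check the microphone array hardware"
            )
        adjust(energies)
        adjustments += 1
```

Because the comparison follows the text literally ("greater than the preset number"), a `preset_count` of 3 in this sketch allows four adjustments before the error is raised.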
In addition, an embodiment of the present application further provides an automatic microphone array calibration device, which includes:
an acquisition module, configured to acquire digital audio data obtained after each channel of the microphone array preprocesses a picked-up reference audio signal;
a calculation module, configured to calculate, according to the digital audio data, the audio signal energy value corresponding to each channel;
a detection module, configured to detect whether the audio signal energy values corresponding to the channels satisfy a preset consistency condition;
an adjustment module, configured to adjust the gain values of the channels if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
Optionally, the acquisition module is further configured to acquire digital audio data obtained after each channel of the microphone performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
Optionally, the calculation module includes:
a framing unit, configured to divide the digital audio data corresponding to each channel into frames;
an obtaining unit, configured to obtain the audio data of a preset frame in the digital audio data corresponding to each channel;
a first calculation unit, configured to calculate the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
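As a minimal sketch of the framing and energy calculation performed by the calculation module, the following Python code splits each channel into frames, selects one preset frame, and computes its energy. The energy is taken here as the sum of squared samples and the frames as consecutive and non-overlapping; both are assumptions made for illustration, as are the names `split_into_frames`, `frame_energy`, `channel_energies`, `frame_length`, and `frame_index`.

```python
from typing import List, Sequence


def split_into_frames(samples: Sequence[float], frame_length: int) -> List[List[float]]:
    """Split samples into consecutive, non-overlapping frames.

    A trailing partial frame is dropped (an assumption of this sketch).
    """
    count = len(samples) // frame_length
    return [
        list(samples[i * frame_length:(i + 1) * frame_length])
        for i in range(count)
    ]


def frame_energy(frame: Sequence[float]) -> float:
    """Energy of one frame, assumed here to be the sum of squared samples."""
    return sum(s * s for s in frame)


def channel_energies(
    channels: Sequence[Sequence[float]], frame_length: int, frame_index: int
) -> List[float]:
    """Energy of the same preset frame (frame_index) in every channel."""
    energies = []
    for samples in channels:
        frames = split_into_frames(samples, frame_length)
        energies.append(frame_energy(frames[frame_index]))
    return energies
```

Using the same frame index for every channel keeps the comparison between channels aligned in time, since all channels pick up the same reference audio signal.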
Optionally, the detection module includes:
a second calculation unit, configured to calculate the average energy value of the audio signal energy values corresponding to the channels, and further configured to calculate, for each channel, the absolute value of the energy difference between that channel's audio signal energy value and the average energy value;
a detection unit, configured to detect whether the absolute value of the energy difference of each channel is less than a preset difference;
a first determining unit, configured to determine that the audio signal energy values corresponding to the channels satisfy the preset consistency condition if the absolute values of the energy differences of all channels are less than the preset difference.
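The consistency check carried out by the detection module can be illustrated with the short Python sketch below. The function and parameter names (`energy_deviations`, `is_consistent`, `preset_difference`) are hypothetical; the logic follows the text: compute the average energy, take each channel's absolute deviation from it, and require every deviation to be strictly less than the preset difference.

```python
from typing import List, Sequence


def energy_deviations(energies: Sequence[float]) -> List[float]:
    """Absolute difference between each channel's energy and the average."""
    mean = sum(energies) / len(energies)
    return [abs(e - mean) for e in energies]


def is_consistent(energies: Sequence[float], preset_difference: float) -> bool:
    """True only if every channel deviates from the average by less than the preset difference."""
    return all(d < preset_difference for d in energy_deviations(energies))
```

For example, energies of 10, 12, and 14 have an average of 12 and deviations of 2, 0, and 2, so they pass with a preset difference of 3 but fail with a preset difference of 2, because the comparison is strict.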
Optionally, the adjustment module includes:
a second determining unit, configured to determine a channel to be adjusted among the channels;
an adjustment unit, configured to adjust the gain value of the channel to be adjusted according to a preset adjustment mode.
Optionally, the second determining unit is further configured to determine, as the channel to be adjusted, the channel with the largest absolute value of the energy difference among all channels;
the adjustment unit is further configured to determine whether the audio signal energy value of the channel to be adjusted is greater than the average energy value; if the audio signal energy value of the channel to be adjusted is greater than the average energy value, to decrease the gain value of the channel to be adjusted by a preset value; and if the audio signal energy value of the channel to be adjusted is less than the average energy value, to increase the gain value of the channel to be adjusted by a preset value.
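One adjustment pass of the adjustment module might look like the following sketch, offered under stated assumptions rather than as the applicant's implementation. The name `adjust_once` and the parameter `preset_value` are hypothetical, gain values are treated as plain numbers in unspecified units, and a tie for the largest deviation is resolved in favour of the lowest channel index, which the text does not specify.

```python
from typing import List, Sequence, Tuple


def adjust_once(
    gains: Sequence[float], energies: Sequence[float], preset_value: float
) -> Tuple[int, List[float]]:
    """Pick the channel farthest from the average energy and step its gain.

    Returns the index of the adjusted channel and the new list of gains.
    """
    mean = sum(energies) / len(energies)
    deviations = [abs(e - mean) for e in energies]
    # Channel to be adjusted: largest absolute energy difference.
    target = max(range(len(energies)), key=lambda i: deviations[i])
    new_gains = list(gains)
    if energies[target] > mean:
        new_gains[target] -= preset_value  # too loud: decrease the gain
    elif energies[target] < mean:
        new_gains[target] += preset_value  # too quiet: increase the gain
    return target, new_gains
```

Only one channel changes per pass, so after each pass the energies would be measured again and the consistency condition rechecked before any further adjustment.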
Optionally, the detection module is further configured to detect whether distortion occurs after the channel to be adjusted performs gain amplification on the reference audio signal according to the adjusted gain value;
the automatic microphone array calibration device further includes:
an output module, configured to output a prompt message for modifying the preset adjustment mode if distortion is detected after the channel to be adjusted performs gain amplification on the reference audio signal according to the adjusted gain value.
Optionally, the detection module is further configured to detect whether the number of adjustments made so far to the gain values of the channels is greater than a preset number;
the automatic microphone array calibration device further includes:
an error-reporting module, configured to perform a preset error-reporting operation if the number of adjustments made so far to the gain values of the channels is greater than the preset number.
It should be noted that the embodiments of the automatic microphone array calibration device are substantially the same as the embodiments of the automatic microphone array calibration method described above and are not described again in detail here.
In addition, an embodiment of the present application further provides a computer-readable storage medium on which an automatic microphone array calibration program is stored; when executed by a processor, the program implements the steps of the automatic microphone array calibration method described above. The specific implementations of the automatic microphone array calibration apparatus and of the storage medium (that is, the computer-readable storage medium) of the present application are substantially the same as the embodiments of the automatic microphone array calibration method described above and are not described again here.
It should be noted that, in this document, the terms "include", "comprise", and any variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or system that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or system. Without further limitation, an element defined by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or system that includes that element.
The serial numbers of the above embodiments of the present application are for description only and do not indicate the relative merits of the embodiments.
From the description of the above embodiments, those skilled in the art will clearly understand that the methods of the above embodiments can be implemented by software together with a necessary general-purpose hardware platform, and can of course also be implemented by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of the present application, in essence or in the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium as described above (such as a ROM/RAM, magnetic disk, or optical disc) and includes several instructions that cause a terminal device (which may be a mobile phone, computer, server, air conditioner, network device, or the like) to perform the methods described in the embodiments of the present application.
The above are only preferred embodiments of the present application and do not thereby limit its patent scope. Any equivalent structure or equivalent process transformation made using the contents of the specification and drawings of the present application, or any direct or indirect application in other related technical fields, is likewise included within the scope of patent protection of the present application.

Claims (20)

  1. An automatic microphone array calibration method, wherein the automatic microphone array calibration method comprises the following steps:
    acquiring digital audio data obtained after each channel of a microphone array preprocesses a picked-up reference audio signal;
    calculating, according to the digital audio data, the audio signal energy value corresponding to each channel;
    detecting whether the audio signal energy values corresponding to the channels satisfy a preset consistency condition; and
    if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, adjusting the gain values of the channels so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
  2. The automatic microphone array calibration method according to claim 1, wherein the step of acquiring digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal comprises:
    acquiring digital audio data obtained after each channel of the microphone performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
  3. The automatic microphone array calibration method according to claim 1, wherein the step of calculating, according to the digital audio data, the audio signal energy value corresponding to each channel comprises:
    dividing the digital audio data corresponding to each channel into frames;
    obtaining the audio data of a preset frame in the digital audio data corresponding to each channel; and
    calculating the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
  4. The automatic microphone array calibration method according to claim 1, wherein the step of detecting whether the audio signal energy values corresponding to the channels satisfy the preset consistency condition comprises:
    calculating the average energy value of the audio signal energy values corresponding to the channels;
    calculating, for each channel, the absolute value of the energy difference between the audio signal energy value corresponding to that channel and the average energy value;
    detecting whether the absolute value of the energy difference of each channel is less than a preset difference; and
    if the absolute values of the energy differences of all channels are less than the preset difference, determining that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
  5. The automatic microphone array calibration method according to claim 4, wherein the step of adjusting the gain values of the channels comprises:
    determining a channel to be adjusted among the channels;
    adjusting the gain value of the channel to be adjusted according to a preset adjustment mode.
  6. The automatic microphone array calibration method according to claim 5, wherein the step of determining a channel to be adjusted among the channels comprises:
    determining, as the channel to be adjusted, the channel with the largest absolute value of the energy difference among all channels;
    the step of adjusting the gain value of the channel to be adjusted according to the preset adjustment mode comprises:
    upon determining that the audio signal energy value of the channel to be adjusted is greater than the average energy value, decreasing the gain value of the channel to be adjusted by a preset value;
    upon determining that the audio signal energy value of the channel to be adjusted is less than the average energy value, increasing the gain value of the channel to be adjusted by a preset value.
  7. The automatic microphone array calibration method according to claim 1, wherein before the step of adjusting the gain values of the channels, the method further comprises:
    detecting whether the number of adjustments made so far to the gain values of the channels is greater than a preset number;
    if the number of adjustments made so far to the gain values of the channels is greater than the preset number, the step of adjusting the gain values of the channels comprises:
    performing a preset error-reporting operation.
  8. An automatic microphone array calibration device, wherein the automatic microphone array calibration device comprises:
    an acquisition module, configured to acquire digital audio data obtained after each channel of a microphone array preprocesses a picked-up reference audio signal;
    a calculation module, configured to calculate, according to the digital audio data, the audio signal energy value corresponding to each channel;
    a detection module, configured to detect whether the audio signal energy values corresponding to the channels satisfy a preset consistency condition; and
    an adjustment module, configured to adjust the gain values of the channels if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
  9. The automatic microphone array calibration device according to claim 8, wherein the acquisition module is further configured to acquire digital audio data obtained after each channel of the microphone performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
  10. The automatic microphone array calibration device according to claim 8, wherein the calculation module comprises:
    a framing unit, configured to divide the digital audio data corresponding to each channel into frames;
    an obtaining unit, configured to obtain the audio data of a preset frame in the digital audio data corresponding to each channel;
    a first calculation unit, configured to calculate the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
  11. The automatic microphone array calibration device according to claim 8, wherein the detection module comprises:
    a second calculation unit, configured to calculate the average energy value of the audio signal energy values corresponding to the channels, and further configured to calculate, for each channel, the absolute value of the energy difference between the audio signal energy value corresponding to that channel and the average energy value;
    a detection unit, configured to detect whether the absolute value of the energy difference of each channel is less than a preset difference;
    a first determining unit, configured to determine that the audio signal energy values corresponding to the channels satisfy the preset consistency condition if the absolute values of the energy differences of all channels are less than the preset difference.
  12. The automatic microphone array calibration device according to claim 11, wherein the adjustment module comprises:
    a second determining unit, configured to determine a channel to be adjusted among the channels;
    an adjustment unit, configured to adjust the gain value of the channel to be adjusted according to a preset adjustment mode.
  13. An automatic microphone array calibration apparatus, wherein the automatic microphone array calibration apparatus comprises a memory, a processor, and an automatic microphone array calibration program stored in the memory and executable on the processor, and the automatic microphone array calibration program, when executed by the processor, implements the following steps:
    acquiring digital audio data obtained after each channel of a microphone array preprocesses a picked-up reference audio signal;
    calculating, according to the digital audio data, the audio signal energy value corresponding to each channel;
    detecting whether the audio signal energy values corresponding to the channels satisfy a preset consistency condition; and
    if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, adjusting the gain values of the channels so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
  14. The automatic microphone array calibration apparatus according to claim 13, wherein the step of acquiring digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal comprises:
    acquiring digital audio data obtained after each channel of the microphone performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
  15. The automatic microphone array calibration apparatus according to claim 13, wherein the step of calculating, according to the digital audio data, the audio signal energy value corresponding to each channel comprises:
    dividing the digital audio data corresponding to each channel into frames;
    obtaining the audio data of a preset frame in the digital audio data corresponding to each channel; and
    calculating the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
  16. The automatic microphone array calibration apparatus according to claim 13, wherein the step of detecting whether the audio signal energy values corresponding to the channels satisfy the preset consistency condition comprises:
    calculating the average energy value of the audio signal energy values corresponding to the channels;
    calculating, for each channel, the absolute value of the energy difference between the audio signal energy value corresponding to that channel and the average energy value;
    detecting whether the absolute value of the energy difference of each channel is less than a preset difference; and
    if the absolute values of the energy differences of all channels are less than the preset difference, determining that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
  17. A computer-readable storage medium, wherein an automatic microphone array calibration program is stored on the computer-readable storage medium, and the automatic microphone array calibration program, when executed by a processor, implements the following steps:
    acquiring digital audio data obtained after each channel of a microphone array preprocesses a picked-up reference audio signal;
    calculating, according to the digital audio data, the audio signal energy value corresponding to each channel;
    detecting whether the audio signal energy values corresponding to the channels satisfy a preset consistency condition; and
    if the audio signal energy values corresponding to the channels do not satisfy the preset consistency condition, adjusting the gain values of the channels so that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
  18. The computer-readable storage medium according to claim 17, wherein the step of acquiring digital audio data obtained after each channel of the microphone array preprocesses the picked-up reference audio signal comprises:
    acquiring digital audio data obtained after each channel of the microphone performs gain amplification and analog-to-digital conversion on the picked-up reference audio signal.
  19. The computer-readable storage medium according to claim 17, wherein the step of calculating, according to the digital audio data, the audio signal energy value corresponding to each channel comprises:
    dividing the digital audio data corresponding to each channel into frames;
    obtaining the audio data of a preset frame in the digital audio data corresponding to each channel; and
    calculating the audio signal energy value corresponding to each channel according to the audio data of the preset frame.
  20. The computer-readable storage medium according to claim 17, wherein the step of detecting whether the audio signal energy values corresponding to the channels satisfy the preset consistency condition comprises:
    calculating the average energy value of the audio signal energy values corresponding to the channels;
    calculating, for each channel, the absolute value of the energy difference between the audio signal energy value corresponding to that channel and the average energy value;
    detecting whether the absolute value of the energy difference of each channel is less than a preset difference; and
    if the absolute values of the energy differences of all channels are less than the preset difference, determining that the audio signal energy values corresponding to the channels satisfy the preset consistency condition.
PCT/CN2019/124639 2018-12-17 2019-12-11 Automatic calibration method, device and apparatus for microphone array and storage medium WO2020125522A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811542125.1A CN109451415A (en) 2018-12-17 2018-12-17 Microphone array auto-collation, device, equipment and storage medium
CN201811542125.1 2018-12-17

Publications (1)

Publication Number Publication Date
WO2020125522A1 (en) 2020-06-25

Family

ID=65559699

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/124639 WO2020125522A1 (en) 2018-12-17 2019-12-11 Automatic calibration method, device and apparatus for microphone array and storage medium

Country Status (2)

Country Link
CN (1) CN109451415A (en)
WO (1) WO2020125522A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113808614A (en) * 2021-07-30 2021-12-17 北京声智科技有限公司 Sound energy value calibration and device wake-up method, device and storage medium

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109451415A (en) * 2018-12-17 2019-03-08 深圳Tcl新技术有限公司 Microphone array auto-collation, device, equipment and storage medium
CN110175013A (en) * 2019-05-20 2019-08-27 北京声智科技有限公司 Voice input method, apparatus, electronic equipment and storage medium
CN112071332B (en) * 2019-06-11 2024-10-15 阿里巴巴集团控股有限公司 Method and device for determining pickup quality
CN110225444A (en) * 2019-06-14 2019-09-10 四川长虹电器股份有限公司 A kind of fault detection method and its detection system of microphone array system
CN110392334B (en) * 2019-07-03 2021-06-08 北京小米移动软件有限公司 Microphone array audio signal self-adaptive processing method, device and medium
CN110636600B (en) * 2019-08-20 2022-01-18 恒玄科技(上海)股份有限公司 Audio synchronous playing method for wireless equipment
CN110996244B (en) * 2019-12-23 2021-08-03 四川虹美智能科技有限公司 Microphone array performance test method, device and system
CN112037825B (en) * 2020-08-10 2022-09-27 北京小米松果电子有限公司 Audio signal processing method and device and storage medium
CN112867138B (en) * 2020-10-13 2023-02-24 恒玄科技(上海)股份有限公司 Audio system and wireless earphone pair
CN112333606B (en) * 2020-10-21 2022-03-25 浙江华创视讯科技有限公司 Microphone gain abnormity adjusting method and device
CN114222234B (en) * 2021-12-31 2024-08-23 思必驰科技股份有限公司 Microphone array consistency detection method, electronic device and storage medium
CN114827821B (en) * 2022-04-25 2024-06-11 世邦通信股份有限公司 Pickup control method and system, pickup device, and storage medium
CN115148203B (en) * 2022-06-06 2024-09-24 青岛海尔科技有限公司 Pickup calibration method and device, nonvolatile storage medium and electronic equipment
CN115567864B (en) * 2022-12-02 2024-03-01 浙江华创视讯科技有限公司 Microphone gain adjusting method and device, storage medium and electronic equipment
CN115691556B (en) * 2023-01-03 2023-03-14 北京睿科伦智能科技有限公司 Method for detecting multichannel voice quality of equipment side

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110051953A1 (en) * 2008-04-25 2011-03-03 Nokia Corporation Calibrating multiple microphones
CN103929707A (en) * 2014-04-08 2014-07-16 深圳市中兴移动通信有限公司 Method for detecting conditions of microphone voice-grade channels and terminal
CN106340305A (en) * 2016-11-04 2017-01-18 北京声智科技有限公司 Self-calibration far-field voice interaction device and far-field voice self-calibration method
CN107180642A (en) * 2017-07-20 2017-09-19 北京华捷艾米科技有限公司 Audio signal bearing calibration, device and equipment
CN109451415A (en) * 2018-12-17 2019-03-08 深圳Tcl新技术有限公司 Microphone array auto-collation, device, equipment and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8731210B2 (en) * 2009-09-21 2014-05-20 Mediatek Inc. Audio processing methods and apparatuses utilizing the same

Also Published As

Publication number Publication date
CN109451415A (en) 2019-03-08

Similar Documents

Publication Publication Date Title
WO2020125522A1 (en) Automatic calibration method, device and apparatus for microphone array and storage medium
US9830924B1 (en) Matching output volume to a command volume
US8611556B2 (en) Calibrating multiple microphones
US8913761B2 (en) Sound source recording apparatus and method adaptable to operating environment
EP2811485A1 (en) Sound correcting apparatus, sound correcting program, and sound correcting method
US20110211035A1 (en) Voice communication apparatus and voice communication method
EP3337190B1 (en) A method of reducing noise in an audio processing device
WO2022174727A1 (en) Howling suppression method and apparatus, hearing aid, and storage medium
US20160118062A1 (en) Robust Voice Activity Detector System for Use with an Earphone
WO2020211004A1 (en) Audio signal processing method and device, and storage medium
CN106775558B (en) Method and device for obtaining earphone optimization parameters and audio providing method and system
CN115604628A (en) Filter calibration method and device based on earphone loudspeaker frequency response
US9564145B2 (en) Speech intelligibility detection
US11336999B2 (en) Sound processing device, sound processing method, and program
CN111580778A (en) Volume correction method and device, audio playing device and readable storage medium
JP2010212887A (en) Signal characteristic adjustment device
CN112802490B (en) Beam forming method and device based on microphone array
US10109293B2 (en) Voice signal processing apparatus and voice signal processing method
WO2020044377A1 (en) Personal communication device as a hearing aid with real-time interactive user interface
KR101225678B1 (en) Auto-steering directional hearing aid and method of operation thereof
US8995686B2 (en) Sound volume control device, sound volume control method, and sound volume control program
WO2021120247A1 (en) Hearing compensation method and device, and computer readable storage medium
CN104254041A (en) Near-end listening intelligibility enhancement
WO2020107385A1 (en) Gain processing method and device implementing same, electronic apparatus, signal acquisition method and system implementing same
CN115567864B (en) Microphone gain adjusting method and device, storage medium and electronic equipment

Legal Events

Date Code Title Description
121 EP: The EPO has been informed by WIPO that EP was designated in this application. Ref document number: 19901201; Country of ref document: EP; Kind code of ref document: A1
NENP Non-entry into the national phase. Ref country code: DE
122 EP: PCT application non-entry in European phase. Ref document number: 19901201; Country of ref document: EP; Kind code of ref document: A1