US3057960A - Normalized sound control system - Google Patents
Normalized sound control system Download PDFInfo
- Publication number
- US3057960A US3057960A US95087A US9508761A US3057960A US 3057960 A US3057960 A US 3057960A US 95087 A US95087 A US 95087A US 9508761 A US9508761 A US 9508761A US 3057960 A US3057960 A US 3057960A
- Authority
- US
- United States
- Prior art keywords
- signal
- signals
- cross
- correlation
- normalizing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R27/00—Public address systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2201/00—Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
- H04R2201/40—Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
- H04R2201/403—Linear arrays of transducers
Definitions
- the cocktail party effect is a psychoaconstic mechanism by which a human listener is able to concentrate on speech from a specific talker, despite a surrounding noise or speech babble whose sound intensity is approximately equal to that of the speech of the preferred talker.
- two spatially separated microphones are placed in a sound field containing both speech from a preferred talker and background noise and speech of equal or higher intensity from other talkers.
- the output signals of the two microphones are combined t form a composite signal, and the amplitude of the composite signal is modulated or varied by a control signal.
- the properties of the control signal are selected to reinforce speech sounds from the preferred talker and to suppress noise and speech sounds from other talkers. This selective reinforcement and suppression in the David system improves the intelligibility of speech from the preferred talker and decreases the intelligibility of speech from other talkers, thereby simulating the ability of the human listener to concentrate on speech from a preferred talker in spite of background noise of equal or higher intensity.
- the control signal is derived by cross-correlating the two microphone signals; that is, the two microphone signals are multiplied together and the product of the two signals is averaged over an appropriate time interval.
- the two microphone signals are multiplied together and the product of the two signals is averaged over an appropriate time interval.
- the magnitude of the cross-correlation signal thus provides an accurate measure of the presence or absence of speech from the preferred talker, and modulation of the composite signal by the cross-correlation signal serves to reinforce the preferred talkers speech sounds, which are coherent with the cross-correlation signal, and to supp-ress the background noise and speech sounds, which are incoherent with the cross-correctional signal. Varying the amplitude of the composite signal in this fashion increases the intelligibility of speech from the preferred talker and decreases the intelligibility of noise and speech from background talkers.
- the David system achieves a significant improvement in the intelligibility of speech from a particular talker
- the improvement in intelligibility is accompanied by an audible distortion in intensity. This distortion is particularly noticeable during changes in the intensity of speech sounds from the preferred talker.
- the magnitude of the cross-correlation signal is a function of more than one independent variable.
- the most important independent variable is the degree of time coincidence between speech sounds arriving at the two microphones, since it is by adjusting this variable that speech sounds from the preferred talker are brought into time coincidence, thereby creating large magnitudes in the cross-correlation signal which ⁇ serve to reinforce speech from the preferred taiker.
- a second independent variable is the average intensity of the sound energy arriving at the two microphones, and this variable also increases the magnitude of the cross-correlation signal. It has been determined by applicant that this additional source of increased magnitude produces an excessive reinforcement of speech from the preferred talker which is audible as the distortion in intensity referred to above.
- the present invention eliminates ydisto-rtion from the David system by normalizing the cross-correlation signal to render it insensitive to the average intensity of the sound energy reaching the two microphones, without disturbing the dependence of the cross-correlation signal upon the degree of time coincidence l'between speech sounds ⁇ arriving at the two microphones.
- normalizing comprises dividing the cross-correlation signal by a normalizing signal that is also a function of the average intensity of the sound energy arriving at the microphones, thereby effectively canceling the dependence of the cross-correlation signal upon average sound intensity. Modulation of the composite signal by the normalized cross-correlation signal of the present invention enhances the intelligibility of speech sounds from a pre ferred talker without distorting the intensity of the preferred speech sounds.
- This invention provides two alternative systems for deriving a normalizing signal.
- the normalizing signal is derived by obtaining the product of the average magnitudes of the two microphone signals.
- the second system the largest value of each microphone signal over several pitch periods is obtained, and the normalizing signal is derived either by multiplying together the largest values of the two microphone signals or by squaring the larger of the two largest values.
- FIG. 1A is a schematic diagram of the cross-correlation sound control system described in the aforementioned David application;
- FIG. 1B is a block diagram of a sound control system embodying the normalizing apparatus of this invention.
- FIG. 2 is a schematic diagram of a preferred embodiment of the normalizing apparatus of this invention.
- FIG. 3 is a schematic diagram of normalizing apparatus alternative to that of FIG. 2.
- FIG. 1A there is shown a cross-correlation version of the sound control system described in the aforementioned David application.
- Spatially separated microphones 1, 2 are placed in a sound field containing both speech sounds from a preferred source and background noise from other sources, and each microphone generates an electrical signal in response to the sound energy it receives.
- the microphone signals are combined in adder 3 to form a composite signal whose amplitude is modulated or adjusted in a variable gain amplifier 5.
- Amplifier 5 is controlled by a cross-correlation signal derived from the microphone signals by signal processing circuits 10, 11 and cross-correlation circuit 12.
- Delay element 4 synchronizes the composite signal with the control signal, and reproducer 6 converts the modulated composite signal into audible sound waves in which the intelligibility of speech sounds from the preferred source is substantially improved.
- Signal processing circuits 10, 11 are of similar construction, comprising, respectively, delay elements 100, 110, equalizers 101, A111, rectitiers 102, 112, and low-pass filters 103, 113.
- the delay elements bring into time coincidence the portions of each microphone signal attributable to speech sounds from the preferred talker; the equalizers pre-emphasize selected frequencies; and the rectiers, which are preferably of the half-wave variety, together with the low-pass lters serve to make the polarity of the microphone signals unidirectional and to pass only the prominent features of the unidirectional signals.
- the processed microphone signals from circuits 10, 11 are applied to cross-correlation circuit 12, where the two signals are multiplied together in multiplier 120 and the product of the two signals is averaged by lowpass filter I121 to form a cross-correlation control signal.
- circuit 12 The cross-correlation operation performed by circuit 12 may be expressed as follows:
- f1(x), f2(x) denote the microphone signals
- the independent variable t represents the amount of time by which one of the microphone signals is delayed with respect to the other microphone signal in order to create time coincidence
- the superposed bar denotes the time averaging performed by low-pass filter 121 upon the product of the delayed microphone signals
- p(l) is the cross-correlation signal derived by circuit 12.
- Equation l the magnitude of the cross correlation of the two functions, as expressed in Equation l, depends primarily upon the independent variable t,
- the cross-correlation function is a measure of the I time coherence between related portions of the two functions; for example, see Y. W. Lee, T. P. Cheatham, Ir., I. B. Wiesner, Application of Correlation Analysis to the Detection of Periodic Signals in Noise, volume 38, Proceedings of the I.R.E., page 1165 (1950).
- the magnitude of the cross-correlation function increases as the time coherence of the two functions increases, starting from a mininmum value when the related portions of the two functions are most incoherent, and reaching a maximum value when the related portions are exactly time coincident.
- the independent variable t is adjusted via delay elements 100, 110 to bring into time coincidence the portions of both microphone signals which are attributable to speech sounds from the preferred source.
- This adjustment of the variable t causes the magnitude of the cross-correlation signal to be relatively large when speech sounds from the preferred source are present, and to be relatively small, with rare exceptions caused by chance coincidences, at all other times.
- Varying the amplitude of the composite signal from adder 3 in amplifier 5 in response to the cross-correlation signal from circuit 12 improves the intelligibility of speech from the prel ferred talker by selectively reinforcing those portions of the composite signal due to preferred speech sounds and suppressing those portions due to sounds from other sources.
- Equation l Analysis of Equation l reveals, however, that the magnitude of the cross correlation of the two microphone signals is also dependent upon the average intensity of the sounds arriving at the microphones, as well as upon the degree of time coherence between the sounds. This is demonstrated, for example, by increasing the average sound intensity at each of the microphones 1, 2 by constant factors K1, K2, respectively; then from Equation 1, the magnitude of the cross-correlation function is increased by a factor lil-K2, that is,
- Equation 2 the dependence of the crosscorrelation signal upon the product of the sound intensities, an expression in Equation 2, produces in amplitier 5 a reinforcement of the composite signal in excess of the reinforcement required for improvement of intelligibility, the excess reinforcement causing the audible distortion of intensity previously discussed.
- the present invention eliminates this distortion at its source: the cross-correlation signal is normalized before applying it to the control terminal of amplifier 5 in order to remove its dependence upon the product of the average sound intensities at the two microphones.
- FIG. 1B the normalizing apparatus of this invention is shown incorporated in the sound control system of FiG. 1A.
- the processed microphone signals from circuits 1t), 11 are applied to normalizing circuit 13 as well as to cross-correlation circuit 12.
- Normalizing circuit 13 which is described in detail below, derives from the microphone signals a nomalizing signal, N, which is a nonlinear function of the average magnitudes of the microphone signals.
- divider 14 By applying the normalizing signal to the divisor terminal of divider 14, which may be of any suitable construction, and by applying the cross-correlation signal to the dividend terminal of divider 14, there is obtained at the output terminal of divider 14 a normalized cross-correlation signal that is primarily dependent upon the degree of time coherence between related portions of the microphone signals and that is relatively insensitive to the product of the average sound intensities at the microphones.
- FIG. 2 illustrates a preferred embodiment of the principles of this invention, in which the microphone signals from signal processing circuits 10, 11 are applied to ampliers 20, 22, respectively, which are constructed in a conventional manner to provide low impedance driving sources for peak detectors 21, 23.
- Peak detectors 21, 23 comprise, respectively, diodes 210, 230, and RC circuits consisting of capacitors 211, 231, and resistors 212, 232.
- the peak detectors have a charging time constant on the order of microseconds and a discharge or holding time constant of about 100 milliseconds, in order to derive from each microphone signal a unidirectional or single polarity signal whose peaks coincide with the commencement of the periods of voiced sounds, and whose average amplitude is approximately equal to the average magnitude of each of the microphone signals, denoted respectively, as shown in FIG. 2. Since the average magnitude of each of the microphone signals is proportional Ito the average sound intensity at each microphone, by multiplying together the output signals of peak detectors 21, 23 in multiplier 24, there is developed at the output terminal of multiplier 24 a normalizing signal, N, approximately proportional to the product of the average sound intensity at each microphone,
- Threshold circuit 25 prevents the normalizing signal from falling below a predetermined nonzero value, and comprises a voltage divider network consisting of resistors 250, 251, with a diode 252 biased by an adjustable energy source 253 connected in parallel with resistor 251. -Diode 252 and source 253 are shown in FIG. 2 connected for a negative polarity normalizing signal; however, -for a positive polarity normalizing signal, the polarities of diode l252 and source 253 are reversed.
- FIG. 3 illustrates alternative apparatus for deriving a normalizing signal, which is somewhat faster-acting than the apparatus shown in FIG. 2.
- the processed microphone signals from circuits 10, y11 of FIG. lB are applied to amplifiers 30, 31 and then passed to tapped delay lines 32, 33, which are terminated in matching impedances 321, 33-1 to prevent reflection.
- the taps of each delay line 32, 33 are connected to a common output point P0, P1, through diodes 32a through 32u and 33a through 3311, respectively, and the signal formed at each of the common output points is the largest of all the signals appearing simultaenously at the taps of each delay line.
- a normalizing signal -either by multiplying together the two signals from .the common output points or by squaring the larger of the two signals.
- 'Ihe two largest signals formed at each common output point are passed through amplifiers 34, 35, and depending upon the positions in which switches 36, 37 are set, the two largest signals -are applied either to the input terminals of diodes 38, 39 or to the input terminals of multiplier 40.
- Diodes 38, 39 are connected to a common output point Pz and the signal developed at this point is the larger of the two largest signals from delay lines 32, 33.
- the combination that comprises a source of a pair of information bearing signals derived from the output signals of a pair of spatially separated sound detecting devices, means for cross-correlating said pair of information bearing signals, means for normalizing the output signal of said crosscorrelating means to eliminate its dependence upon the average amplitudes of said information bearing signals, and means under the control of said normalized crosscorrelation signal for adjusting the amplitude of the sum of the output signals of said sound detecting devices.
- the combination that comprises a pair of spatially separated microphones, means for additively combining the output signals of said microphones, means for individually processing the output signals of said microphones to emphasize selected features of said signals, means for averaging the product of said processed microphone signals to develop a crosscorrelation signal, means for deriving a normalizing signal from said processed microphone signals, means for dividing said cross-correlation signal by said normalizing signal to obtain a control signal, and means under the influence of said control signal for varying the amplitude of said additively combined microphone signals.
- said means for deriving a normalizing signal comprises two parallel subpaths each of which is provided with an input terminal, an output terminal, yand an amplifier and a peak detector connected in series between said input and output terminals, means for applying each of said processed microphone signals to the input terminal of one of said subpaths, a multiplier provided with two input terminals and an output terminal, means for connecting the output terminal of each of said subpaths to one of the input terminals of said multiplier, and threshold means connected between the output terminal of said multiplier and said dividing means.
- said threshold means comprises a voltage divider including a first resistor and a second resistor, a diode connected in series with an adjustable energy source, and means for connecting said series connected diode and energy source in parallel with said second resistor.
- apparatus for reducing distortion which comprises a source of a pair of information bearing signals representative of the sound energy at two spatially separated points in a sound field, a source of a cross-correlation signal derived from said pair of information bearing signals whose magnitude is a function of the time coherence between said signals and of the average amplitudes of said signals, means for deriving from each of said information 'bearing signals a signal proportional to the largest value over several pitch periods of each of said information bearing signals, means for combining said largest value signals to form a normalizing signal, and means for dividing said cross-correlation signal by said normalizing signal.
- said means for deriving from each of said information bearing signals a signal proportional to the largest value over several pitch periods of each of said information bearing signals comprises two parallel signal paths, one for each of said information bearing signals, wherein each of said signals paths includes a first amplifying means provided with an input terminal and an output terminal, a delay line provided with an input terminal and a plurality of taps, means for connecting the output terminal of said first amplifying means to the input terminal of said delay line, a plurality of diodes in one-to-one correspondence with the taps of said delay line, wherein each of said diodes is provided with an input terminal and an output terminal, means for connecting each tap of said delay line to the input terminal of one of said diodes, means for connecting the output terminals of said diodes to a common output point, a second amplifying means provided with an input terminal and an output terminal, means for connecting said common output point to the input terminal of said second amplifying means, and means for connecting the output terminal of said signals
- Apparatus as defined in claim 6 wherein said means for combining said average magnitude signals to form a normalizing signal comprises means for multiplying together said average magnitude signals.
- said means for combining said average magnitude signals to form a normalizing signal comprises two diodes, one for each of said average magnitude signals, wherein each of said diodes is provided with an input terminal and an output terminal, means for applying each average magnitude signal to the input terminal of one of said diodes, means for connecting the output terminals of said diodes to a common output point, squaring means provided with an input terminal and an output terminal, means for connecting said common output point to the input terminal of said squaring means, and means for connecting the output terminal of said squaring means to said dividing means.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Circuit For Audible Band Transducer (AREA)
Description
Oct. 9, 1962 J. F. KAISER NORMALIZED SOUND CONTROL SYSTEM 3 Sheets-Sheet l Oct. 9, 1962 J. F. KAISER NORMALIZEO SOUND CONTROL SYSTEM 5 Sheets-Sheet 2 Filed March 15, 1961 QE @Ema ln Oct. 9, 1962 J. F. KAISER NORMALIZED SOUND CONTROL SYSTEM 5 Sheets-Sheet 5 Filed March 13, 1961 INVENTOR J. KA /SER BV ATTORNEY m. 6fm
United States Patent Office 3,057,960 Patented Oct. 9, 1962 3,057,960 NORMAMZED SUND CONTROL SYSTEM James F. Kaiser, Summit, NJ., assigner to Bell Telephone Laboratories, incorporated, New York, NX., a corporation of New York Filed Mar. 13, 1961, Ser. No. 95,087 9 Claims. (Cl. 179-1) This invention relates to sound control systems for improving the intelligibility of speech from a talker located in a noisy environment.
The sound control system described by E. E. David, Jr. in a copending application filed December 15, 1959, Serial No. 859,684, improves the inteiligibility of speech from a talker located in a noisy environment by simulating in part of the so-called cocktail party effect. The cocktail party effect is a psychoaconstic mechanism by which a human listener is able to concentrate on speech from a specific talker, despite a surrounding noise or speech babble whose sound intensity is approximately equal to that of the speech of the preferred talker. In the David system, two spatially separated microphones are placed in a sound field containing both speech from a preferred talker and background noise and speech of equal or higher intensity from other talkers. The output signals of the two microphones are combined t form a composite signal, and the amplitude of the composite signal is modulated or varied by a control signal. The properties of the control signal are selected to reinforce speech sounds from the preferred talker and to suppress noise and speech sounds from other talkers. This selective reinforcement and suppression in the David system improves the intelligibility of speech from the preferred talker and decreases the intelligibility of speech from other talkers, thereby simulating the ability of the human listener to concentrate on speech from a preferred talker in spite of background noise of equal or higher intensity.
In one embodiment of the David invention, the control signal is derived by cross-correlating the two microphone signals; that is, the two microphone signals are multiplied together and the product of the two signals is averaged over an appropriate time interval. By suitably delaying one or the other of the two microphone signals before cross-correlating them, those portions of the two microphone signals which correspond to speech sounds from the preferred talker are brought into time coincidence, thereby causing the magnitude of the crosscorrelation control signal to be relatively large when speech sounds from the preferred talker are present, and to be relatively small when speech sounds from the preferred talker are not present. The magnitude of the cross-correlation signal thus provides an accurate measure of the presence or absence of speech from the preferred talker, and modulation of the composite signal by the cross-correlation signal serves to reinforce the preferred talkers speech sounds, which are coherent with the cross-correlation signal, and to supp-ress the background noise and speech sounds, which are incoherent with the cross-correctional signal. Varying the amplitude of the composite signal in this fashion increases the intelligibility of speech from the preferred talker and decreases the intelligibility of noise and speech from background talkers.
Although the David system achieves a significant improvement in the intelligibility of speech from a particular talker, in the embodiment that utilizes the cross correlation of he microphone signals as a control signal the improvement in intelligibility is accompanied by an audible distortion in intensity. This distortion is particularly noticeable during changes in the intensity of speech sounds from the preferred talker.
Investigation into the source of distortion in the crosscorrelation version of the David system revealed that the magnitude of the cross-correlation signal is a function of more than one independent variable. The most important independent variable is the degree of time coincidence between speech sounds arriving at the two microphones, since it is by adjusting this variable that speech sounds from the preferred talker are brought into time coincidence, thereby creating large magnitudes in the cross-correlation signal which `serve to reinforce speech from the preferred taiker. A second independent variable is the average intensity of the sound energy arriving at the two microphones, and this variable also increases the magnitude of the cross-correlation signal. It has been determined by applicant that this additional source of increased magnitude produces an excessive reinforcement of speech from the preferred talker which is audible as the distortion in intensity referred to above.
It is the principal object of the present invention to improve the intelligibility of speech sounds from a preferred talker located in a noisy environment without distorting the intensity of the preferred speech sounds.
it is a specific object of this invention to eliminate distortion from the cross-correlation version of the David system by removing the dependence of the magnitude of the cross-correlation control signal upon the average intensity of the sound energy reaching the two microphones.
The present invention eliminates ydisto-rtion from the David system by normalizing the cross-correlation signal to render it insensitive to the average intensity of the sound energy reaching the two microphones, without disturbing the dependence of the cross-correlation signal upon the degree of time coincidence l'between speech sounds `arriving at the two microphones. In this invention, normalizing comprises dividing the cross-correlation signal by a normalizing signal that is also a function of the average intensity of the sound energy arriving at the microphones, thereby effectively canceling the dependence of the cross-correlation signal upon average sound intensity. Modulation of the composite signal by the normalized cross-correlation signal of the present invention enhances the intelligibility of speech sounds from a pre ferred talker without distorting the intensity of the preferred speech sounds.
This invention provides two alternative systems for deriving a normalizing signal. In the iirst system, the normalizing signal is derived by obtaining the product of the average magnitudes of the two microphone signals. In the second system, the largest value of each microphone signal over several pitch periods is obtained, and the normalizing signal is derived either by multiplying together the largest values of the two microphone signals or by squaring the larger of the two largest values.
The invention will be fully understood from the following detailed description of illustrative embodiments thereof taken in connection with the appended drawings, in which:
FIG. 1A is a schematic diagram of the cross-correlation sound control system described in the aforementioned David application;
FIG. 1B is a block diagram of a sound control system embodying the normalizing apparatus of this invention;
FIG. 2 is a schematic diagram of a preferred embodiment of the normalizing apparatus of this invention; and
FIG. 3 is a schematic diagram of normalizing apparatus alternative to that of FIG. 2.
Theoretical Considerations Referring'to FIG. 1A, there is shown a cross-correlation version of the sound control system described in the aforementioned David application. Spatially separated microphones 1, 2 are placed in a sound field containing both speech sounds from a preferred source and background noise from other sources, and each microphone generates an electrical signal in response to the sound energy it receives. The microphone signals are combined in adder 3 to form a composite signal whose amplitude is modulated or adjusted in a variable gain amplifier 5. Amplifier 5 is controlled by a cross-correlation signal derived from the microphone signals by signal processing circuits 10, 11 and cross-correlation circuit 12. Delay element 4 synchronizes the composite signal with the control signal, and reproducer 6 converts the modulated composite signal into audible sound waves in which the intelligibility of speech sounds from the preferred source is substantially improved.
The cross-correlation operation performed by circuit 12 may be expressed as follows:
where f1(x), f2(x) denote the microphone signals, the independent variable t represents the amount of time by which one of the microphone signals is delayed with respect to the other microphone signal in order to create time coincidence, the superposed bar denotes the time averaging performed by low-pass filter 121 upon the product of the delayed microphone signals, and p(l) is the cross-correlation signal derived by circuit 12.
It is well known that for two functions that are in some manner related to each other, the magnitude of the cross correlation of the two functions, as expressed in Equation l, depends primarily upon the independent variable t,
that is, the cross-correlation function is a measure of the I time coherence between related portions of the two functions; for example, see Y. W. Lee, T. P. Cheatham, Ir., I. B. Wiesner, Application of Correlation Analysis to the Detection of Periodic Signals in Noise, volume 38, Proceedings of the I.R.E., page 1165 (1950). Thus the magnitude of the cross-correlation function increases as the time coherence of the two functions increases, starting from a mininmum value when the related portions of the two functions are most incoherent, and reaching a maximum value when the related portions are exactly time coincident.
`ln the case of the cross-correlation control signal derived from the two microphone signals by the apparatus of FIG. 1A, the independent variable t is adjusted via delay elements 100, 110 to bring into time coincidence the portions of both microphone signals which are attributable to speech sounds from the preferred source. This adjustment of the variable t causes the magnitude of the cross-correlation signal to be relatively large when speech sounds from the preferred source are present, and to be relatively small, with rare exceptions caused by chance coincidences, at all other times. Varying the amplitude of the composite signal from adder 3 in amplifier 5 in response to the cross-correlation signal from circuit 12 improves the intelligibility of speech from the prel ferred talker by selectively reinforcing those portions of the composite signal due to preferred speech sounds and suppressing those portions due to sounds from other sources.
Analysis of Equation l reveals, however, that the magnitude of the cross correlation of the two microphone signals is also dependent upon the average intensity of the sounds arriving at the microphones, as well as upon the degree of time coherence between the sounds. This is demonstrated, for example, by increasing the average sound intensity at each of the microphones 1, 2 by constant factors K1, K2, respectively; then from Equation 1, the magnitude of the cross-correlation function is increased by a factor lil-K2, that is,
ata :Iolite-Ionen) :K1-1cm 2) It has been determined that the dependence of the crosscorrelation signal upon the product of the sound intensities, an expression in Equation 2, produces in amplitier 5 a reinforcement of the composite signal in excess of the reinforcement required for improvement of intelligibility, the excess reinforcement causing the audible distortion of intensity previously discussed. The present invention eliminates this distortion at its source: the cross-correlation signal is normalized before applying it to the control terminal of amplifier 5 in order to remove its dependence upon the product of the average sound intensities at the two microphones. The dependence of the normalized cross-correlation signal upon the degree of time coherence between the microphone signals is preserved, however, and modulation of the composite signal in response to the normalized cross-correlation signal improves the intelligibility of speech sounds from a preferred source Without distorting the intensity of the preferred speech sounds.
Com plete System Referring now to FIG. 1B, the normalizing apparatus of this invention is shown incorporated in the sound control system of FiG. 1A. The processed microphone signals from circuits 1t), 11 are applied to normalizing circuit 13 as well as to cross-correlation circuit 12. Normalizing circuit 13, which is described in detail below, derives from the microphone signals a nomalizing signal, N, which is a nonlinear function of the average magnitudes of the microphone signals. By applying the normalizing signal to the divisor terminal of divider 14, which may be of any suitable construction, and by applying the cross-correlation signal to the dividend terminal of divider 14, there is obtained at the output terminal of divider 14 a normalized cross-correlation signal that is primarily dependent upon the degree of time coherence between related portions of the microphone signals and that is relatively insensitive to the product of the average sound intensities at the microphones.
Preferred Normalzing Circuit FIG. 2 illustrates a preferred embodiment of the principles of this invention, in which the microphone signals from signal processing circuits 10, 11 are applied to ampliers 20, 22, respectively, which are constructed in a conventional manner to provide low impedance driving sources for peak detectors 21, 23. Peak detectors 21, 23 comprise, respectively, diodes 210, 230, and RC circuits consisting of capacitors 211, 231, and resistors 212, 232. The peak detectors have a charging time constant on the order of microseconds and a discharge or holding time constant of about 100 milliseconds, in order to derive from each microphone signal a unidirectional or single polarity signal whose peaks coincide with the commencement of the periods of voiced sounds, and whose average amplitude is approximately equal to the average magnitude of each of the microphone signals, denoted respectively, as shown in FIG. 2. Since the average magnitude of each of the microphone signals is proportional Ito the average sound intensity at each microphone, by multiplying together the output signals of peak detectors 21, 23 in multiplier 24, there is developed at the output terminal of multiplier 24 a normalizing signal, N, approximately proportional to the product of the average sound intensity at each microphone,
N=|f1|'|f2| (3) By dividing the cross-correlation signal p, by the normalizing signal N, as shown in divider 14 of FIG. 1B, there is obtained a normalized cross-correlation signal I; where It is observed in Equation 4 that an increase in sound intensity at each microphone by factors K1, K2, does not affect the magnitude of the normalized cross-correlation signal, since the factor K1K2 appears in both the numerator and the denominator of AEquation 4.
In order to prevent the denominator of Equation 4 from passing to Zero at some time and thereby producing an erroneous normalized cross-correlation signal at the output terminal of divider 14, the normalizing signal is passed through threshold circuit 25. Threshold circuit 25 prevents the normalizing signal from falling below a predetermined nonzero value, and comprises a voltage divider network consisting of resistors 250, 251, with a diode 252 biased by an adjustable energy source 253 connected in parallel with resistor 251. -Diode 252 and source 253 are shown in FIG. 2 connected for a negative polarity normalizing signal; however, -for a positive polarity normalizing signal, the polarities of diode l252 and source 253 are reversed.
Alternative Normalizng Circuit FIG. 3 illustrates alternative apparatus for deriving a normalizing signal, which is somewhat faster-acting than the apparatus shown in FIG. 2. The processed microphone signals from circuits 10, y11 of FIG. lB are applied to amplifiers 30, 31 and then passed to tapped delay lines 32, 33, which are terminated in matching impedances 321, 33-1 to prevent reflection. The taps of each delay line 32, 33 are connected to a common output point P0, P1, through diodes 32a through 32u and 33a through 3311, respectively, and the signal formed at each of the common output points is the largest of all the signals appearing simultaenously at the taps of each delay line. By making the total delay of the delay lines equal to several pitch periods, that is, about l milliseconds delay with n=l equally spaced taps, the largest value of each microphone signal over several pitch periods is obtained at each of the common output points.
From the largest signal obtained at each of the common output points there is derived a normalizing signal, -either by multiplying together the two signals from .the common output points or by squaring the larger of the two signals. 'Ihe two largest signals formed at each common output point are passed through amplifiers 34, 35, and depending upon the positions in which switches 36, 37 are set, the two largest signals -are applied either to the input terminals of diodes 38, 39 or to the input terminals of multiplier 40. Diodes 38, 39 are connected to a common output point Pz and the signal developed at this point is the larger of the two largest signals from delay lines 32, 33. The magnitude of the larger of the two signals passed by diodes 38, 39 is squared in squaring circuit -41 to form -a normalizing signal. When switches 36, 37 connect the two common output points P0, P1 of delay lines 32, 33 to the input terminals of multiplier 40, however, the product signal developed at the output terminal of multiplier 40 from the largest signal `at each common output point is the normalizing signal.
It is to be understood that the above-described arrangements are merely illustrative of applications of the principles of the invention. Numerous other arrangements may be devised by those skilled in the art without departing from the spirit and scope of the invention.
What is claimed is:
l. In a system for the control of sound, the combination that comprises a source of a pair of information bearing signals derived from the output signals of a pair of spatially separated sound detecting devices, means for cross-correlating said pair of information bearing signals, means for normalizing the output signal of said crosscorrelating means to eliminate its dependence upon the average amplitudes of said information bearing signals, and means under the control of said normalized crosscorrelation signal for adjusting the amplitude of the sum of the output signals of said sound detecting devices.
2. In a sound control system, the combination that comprises a pair of spatially separated microphones, means for additively combining the output signals of said microphones, means for individually processing the output signals of said microphones to emphasize selected features of said signals, means for averaging the product of said processed microphone signals to develop a crosscorrelation signal, means for deriving a normalizing signal from said processed microphone signals, means for dividing said cross-correlation signal by said normalizing signal to obtain a control signal, and means under the influence of said control signal for varying the amplitude of said additively combined microphone signals.
3. Apparatus as deiined in claim 2 wherein said means for deriving a normalizing signal comprises two parallel subpaths each of which is provided with an input terminal, an output terminal, yand an amplifier and a peak detector connected in series between said input and output terminals, means for applying each of said processed microphone signals to the input terminal of one of said subpaths, a multiplier provided with two input terminals and an output terminal, means for connecting the output terminal of each of said subpaths to one of the input terminals of said multiplier, and threshold means connected between the output terminal of said multiplier and said dividing means.
4. Apparatus as deiined in claim 3 wherein said peak detector comprises a diode and an RC circuit.
5. Apparatus las defined in claim 3 wherein said threshold means comprises a voltage divider including a first resistor and a second resistor, a diode connected in series with an adjustable energy source, and means for connecting said series connected diode and energy source in parallel with said second resistor.
6. In combination with a cross-correlation sound control system, apparatus for reducing distortion which comprises a source of a pair of information bearing signals representative of the sound energy at two spatially separated points in a sound field, a source of a cross-correlation signal derived from said pair of information bearing signals whose magnitude is a function of the time coherence between said signals and of the average amplitudes of said signals, means for deriving from each of said information 'bearing signals a signal proportional to the largest value over several pitch periods of each of said information bearing signals, means for combining said largest value signals to form a normalizing signal, and means for dividing said cross-correlation signal by said normalizing signal.
7. Apparatus as defined in claim `6 wherein said means for deriving from each of said information bearing signals a signal proportional to the largest value over several pitch periods of each of said information bearing signals comprises two parallel signal paths, one for each of said information bearing signals, wherein each of said signals paths includes a first amplifying means provided with an input terminal and an output terminal, a delay line provided with an input terminal and a plurality of taps, means for connecting the output terminal of said first amplifying means to the input terminal of said delay line, a plurality of diodes in one-to-one correspondence with the taps of said delay line, wherein each of said diodes is provided with an input terminal and an output terminal, means for connecting each tap of said delay line to the input terminal of one of said diodes, means for connecting the output terminals of said diodes to a common output point, a second amplifying means provided with an input terminal and an output terminal, means for connecting said common output point to the input terminal of said second amplifying means, and means for connecting the output terminal of said second amplifying means to said combining means.
8. Apparatus as defined in claim 6 wherein said means for combining said average magnitude signals to form a normalizing signal comprises means for multiplying together said average magnitude signals.
9. Apparatus as dened in claim 6 wherein said means for combining said average magnitude signals to form a normalizing signal comprises two diodes, one for each of said average magnitude signals, wherein each of said diodes is provided with an input terminal and an output terminal, means for applying each average magnitude signal to the input terminal of one of said diodes, means for connecting the output terminals of said diodes to a common output point, squaring means provided with an input terminal and an output terminal, means for connecting said common output point to the input terminal of said squaring means, and means for connecting the output terminal of said squaring means to said dividing means.
No references cited.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US95087A US3057960A (en) | 1961-03-13 | 1961-03-13 | Normalized sound control system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US95087A US3057960A (en) | 1961-03-13 | 1961-03-13 | Normalized sound control system |
Publications (1)
Publication Number | Publication Date |
---|---|
US3057960A true US3057960A (en) | 1962-10-09 |
Family
ID=22249463
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US95087A Expired - Lifetime US3057960A (en) | 1961-03-13 | 1961-03-13 | Normalized sound control system |
Country Status (1)
Country | Link |
---|---|
US (1) | US3057960A (en) |
Cited By (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3440350A (en) * | 1966-08-01 | 1969-04-22 | Bell Telephone Labor Inc | Reception of signals transmitted in a reverberant environment |
US3497622A (en) * | 1966-10-21 | 1970-02-24 | Zenith Radio Corp | Automatic gain control |
US3502989A (en) * | 1966-05-03 | 1970-03-24 | Itt | Receiver employing correlation techniques |
US3794766A (en) * | 1973-02-08 | 1974-02-26 | Bell Telephone Labor Inc | Delay equalizing circuit for an audio system using multiple microphones |
US3947636A (en) * | 1974-08-12 | 1976-03-30 | Edgar Albert D | Transient noise filter employing crosscorrelation to detect noise and autocorrelation to replace the noisey segment |
US4131760A (en) * | 1977-12-07 | 1978-12-26 | Bell Telephone Laboratories, Incorporated | Multiple microphone dereverberation system |
US4181818A (en) * | 1976-11-05 | 1980-01-01 | Gentex Corporation | Personal amplifier system |
US4185167A (en) * | 1976-06-28 | 1980-01-22 | Acoustical Design Incorporated | Sound masking package |
US4236040A (en) * | 1978-08-17 | 1980-11-25 | General Motors Corporation | Sound intensity meter |
NL8303589A (en) * | 1982-10-18 | 1984-05-16 | Nippon Telegraph & Telephone | MICROPHONE CONFIGURATION AND METHOD FOR DERIVING A DESIRED SIGNAL. |
US4956867A (en) * | 1989-04-20 | 1990-09-11 | Massachusetts Institute Of Technology | Adaptive beamforming for noise reduction |
US20020176594A1 (en) * | 2001-03-02 | 2002-11-28 | Volker Hohmann | Method for the operation of a hearing aid device or hearing device system as well as hearing aid device or hearing device system |
WO2003015458A2 (en) * | 2001-08-10 | 2003-02-20 | Rasmussen Digital Aps | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment |
WO2003015457A2 (en) * | 2001-08-10 | 2003-02-20 | Rasmussen Digital Aps | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment |
US6549630B1 (en) * | 2000-02-04 | 2003-04-15 | Plantronics, Inc. | Signal expander with discrimination between close and distant acoustic source |
US20070050441A1 (en) * | 2005-08-26 | 2007-03-01 | Step Communications Corporation,A Nevada Corporati | Method and apparatus for improving noise discrimination using attenuation factor |
US20070047743A1 (en) * | 2005-08-26 | 2007-03-01 | Step Communications Corporation, A Nevada Corporation | Method and apparatus for improving noise discrimination using enhanced phase difference value |
US20070047742A1 (en) * | 2005-08-26 | 2007-03-01 | Step Communications Corporation, A Nevada Corporation | Method and system for enhancing regional sensitivity noise discrimination |
US20090234618A1 (en) * | 2005-08-26 | 2009-09-17 | Step Labs, Inc. | Method & Apparatus For Accommodating Device And/Or Signal Mismatch In A Sensor Array |
US20100109951A1 (en) * | 2005-08-26 | 2010-05-06 | Dolby Laboratories, Inc. | Beam former using phase difference enhancement |
US20100158269A1 (en) * | 2008-12-22 | 2010-06-24 | Vimicro Corporation | Method and apparatus for reducing wind noise |
US20110029288A1 (en) * | 2005-08-26 | 2011-02-03 | Dolby Laboratories Licensing Corporation | Method And Apparatus For Improving Noise Discrimination In Multiple Sensor Pairs |
CN104885152A (en) * | 2012-12-28 | 2015-09-02 | 共荣工程株式会社 | Sound-source separation method, device, and program |
-
1961
- 1961-03-13 US US95087A patent/US3057960A/en not_active Expired - Lifetime
Non-Patent Citations (1)
Title |
---|
None * |
Cited By (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3502989A (en) * | 1966-05-03 | 1970-03-24 | Itt | Receiver employing correlation techniques |
US3440350A (en) * | 1966-08-01 | 1969-04-22 | Bell Telephone Labor Inc | Reception of signals transmitted in a reverberant environment |
US3497622A (en) * | 1966-10-21 | 1970-02-24 | Zenith Radio Corp | Automatic gain control |
US3794766A (en) * | 1973-02-08 | 1974-02-26 | Bell Telephone Labor Inc | Delay equalizing circuit for an audio system using multiple microphones |
US3947636A (en) * | 1974-08-12 | 1976-03-30 | Edgar Albert D | Transient noise filter employing crosscorrelation to detect noise and autocorrelation to replace the noisey segment |
US4185167A (en) * | 1976-06-28 | 1980-01-22 | Acoustical Design Incorporated | Sound masking package |
US4181818A (en) * | 1976-11-05 | 1980-01-01 | Gentex Corporation | Personal amplifier system |
US4131760A (en) * | 1977-12-07 | 1978-12-26 | Bell Telephone Laboratories, Incorporated | Multiple microphone dereverberation system |
US4236040A (en) * | 1978-08-17 | 1980-11-25 | General Motors Corporation | Sound intensity meter |
NL8303589A (en) * | 1982-10-18 | 1984-05-16 | Nippon Telegraph & Telephone | MICROPHONE CONFIGURATION AND METHOD FOR DERIVING A DESIRED SIGNAL. |
US4956867A (en) * | 1989-04-20 | 1990-09-11 | Massachusetts Institute Of Technology | Adaptive beamforming for noise reduction |
US6549630B1 (en) * | 2000-02-04 | 2003-04-15 | Plantronics, Inc. | Signal expander with discrimination between close and distant acoustic source |
US20020176594A1 (en) * | 2001-03-02 | 2002-11-28 | Volker Hohmann | Method for the operation of a hearing aid device or hearing device system as well as hearing aid device or hearing device system |
US7013015B2 (en) * | 2001-03-02 | 2006-03-14 | Siemens Audiologische Technik Gmbh | Method for the operation of a hearing aid device or hearing device system as well as hearing aid device or hearing device system |
WO2003015458A2 (en) * | 2001-08-10 | 2003-02-20 | Rasmussen Digital Aps | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment |
WO2003015457A2 (en) * | 2001-08-10 | 2003-02-20 | Rasmussen Digital Aps | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment |
WO2003015458A3 (en) * | 2001-08-10 | 2003-10-23 | Rasmussen Digital Aps | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in multiple wave sound environment |
WO2003015457A3 (en) * | 2001-08-10 | 2004-03-11 | Rasmussen Digital Aps | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment |
US7274794B1 (en) | 2001-08-10 | 2007-09-25 | Sonic Innovations, Inc. | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment |
AU2002331235B2 (en) * | 2001-08-10 | 2008-08-14 | Rasmussen Digital Aps | Sound processing system including forward filter that exhibits arbitrary directivity and gradient response in single wave sound environment |
US20070050441A1 (en) * | 2005-08-26 | 2007-03-01 | Step Communications Corporation,A Nevada Corporati | Method and apparatus for improving noise discrimination using attenuation factor |
US20110029288A1 (en) * | 2005-08-26 | 2011-02-03 | Dolby Laboratories Licensing Corporation | Method And Apparatus For Improving Noise Discrimination In Multiple Sensor Pairs |
EP1917837A2 (en) * | 2005-08-26 | 2008-05-07 | Step Communications Corporation | Method and apparatus for improving noise discrimination using attenuation factor |
US20070047743A1 (en) * | 2005-08-26 | 2007-03-01 | Step Communications Corporation, A Nevada Corporation | Method and apparatus for improving noise discrimination using enhanced phase difference value |
US20090234618A1 (en) * | 2005-08-26 | 2009-09-17 | Step Labs, Inc. | Method & Apparatus For Accommodating Device And/Or Signal Mismatch In A Sensor Array |
US20100109951A1 (en) * | 2005-08-26 | 2010-05-06 | Dolby Laboratories, Inc. | Beam former using phase difference enhancement |
USRE47535E1 (en) | 2005-08-26 | 2019-07-23 | Dolby Laboratories Licensing Corporation | Method and apparatus for accommodating device and/or signal mismatch in a sensor array |
US20070047742A1 (en) * | 2005-08-26 | 2007-03-01 | Step Communications Corporation, A Nevada Corporation | Method and system for enhancing regional sensitivity noise discrimination |
EP1917837A4 (en) * | 2005-08-26 | 2011-03-02 | Dolby Lab Licensing Corp | Method and apparatus for improving noise discrimination using attenuation factor |
US8111192B2 (en) | 2005-08-26 | 2012-02-07 | Dolby Laboratories Licensing Corporation | Beam former using phase difference enhancement |
US8155926B2 (en) | 2005-08-26 | 2012-04-10 | Dolby Laboratories Licensing Corporation | Method and apparatus for accommodating device and/or signal mismatch in a sensor array |
US8155927B2 (en) | 2005-08-26 | 2012-04-10 | Dolby Laboratories Licensing Corporation | Method and apparatus for improving noise discrimination in multiple sensor pairs |
US20100158269A1 (en) * | 2008-12-22 | 2010-06-24 | Vimicro Corporation | Method and apparatus for reducing wind noise |
CN104885152A (en) * | 2012-12-28 | 2015-09-02 | 共荣工程株式会社 | Sound-source separation method, device, and program |
EP2940686A4 (en) * | 2012-12-28 | 2016-08-03 | Kyoei Engineering Co Ltd | Sound-source separation method, device, and program |
CN104885152B (en) * | 2012-12-28 | 2019-04-26 | 共荣工程株式会社 | Sound source separating method, device and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US3057960A (en) | Normalized sound control system | |
GB1458663A (en) | Microphone circuits | |
US3518578A (en) | Signal compression and expansion system | |
GB1033001A (en) | Electroacoustic transducers | |
US3732371A (en) | Wide dynamic range noise masking compandor | |
GB1341468A (en) | Multi frequency voice receiver | |
US3278866A (en) | Selective transmission | |
US3701028A (en) | Reduction of harmonic distortion | |
US2799734A (en) | Speech brighteners | |
WO2001026352A1 (en) | Tagging echoes with low frequency noise | |
US2208160A (en) | Communication system | |
FR2305909A1 (en) | Microphones and associated equipment - include one unshielded microphone and one masked microphone, and electronics system to minimise noise background | |
US3746791A (en) | Speech synthesizer utilizing white noise | |
GB1008565A (en) | Improvements in or relating to voiced sound detection circuits | |
GB1031145A (en) | Improvements in or relating to audio-frequency amplifiers | |
US4025894A (en) | Method and apparatus for improving the directional characteristic of an echo sounder receiving base | |
ES456619A1 (en) | Receiver apparatus for detection of two voice frequencies in a multifrequency tone signal | |
US4266094A (en) | Electronic speech processing system | |
US3548100A (en) | Formant frequency extractor | |
ES445885A1 (en) | Installation of electronic listening. (Machine-translation by Google Translate, not legally binding) | |
FR2271725A1 (en) | Telephonic conference system with speech detector - enables speech from one participant to block all other transmissions | |
GB1054796A (en) | ||
FR2290814A1 (en) | Acoustic feedback coupling disturbance suppression arrangement - has successive circuit stages operating on desired and undesired components | |
GB936124A (en) | Speech bandwidth compression systems | |
Korohoda et al. | Design of the mutually cancelling narrow passband and stopband filters—A case study |