Nothing Special   »   [go: up one dir, main page]

CN111669697B - Coherent sound and environmental sound extraction method and system of multichannel signal - Google Patents

Coherent sound and environmental sound extraction method and system of multichannel signal Download PDF

Info

Publication number
CN111669697B
CN111669697B CN202010447863.9A CN202010447863A CN111669697B CN 111669697 B CN111669697 B CN 111669697B CN 202010447863 A CN202010447863 A CN 202010447863A CN 111669697 B CN111669697 B CN 111669697B
Authority
CN
China
Prior art keywords
sound
channel
coherent
calculating
channels
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010447863.9A
Other languages
Chinese (zh)
Other versions
CN111669697A (en
Inventor
吴彦琴
桑晋秋
郑成诗
张芳杰
李晓东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Acoustics CAS
Original Assignee
Institute of Acoustics CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Acoustics CAS filed Critical Institute of Acoustics CAS
Priority to CN202010447863.9A priority Critical patent/CN111669697B/en
Publication of CN111669697A publication Critical patent/CN111669697A/en
Application granted granted Critical
Publication of CN111669697B publication Critical patent/CN111669697B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/305Electronic adaptation of stereophonic audio signals to reverberation of the listening space
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

The invention discloses a method and a system for extracting coherent sound and environmental sound of a multi-channel signal, wherein the method comprises the following steps: calculating weight expressions of the N channel signal coherent sounds, and estimating the coherent sounds according to the weight expressions, thereby calculating the coherent sounds of each channel; wherein the ambient sound energy of each channel is the same; calculating the environment sound of each channel according to the coherent sound of each channel; and carrying out inverse Fourier transform on the N channels of coherent sound and the N channels of environment sound to obtain coherent sound and environment sound represented by a time domain. The method of the invention explores a weight expression for estimating coherent sound aiming at signals with any number of channels under the condition that the environmental sound energy of each channel is the same, and solves each unknown parameter in the weight expression by using the signal energy of each channel and the correlation value among the channels, thereby realizing the extraction of the coherent sound and the environmental sound of the multi-channel signals and having high extraction precision.

Description

Coherent sound and environmental sound extraction method and system of multichannel signal
Technical Field
The invention relates to the field of spatial sound reproduction, in particular to a method and a system for extracting coherent sound and environmental sound of a multi-channel signal.
Background
The spatial sound reproduction technology is widely applied to entertainment media, for example, when a movie theater, a home theater and a portable electronic device play a movie, spatial sound with a certain sound image width and good immersion feeling is reproduced through an earphone or a loudspeaker, and better audio-visual experience can be brought to consumers. In recent years, the spatial sound reproduction gradually shows important application prospects in the fields of advanced scientific research and practical engineering, such as the fields of virtual reality, aviation, aerospace and the like.
The spatial sound mainly comprises two components with different properties, one is a sound component with directivity, and is called coherent sound; the other is an acoustic component having diffusivity and no direction discrimination, which is called ambient sound. In order to achieve a better sound reproduction effect, coherent sound and Ambient sound Extraction (PAE) and different processing are required for spatial sound. For example, in an audio codec system, the PAE is used as a front end of audio encoding or decoding, and thus effective and immersive spatial sound playback can be achieved.
The development of the PAE method aiming at the two-channel signals is mature, and the PAE method is widely applied to a principal component analysis method and a least square method. For multi-channel signals, PAE may be performed using a pairwise correlation method. But the accuracy of extracting components by the pairwise correlation method is not high. Therefore, it is significant to extend the PAE method suitable for stereo to multi-channel signals. Under the premise that coherent sound occupies main energy, the principal component analysis method carries out PAE on the stereo signal by calculating the eigenvalue of the covariance matrix of the input signal. The method can also extract components of the multi-channel signals, but when the number of channels is large, the calculation complexity is increased, and the principal component analysis method has a good extraction effect only when coherent sound occupies main energy. The least square method realizes PAE of the stereo signal by calculating and estimating the weight of coherent sound under the premise that the environmental sound energy of each channel is equal. However, when the least squares method is directly applied to a multichannel signal, the estimation weight is not easily solved. The ambient sound component mainly plays a role of atmosphere warming in the space sound, and in order to achieve better surrounding feeling, the energy distribution difference of the ambient sound in each channel is small. Therefore, on the premise that the environmental sound energy of each channel is equal, the coherent sound and the environmental sound extraction of the multi-channel signal have important significance.
Disclosure of Invention
The invention aims to overcome the technical defects, and under the premise that the energy of the environmental sound in each channel is equal, the weight of coherent sound is estimated by using a least square method when the number of the channels is calculated to be less, and a weight expression when the coherent sound estimation is carried out on a multi-channel signal with any number of channels is obtained according to the regularity of the change of the weight along with the number of the channels.
In order to achieve the above object, the present invention provides a coherent sound and ambient sound extraction method for a multi-channel signal, the method comprising:
calculating weight expressions of the N channel signal coherent sounds, and estimating the coherent sounds according to the weight expressions, thereby calculating the coherent sounds of each channel; wherein the ambient sound energy of each channel is the same;
calculating the environment sound of each channel according to the coherent sound of each channel;
and carrying out inverse Fourier transform on the N channels of coherent sound and the N channels of environment sound to obtain coherent sound and environment sound represented by a time domain.
As an improvement of the above method, the method calculates a weight expression of the coherent sound of the N channel signals, estimates the coherent sound according to the weight expression, and thereby calculates the coherent sound of each channel; wherein the ambient sound energy of each channel is the same; the method specifically comprises the following steps:
fourier transform is carried out on time domain multi-channel signals, and the nth channel inputs a signal XnExpressed as:
Xn=βnS+An
wherein S represents the spectrum of coherent sound, βnRepresenting the amplitude difference factor of the coherent sound of the nth channel and the coherent sound of the first channel, N is more than or equal to 1 and less than or equal to N, beta1=1,AnA frequency spectrum representing the ambient sound of the nth channel;
calculating the nth channel input signal XnShort time energy of
Figure BDA0002506583330000021
Figure BDA0002506583330000022
Calculating a correlation value phi of the first channel and the second channel12
Figure BDA0002506583330000023
According to the short-time energy of the first channel
Figure BDA0002506583330000024
Short time energy of the second channel
Figure BDA0002506583330000025
And the correlation value phi between the two channels12Calculating intermediate parameters C and D:
Figure BDA0002506583330000026
Figure BDA0002506583330000027
from which the short-time energy P of the coherent sound is calculatedSShort-time energy of ambient sound PAAnd beta2
Figure BDA0002506583330000031
Figure BDA0002506583330000032
Figure BDA0002506583330000033
Calculating betan
Figure BDA0002506583330000034
The weight value of the nth channel is as follows:
Figure BDA0002506583330000035
then the estimate of the coherent sound
Figure BDA0002506583330000036
Comprises the following steps:
Figure BDA0002506583330000037
the nth channel coherent sound Sn
Figure BDA0002506583330000038
As an improvement of the above method, the ambient sound of each channel is calculated from the coherent sound of each channel; the method specifically comprises the following steps:
ambient sound of nth channel AnComprises the following steps:
An=Xn-Sn
embodiment 2 of the present invention provides a coherent sound and ambient sound extraction system for a multichannel signal, including:
the coherent sound extraction module is used for calculating weight expressions of the coherent sounds of the signals of the N channels, estimating the coherent sounds according to the weight expressions, and calculating the coherent sounds of each channel; wherein the ambient sound energy of each channel is the same;
the environment sound extraction module is used for calculating the environment sound of each channel according to the coherent sound of each channel;
and the frequency domain to time domain module is used for carrying out inverse Fourier transform on the N channels of coherent sound and the N channels of environment sound to obtain coherent sound and environment sound represented by time domain.
As an improvement of the above system, the implementation process of the coherent sound extraction module includes:
fourier transform is carried out on time domain multi-channel signals, and the nth channel inputs a signal XnExpressed as:
Xn=βnS+An
wherein S represents the spectrum of coherent sound, βnRepresenting the amplitude difference factor of the coherent sound of the nth channel and the coherent sound of the first channel, N is more than or equal to 1 and less than or equal to N, beta1=1,AnA frequency spectrum representing the ambient sound of the nth channel;
calculating the nth channel input signal XnShort time energy of
Figure BDA0002506583330000041
Figure BDA0002506583330000042
Calculating a correlation value phi of the first channel and the second channel12
Figure BDA0002506583330000043
According to the short-time energy of the first channel
Figure BDA0002506583330000044
Short time energy of the second channel
Figure BDA0002506583330000045
And the correlation value phi between the two channels12Calculating intermediate parameters C and D:
Figure BDA0002506583330000046
Figure BDA0002506583330000047
from which the short-time energy P of the coherent sound is calculatedSShort-time energy of ambient sound PAAnd beta2
Figure BDA0002506583330000048
Figure BDA0002506583330000049
Figure BDA00025065833300000410
Calculating betan
Figure BDA00025065833300000411
The weight value of the nth channel is as follows:
Figure BDA00025065833300000412
then the estimate of the coherent sound
Figure BDA00025065833300000413
Comprises the following steps:
Figure BDA0002506583330000051
the nth channel coherent sound Sn
Figure BDA0002506583330000052
As an improvement of the above system, the implementation process of the ambient sound calculation module includes:
ambient sound of nth channel AnComprises the following steps:
An=Xn-Sn
the invention has the advantages that:
the method of the invention explores a weight expression for estimating coherent sound aiming at signals with any number of channels under the condition that the environmental sound energy of each channel is the same, and solves each unknown parameter in the weight expression by using the signal energy of each channel and the correlation value among the channels, thereby realizing the extraction of the coherent sound and the environmental sound of the multi-channel signals and having high extraction precision.
Drawings
FIG. 1 is a flow chart of a coherent acoustic and ambient acoustic extraction method of a multi-channel signal of the present invention;
FIG. 2(a) is an error plot of coherent acoustic component extraction for a mixed five channel signal 1 using the method of the present invention and pairwise correlation;
FIG. 2(b) is an error plot of ambient sound component extraction for a mixed five channel signal 1 using the method of the present invention and pairwise correlation;
FIG. 3(a) is an error plot of coherent acoustic component extraction for a mixed five-channel signal 2 using the method of the present invention and pairwise correlation;
fig. 3(b) is an error map of ambient sound component extraction for a mixed five-channel signal 2 using the method of the present invention and the pairwise correlation method.
Detailed Description
The technical solution of the present invention is described in detail below with reference to the accompanying drawings and specific embodiments.
Example 1
As shown in fig. 1, embodiment 1 of the present invention proposes a coherent acoustic and ambient acoustic extraction method for equalizing ambient acoustic energy of each channel of a multi-channel signal, including the following steps:
step 1) framing a multichannel signal, performing Fourier transform to obtain a frequency spectrum, and expressing short-time energy of each channel and correlation values between any two channels according to a multichannel signal model, wherein the method specifically comprises the following steps:
in the multi-channel signal model, the input signal is represented as a superposition of coherent sound and ambient sound. Because the characteristics of coherent sound and environmental sound are different, the coherent sound of each channel is assumed to be completely correlated, namely, a linear relation exists; it is assumed that coherent sound is uncorrelated with ambient sound of each channel and ambient sound between channels.
Step 1-1), performing Fourier transform on the time domain multi-channel signal to obtain a frequency spectrum:
Xn=βnS+An,n=1,2,…,N
where N is the number of channels, S represents the frequency spectrum of the coherent sound, βnAn amplitude difference factor representing the presence of coherent sound of the nth channel and coherent sound of the first channel, and beta1=1,AnA frequency spectrum representing the ambient sound of the nth channel;
step 1-2) the signal energy of each channel can be expressed as:
Figure BDA0002506583330000061
wherein E { } represents a short-time average.
The correlation values between the channels of steps 1-3) can be expressed as:
Figure BDA0002506583330000062
wherein,
Figure BDA0002506583330000063
is n th1A channel and an n-th channel2Correlation value between channels, n1=1,2,…,N,n2=1,2,…,N,n1≠n2
Step 2) calculating the weight value of the coherent sound estimated by using a least square method when the number of channels is small, and exploring the regularity of the coherent sound, wherein the method specifically comprises the following steps:
step 2-1) calculating an input signal X for the two-channel signal1And X2Estimating the weight value of the coherent sound S:
step 2-1-1) estimating coherent sound S:
Figure BDA0002506583330000064
wherein, w1And w2Representing the estimated weights to be found.
Step 2-1-2) estimation error σ of SSExpressed as:
Figure BDA0002506583330000065
step 2-1-3) is solved by using a least square algorithm, namely when the estimation error is completely uncorrelated with the input stereo signal, the obtained weight is an optimal estimation:
E{σSX1}=0
E{σSX2}=0.
at this time, the weight of the optimal estimation is expressed as:
Figure BDA0002506583330000071
Figure BDA0002506583330000072
wherein, PSShort-time energy, P, representing coherent soundARepresenting the short-term energy of the ambient sound.
Step 2-2) calculating an input signal X for the three-channel signal1、X2And X3Estimating the weight value of the coherent sound S:
step 2-2-1) estimating coherent sound S:
Figure BDA0002506583330000073
wherein, w1、w2And w3Representing the estimated weights to be found.
Step 2-2-2) and the processing method similar to step 2-1) can obtain the weight value of the three-channel signal estimated coherent sound:
Figure BDA0002506583330000074
Figure BDA0002506583330000075
Figure BDA0002506583330000076
and 2-3) calculating more estimated weights of coherent sounds when the number of channels is more, and finding that the weight values can be uniformly expressed. For a multi-channel signal with a number of channels N, the estimated coherent sound is represented as:
Figure BDA0002506583330000077
wherein, the weight value can be expressed as:
Figure BDA0002506583330000081
step 3) calculating and estimating each unknown parameter in the weight of the coherent sound, and completing the extraction of the coherent sound and the environmental sound of the multichannel signal, wherein the method specifically comprises the following steps:
step 3-1) known of beta 11, the unknown parameter P can be obtained from the signal energy of the first two channels and the correlation value between the channels in step 1)S、PAAnd beta2
Figure BDA0002506583330000082
Figure BDA0002506583330000083
Figure BDA0002506583330000084
Wherein,
Figure BDA0002506583330000085
Figure BDA0002506583330000086
step 3-2) according to the energy values of other channels except the first channel and the second channel, the beta when N is more than or equal to 3 and less than or equal to N can be obtainedn
Figure BDA0002506583330000087
Step 3-3) aiming at the multichannel signal with the number of channels N, the parameter P calculated in the step 3-1) and the step 3-2) isS、PAAnd betanSubstituting (N-1, 2, …, N) into the weight value w of the estimated coherent sound calculated in step 2-3)n(1,2, …, N) the operation of extracting coherent sound from the multi-channel signal can be completed.
Step 4) PAE is carried out on the multichannel signals with any number of channels, and the method specifically comprises the following steps:
step 4-1) calculating coherent sound of each channel, which specifically comprises the following steps:
since step 2) calculates the weight expression for estimating coherent sound when PAE is performed on the multichannel signal with any number of channels, and step 3) calculates each unknown parameter in the weight expression, when the number of channels of the multichannel signal is determined, the coherent sound S can be estimated directly according to the weight expression. The coherent sound is directly the coherent sound of the first channel, the coherent sound of other channels is obtained by S linear processing, namely betanS(n=2,…,N)。
Step 4-2) calculating the environment sound of each channel, which specifically comprises the following steps:
the remaining component of each channel is considered as ambient sound, i.e. An=XnnS。
And 4-3) carrying out inverse Fourier transform on the obtained N-channel coherent sound and N-channel environment sound to obtain coherent sound and environment sound represented by a time domain.
The following describes the performance of the method proposed by the present invention with reference to the simulation example:
and synthesizing the completely correlated coherent sound and the completely uncorrelated environmental sound into a mixed five-channel signal according to a certain proportion, and performing component extraction by using the multi-channel PAE method and the pairwise correlation method provided by the invention. Two groups of mixed multi-channel signals are synthesized, namely a mixed five-channel signal 1 with pure voice as coherent sound and sea wave sound as environment sound, and a mixed five-channel signal 2 with pure music sound as coherent sound and forest background sound as environment sound. In mixing, in order to control the distribution of coherent sound energy between channels, a coherent sound amplitude difference factor beta between channels is setnWith its reference value beta0The components are in a certain proportional relation; setting the ambient sound energy of each channel equal to PA0(ii) a In order to control the proportion of coherent sound components in the mixed signal, different coherent sound energy proportion gamma is set. Reference value beta0And PA0Determined by gamma.
The experiment sets the amplitude of coherent sound of each channel to have beta1=β2=β0,β3=2β0,β4=β5=0.5β0The proportional relationship of (c). The coherent acoustic energy ratio γ is 0.05 to 0.95 (interval is 0.1). Extraction error epsilon of coherent soundPRespectively expressed as:
Figure BDA0002506583330000091
extraction error epsilon of environmental soundaRespectively expressed as:
Figure BDA0002506583330000092
fig. 2(a) and 2(b) represent the extraction errors of coherent sound and ambient sound when PAE is performed on the mixed five-channel signal 1 by the algorithm and the pairwise correlation method proposed by the present invention, respectively; fig. 3(a) and 3(b) represent extraction errors of coherent sound and ambient sound when the algorithm and the pairwise correlation method proposed by the present invention perform PAE on the mixed five-channel signal 2, respectively. It can be seen that, in the whole interval of the coherent acoustic energy ratio gamma of 0.05 to 0.95 (interval of 0.1), the extraction error of the method provided by the invention is smaller than that of the pairwise correlation method.
Embodiment 2 of the present invention provides a coherent sound and ambient sound extraction system for a multichannel signal, including:
the coherent sound extraction module is used for calculating weight expressions of the coherent sounds of the signals of the N channels, estimating the coherent sounds according to the weight expressions, and calculating the coherent sounds of each channel; wherein the ambient sound energy of each channel is the same;
the environment sound extraction module is used for calculating the environment sound of each channel according to the coherent sound of each channel;
and the frequency domain to time domain module is used for carrying out inverse Fourier transform on the N channels of coherent sound and the N channels of environment sound to obtain coherent sound and environment sound represented by time domain.
Finally, it should be noted that the above embodiments are only used for illustrating the technical solutions of the present invention and are not limited. Although the present invention has been described in detail with reference to the embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (4)

1. A method of coherent acoustic and ambient acoustic extraction of a multichannel signal, the method comprising:
calculating weight expressions of the N channel signal coherent sounds, and estimating the coherent sounds according to the weight expressions, thereby calculating the coherent sounds of each channel; wherein the ambient sound energy of each channel is the same;
calculating the environment sound of each channel according to the coherent sound of each channel;
carrying out inverse Fourier transform on the N channels of coherent sound and the N channels of environment sound to obtain coherent sound and environment sound represented by a time domain;
the weight expression of the coherent sound of the signals of the N channels is calculated, and the coherent sound is estimated according to the weight expression, so that the coherent sound of each channel is calculated; wherein the ambient sound energy of each channel is the same; the method specifically comprises the following steps:
fourier transform is carried out on time domain multi-channel signals, and the nth channel inputs a signal XnExpressed as:
Xn=βnS+An
wherein S represents the spectrum of coherent sound, βnRepresenting the amplitude difference factor of the coherent sound of the nth channel and the coherent sound of the first channel, N is more than or equal to 1 and less than or equal to N, beta1=1,AnA frequency spectrum representing the ambient sound of the nth channel;
calculating the nth channel input signal XnShort time energy of
Figure FDA0002918929400000011
Figure FDA0002918929400000012
Calculating a correlation value phi of the first channel and the second channel12
Figure FDA0002918929400000013
According to the short-time energy of the first channel
Figure FDA0002918929400000014
Short time energy of the second channel
Figure FDA0002918929400000015
And the correlation value phi between the two channels12Calculating intermediate parameters C and D:
Figure FDA0002918929400000016
Figure FDA0002918929400000017
from which the short-time energy P of the coherent sound is calculatedSShort-time energy of ambient sound PAAnd beta2
Figure FDA0002918929400000021
Figure FDA0002918929400000022
Figure FDA0002918929400000023
Calculating betan
Figure FDA0002918929400000024
The weight value of the nth channel is as follows:
Figure FDA0002918929400000025
then the estimate of the coherent sound
Figure FDA0002918929400000026
Comprises the following steps:
Figure FDA0002918929400000027
the nth channel coherent sound Sn
Figure FDA0002918929400000028
2. The method according to claim 1, wherein the method calculates the ambient sound of each channel from the coherent sound of each channel; the method specifically comprises the following steps:
ambient sound of nth channel AnComprises the following steps:
An=Xn-Sn
3. a coherent acoustic and ambient acoustic extraction system for a multichannel signal, the system comprising:
the coherent sound extraction module is used for calculating weight expressions of the coherent sounds of the signals of the N channels, estimating the coherent sounds according to the weight expressions, and calculating the coherent sounds of each channel; wherein the ambient sound energy of each channel is the same;
the environment sound extraction module is used for calculating the environment sound of each channel according to the coherent sound of each channel;
the frequency domain to time domain conversion module is used for carrying out inverse Fourier transform on the N channels of coherent sound and the N channels of environment sound to obtain coherent sound and environment sound represented by a time domain;
the specific implementation process of the coherent sound extraction module comprises the following steps:
fourier transform is carried out on time domain multi-channel signals, and the nth channel inputs a signal XnExpressed as:
Xn=βnS+An
wherein S represents the spectrum of coherent sound, βnRepresenting the amplitude difference factor of the coherent sound of the nth channel and the coherent sound of the first channel, N is more than or equal to 1 and less than or equal to N, beta1=1,AnA frequency spectrum representing the ambient sound of the nth channel;
calculating the nth channel input signal XnShort time energy of
Figure FDA0002918929400000031
Figure FDA0002918929400000032
Calculating a correlation value phi of the first channel and the second channel12
Figure FDA0002918929400000033
According to the short-time energy of the first channel
Figure FDA0002918929400000034
Short time energy of the second channel
Figure FDA0002918929400000035
And the correlation value phi between the two channels12Calculating intermediate parameters C and D:
Figure FDA0002918929400000036
Figure FDA0002918929400000037
from which the short-time energy P of the coherent sound is calculatedSShort-time energy of ambient sound PAAnd beta2
Figure FDA0002918929400000038
Figure FDA0002918929400000039
Figure FDA00029189294000000310
Calculating betan
Figure FDA00029189294000000311
The weight value of the nth channel is as follows:
Figure FDA00029189294000000312
then the estimate of the coherent sound
Figure FDA00029189294000000313
Comprises the following steps:
Figure FDA0002918929400000041
the nth channel coherent sound Sn
Figure FDA0002918929400000042
4. The system for coherent acoustic and ambient sound extraction of multichannel signal according to claim 3, wherein the ambient sound extraction module is implemented by:
ambient sound of nth channel AnComprises the following steps:
An=Xn-Sn
CN202010447863.9A 2020-05-25 2020-05-25 Coherent sound and environmental sound extraction method and system of multichannel signal Active CN111669697B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010447863.9A CN111669697B (en) 2020-05-25 2020-05-25 Coherent sound and environmental sound extraction method and system of multichannel signal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010447863.9A CN111669697B (en) 2020-05-25 2020-05-25 Coherent sound and environmental sound extraction method and system of multichannel signal

Publications (2)

Publication Number Publication Date
CN111669697A CN111669697A (en) 2020-09-15
CN111669697B true CN111669697B (en) 2021-05-18

Family

ID=72384501

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010447863.9A Active CN111669697B (en) 2020-05-25 2020-05-25 Coherent sound and environmental sound extraction method and system of multichannel signal

Country Status (1)

Country Link
CN (1) CN111669697B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101401151A (en) * 2006-03-15 2009-04-01 法国电信公司 Device and method for graduated encoding of a multichannel audio signal based on a principal component analysis
CN101816191A (en) * 2007-09-26 2010-08-25 弗劳恩霍夫应用研究促进协会 Be used for obtaining extracting the apparatus and method and the computer program that are used to extract ambient signal of apparatus and method of the weight coefficient of ambient signal
EP2523473A1 (en) * 2011-05-11 2012-11-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating an output signal employing a decomposer
CN109036455A (en) * 2018-09-17 2018-12-18 中科上声(苏州)电子有限公司 Direct sound wave and background sound extracting method, speaker system and its sound playback method
CN110534129A (en) * 2018-05-23 2019-12-03 哈曼贝克自动系统股份有限公司 The separation of dry sound and ambient sound

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103902822B (en) * 2014-03-28 2017-09-08 西安交通大学苏州研究院 Sources number detection method in the case of the mixing of incoherent and coherent signal
CN110531310B (en) * 2019-07-25 2021-07-13 西安交通大学 Far-field coherent signal direction-of-arrival estimation method based on subspace and interpolation transformation

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101401151A (en) * 2006-03-15 2009-04-01 法国电信公司 Device and method for graduated encoding of a multichannel audio signal based on a principal component analysis
CN101816191A (en) * 2007-09-26 2010-08-25 弗劳恩霍夫应用研究促进协会 Be used for obtaining extracting the apparatus and method and the computer program that are used to extract ambient signal of apparatus and method of the weight coefficient of ambient signal
EP2523473A1 (en) * 2011-05-11 2012-11-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating an output signal employing a decomposer
CN110534129A (en) * 2018-05-23 2019-12-03 哈曼贝克自动系统股份有限公司 The separation of dry sound and ambient sound
CN109036455A (en) * 2018-09-17 2018-12-18 中科上声(苏州)电子有限公司 Direct sound wave and background sound extracting method, speaker system and its sound playback method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
相干声与环境声提取方法的客观性能评估;吴彦琴等;《声学技术》;20191031;第38卷(第5期);全文 *

Also Published As

Publication number Publication date
CN111669697A (en) 2020-09-15

Similar Documents

Publication Publication Date Title
CN110089134B (en) Method, system and computer readable medium for reproducing spatially distributed sound
CN111316354B (en) Determination of target spatial audio parameters and associated spatial audio playback
US11832080B2 (en) Spatial audio parameters and associated spatial audio playback
EP2285139B1 (en) Device and method for converting spatial audio signal
Avendano et al. A frequency-domain approach to multichannel upmix
JP2024084842A (en) Method or device for compressing or decompressing higher-order ambisonic signal representations
RU2361185C2 (en) Device for generating multi-channel output signal
US10362431B2 (en) Headtracking for parametric binaural output system and method
US20230100071A1 (en) Rendering reverberation
Avendano et al. Frequency domain techniques for stereo to multichannel upmix
CN105578379B (en) Device and method for generating the output signal at least two output channels
US20100169102A1 (en) Low complexity mpeg encoding for surround sound recordings
Farina et al. Ambiophonic principles for the recording and reproduction of surround sound for music
US8873762B2 (en) System and method for efficient sound production using directional enhancement
Su et al. Inras: Implicit neural representation for audio scenes
Herzog et al. Ambisep: Ambisonic-to-ambisonic reverberant speech separation using transformer networks
Kim et al. A robust framework for sound event localization and detection on real recordings
CN111669697B (en) Coherent sound and environmental sound extraction method and system of multichannel signal
Chun et al. Real-time conversion of stereo audio to 5.1 channel audio for providing realistic sounds
US20240282321A1 (en) Multichannel audio encode and decode using directional metadata
CN104424971B (en) A kind of audio file play method and device
WO2020057050A1 (en) Method for extracting direct sound and background sound, and loudspeaker system and sound reproduction method therefor
CN117376784A (en) Method for expanding mono stereo field, electronic device, and storage medium
CN111711918B (en) Coherent sound and environmental sound extraction method and system of multichannel signal
Hong et al. End-to-end sound field reproduction based on deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant