EP1623411B1 - Fidelity-optimised variable frame length encoding - Google Patents
Fidelity-optimised variable frame length encoding Download PDFInfo
- Publication number
- EP1623411B1 EP1623411B1 EP04820553A EP04820553A EP1623411B1 EP 1623411 B1 EP1623411 B1 EP 1623411B1 EP 04820553 A EP04820553 A EP 04820553A EP 04820553 A EP04820553 A EP 04820553A EP 1623411 B1 EP1623411 B1 EP 1623411B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- encoding
- sub
- frames
- mono
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 238000000034 method Methods 0.000 claims abstract description 75
- 230000005236 sound signal Effects 0.000 claims abstract description 18
- 230000001419 dependent effect Effects 0.000 claims description 6
- 230000003595 spectral effect Effects 0.000 claims description 5
- 230000005540 biological transmission Effects 0.000 description 16
- 238000010586 diagram Methods 0.000 description 14
- 230000006870 function Effects 0.000 description 14
- 230000008447 perception Effects 0.000 description 7
- 238000013459 approach Methods 0.000 description 6
- 230000008901 benefit Effects 0.000 description 6
- 230000002123 temporal effect Effects 0.000 description 6
- 230000009286 beneficial effect Effects 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000007781 pre-processing Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000013139 quantization Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000001052 transient effect Effects 0.000 description 3
- 230000006399 behavior Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 230000008054 signal transmission Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000003750 conditioning effect Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000009191 jumping Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008450 motivation Effects 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 125000000391 vinyl group Chemical group [H]C([*])=C([H])[H] 0.000 description 1
- 229920002554 vinyl polymer Polymers 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
Definitions
- the present invention relates in general to encoding of audio signals, and in particular to encoding of multi-channel audio signals.
- stereophonic or multi-channel coding of audio signals is to encode the signals of the different channels separately as individual and independent signals.
- Another basic way used in stereo FM radio transmission and which ensures compatibility with legacy mono radio receivers is to transmit a sum and a difference signal of the two involved channels.
- M/ S stereo coding is similar to the described procedure in stereo FM radio, in a sense that it encodes and transmits the sum and difference signals of the channel sub-bands and thereby exploits redundancy between the channel sub-bands.
- the structure and operation of an encoder based on M/S stereo coding is described, e.g. in US patent 5,285,498 by J.D. Johnston .
- Intensity stereo on the other hand is able to make use of stereo irrelevancy. It transmits the joint intensity of the channels (of the different sub-bands) along with some location information indicating how the intensity is distributed among the channels. Intensity stereo does only provide spectral magnitude information of the channels. Phase information is not conveyed. For this reason and since the temporal inter-channel information (more specifically the inter-channel time difference) is of major psycho-acoustical relevancy particularly at lower frequencies, intensity stereo can only be used at high frequencies above e.g. 2 kHz. An intensity stereo coding method is described, e.g. in the European patent 0497413 by R. Veldhuis et al .
- a recently developed stereo coding method is described, e.g. in a conference paper with the title "Binaural cue coding applied to stereo and multi-channel audio compression", 112th AES convention, May 2002, Kunststoff, Germany by C. Faller et al.
- This method is a parametric multi-channel audio coding method.
- the basic principle is that at the encoding side, the input signals from N channels c 1 , c 2 , ... c N are combined to one mono signal m.
- the mono signal is audio encoded using any conventional monophonic audio codec.
- parameters are derived from the channel signals, which describe the multi-channel image.
- the parameters are encoded and transmitted to the decoder, along with the audio bit stream.
- the decoder first decodes the mono signal m' and then regenerates the channel signals c 1 ', c 2 ',..., c N ', based on the parametric description of the multi-channel image.
- the principle of the Binaural Cue Coding (BCC) method is that it transmits the encoded mono signal and so-called BCC parameters.
- the BCC parameters comprise coded inter-channel level differences and inter-channel time differences for sub-bands of the original multi-channel input signal.
- the decoder regenerates the different channel signals by applying sub-band-wise level and phase adjustments of the mono signal based on the BCC parameters.
- M/S or intensity stereo is that stereo information comprising temporal inter-channel information is transmitted at much lower bit rates.
- this technique requires computational demanding time-frequency transforms on each of the channels, both at the encoder and the decoder.
- BCC does not handle the fact that a lot of the stereo information, especially at low frequencies, is diffuse, i.e. it does not come from any specific direction. Diffuse sound fields exist in both channels of a stereo recording but they are to a great extent out of phase with respect to each other. If an algorithm such as BCC is subject to recordings with a great amount of diffuse sound fields the reproduced stereo image will become confused, jumping from left to right as the BCC algorithm can only pan the signal in specific frequency bands to the left or right.
- a possible means to encode the stereo signal and ensure good reproduction of diffuse sound fields is to use an encoding scheme very similar to the technique used in FM stereo radio broadcast, namely to encode the mono (Left+Right) and the difference (Left-Right) signals separately.
- a technique, described in US patent 5,434,948 by C.E. Holt et al. uses a similar technique as in BCC for encoding the mono signal and side information.
- side information consists of predictor filters and optionally a residual signal.
- the predictor filters estimated by a least-mean-square algorithm, when applied to the mono signal allow the prediction of the multi-channel audio signals. With this technique one is able to reach very low bit rate encoding of multi-channel audio sources, however, at the expense of a quality drop, discussed further below.
- This technique synthesises the right and left channel signals by filtering sound source signals with so-called head-related filters.
- this technique requires the different sound source signals to be separated and can thus not generally be applied for stereo or multi-channel coding.
- a further problem with schemes based on encoding of a main and one or several side signals is that they often require relatively large computational resources.
- handling discontinuities in parameters from one frame to another is a complex task.
- estimation errors of transient sound may cause very large side signals, in turn increasing the transmission rate demand.
- An object of the present invention is therefore to provide an encoding method and device improving the perception quality of multi-channel audio signals, in particular to avoid artefacts such as pre-echoing, ghost-like sounds or frame discontinuity artefacts.
- a further object of the present invention is to provide an encoding method and device requiring less processing power and having more constant transmission bit rate requirements.
- polyphonic signals are used to create a main signal, typically a mono signal, and a side signal.
- the main signal is encoded according to prior-art encoding principles.
- a number of encoding schemes for the side signal are provided.
- Each encoding scheme is characterised by a set of sub-frames of different lengths.
- the total length of the sub-frames corresponds to the length of the encoding frame of the encoding scheme.
- the sets of sub-frames comprise at least one sub-frame.
- the encoding scheme to be used on the side signal is selected at least partly dependent on the present signal content of the polyphonic signals.
- the selection takes place, before the encoding, based on signal characteristics analysis.
- the side signal is encoded by each of the encoding schemes, and based on measurements of the quality of the encoding, the best encoding scheme is selected.
- a side residual signal is created as the difference between the side signal and the main signal scaled with a balance factor.
- the balance factor is selected to minimise the side residual signal.
- the optimised side residual signal and the balance factor are encoded and provided as parameters representing the side signal. At the decoder side, the balance factor, the side residual signal and the man signal are used to recover the side signal.
- the encoding of the side signal comprises an energy contour scaling in order to avoid pre-echoing effects.
- different encoding schemes may comprise different encoding procedures in the separate sub-frames.
- the main advantage with the present invention is that the preservation of the perception of the audio signals is improved. Furthermore, the present invention still allows multi-channel signal transmission at very low bit rates.
- FIG. 1 illustrates a typical system 1, in which the present invention advantageously can be utilised.
- a transmitter 10 comprises an antenna 12 including associated hardware and software to be able to transmit radio signals 5 to a receiver 20.
- the transmitter 10 comprises among other parts a multi-channel encoder 14, which transforms signals of a number of input channels 16 into output signals suitable for radio transmission. Examples of suitable multi-channel encoders 14 are described in detail further below.
- the signals of the input channels 16 can be provided from e.g. an audio signal storage 18, such as a data file of digital representation of audio recordings, magnetic tape or vinyl disc recordings of audio etc.
- the signals of the input channels 16 can also be provided in "live", e.g. from a set of microphones 19.
- the audio signals are digitised, if not already in digital form, before entering the multi-channel encoder 14.
- an antenna 22 with associated hardware and software handles the actual reception of radio signals 5 representing polyphonic audio signals.
- typical functionalities such as e.g. error correction, are performed.
- a decoder 24 decodes the received radio signals 5 and transforms the audio data carried thereby into signals of a number of output channels 26.
- the output signals can be provided to e.g. loudspeakers 29 for immediate presentation, or can be stored in an audio signal storage 28 of any kind.
- the system 1 can for instance be a phone conference system, a system for supplying audio services or other audio applications.
- the communication has to be of a duplex type, while e.g. distribution of music from a service provider to a subscriber can be essentially of a one-way type.
- the transmission of signals from the transmitter 10 to the receiver 20 can also be performed by any other means, e.g. by different kinds of electromagnetic waves, cables or fibres as well as combinations thereof.
- Fig. 2a illustrates an embodiment of an encoder according to the present invention.
- the polyphonic signal is a stereo signal comprising two channels a and b, received at input 16A and 16B, respectively.
- the signals of channel a and b are provided to a pre-processing unit 32, where different signal conditioning procedures may be performed.
- the (perhaps modified) signals from the output of the pre-processing unit 32 are summed in an addition unit 34.
- This addition unit 34 also divides the sum by a factor of two.
- the signal x mono produced in this way is a main signal of the stereo signals, since it basically comprises all data from both channels. In this embodiment the main signal thus represents a pure "mono" signal.
- the main signal x mono is provided to a main signal encoder unit 38, which encodes the main signal according to any suitable encoding principles. Such principles are available within prior-art and are thus not further discussed here.
- the main signal encoder unit 38 gives an output signal p mono , being encoding parameters representing a main signal.
- a difference (divided by a factor of two) of the channel signals is provided as a side signal x side .
- the side signal represents the difference between the two channels in the stereo signal.
- the side signal x side is provided to a side signal encoding unit 30. Preferred embodiments of the side signal encoding unit 30 will be discussed further below.
- the side signal x side is transferred into encoding parameters p side representing a side signal x side .
- this encoding takes place utilising also information of the main signal x mono .
- the arrow 42 indicates such a provision, where the original uncoded main signal x mono is utilised.
- the main signal information that is used in the side signal encoding unit 30 can be deduced from the encoding parameters p mono representing the main signal, as indicated by the broken line 44.
- the encoding parameters p mono representing the main signal x mono is a first output signal
- the encoding parameters p side representing the side signal x side is a second output signal.
- these two output signals p mono , p side together representing the full stereo sound, are multiplexed into one transmission signal 52 in a multiplexor unit 40.
- the transmission of the first and second output signals p mono , p side may take place separately.
- a decoder 24 In Fig. 2b, an embodiment of a decoder 24 according to the present invention is illustrated as a block scheme.
- the received signal 54 comprising encoding parameters representing the main and side signal information are provided to a demultiplexor unit 56, which separates a first and second input signal, respectively.
- the first input signal corresponding to encoding parameters p mono of a main signal, is provided to a main signal decoder unit 64.
- the encoding parameters p mono representing the main signal are used to generate an decoded main signal x" mono , being as similar to the main signal x mono (Fig. 2a) of the encoder 14 (Fig. 2a) as possible.
- the second input signal corresponding to a side signal
- the encoding parameters p side representing the side signal are used to recover a decoded side signal x" side .
- the decoding procedure utilises information about the main signal x" mono , as indicated by an arrow.
- the decoded main and side signals x" mono , x" side are provided to an addition unit 70, which provides an output signal that is a representation of the original signal of channel a.
- a difference provided by a subtraction unit 68 provides an output signal that is a representation of the original signal of channel b.
- These channel signals may be post-processed in a post-processor unit 74 according to prior-art signal processing procedures.
- the channel signals a and b are provided at the outputs 26A and 26B of the decoder.
- encoding is typically performed in one frame at a time.
- a frame comprises audio samples within a pre-defined time period.
- a frame SF2 of time duration L is illustrated.
- the audio samples within the unhatched portion are to be encoded together.
- the preceding samples and the subsequent samples are encoded in other frames.
- the division of the samples into frames will in any case introduce some discontinuities at the frame borders. Shifting sounds will give shifting encoding parameters, changing basically at each frame border. This will give rise to perceptible errors.
- One way to compensate somewhat for this is to base the encoding, not only on the samples that are to be encoded, but also on samples in the absolute vicinity of the frame, as indicated by the hatched portions.
- interpolation techniques are sometimes also utilised for reducing perception artefacts caused by frame borders.
- all such procedures require large additional computational resources, and for certain specific encoding techniques, it might also be difficult to provide it with any resources.
- the audio perception will be improved by using a frame length for encoding of the side signal that is dependent on the present signal content. Since the influence of different frame lengths on the audio perception will differ depending on the nature of the sound to be encoded, an improvement can be obtained by letting the nature of the signal itself affect the frame length that is used.
- the encoding of the main signal is not the object of the present invention and is therefore not described in detail. However, the frame lengths used for the main signal may or may not be equal to the frame lengths used for the side signal.
- FIG. 3b One embodiment of a side signal encoder unit 30 according to the present invention is illustrated in Fig. 3b, in which a closed loop decision is utilised.
- a basic encoding frame of length L is used here.
- a number of encoding schemes 81 characterised by a separate set 80 of sub-frames, are created.
- Each set 80 of sub-frames comprises one or more sub-frames of equal or differing lengths.
- the total length of the set 80 of sub-frames is, however, always equal to the basic encoding frame length L.
- the top encoding scheme is characterised by a set of sub-frames comprising only one sub-frame of length L.
- the next set of frames comprises two frames of length L/2.
- the third set comprises two frames of length L/4 followed by a L/2 frame.
- the signal x side provided to the side signal encoder unit 30 is encoded by all encoding schemes 81. In the top encoding scheme, the entire basic encoding frame is encoded in one piece. However, in the other encoding schemes, the signal x side is encoded in each sub-frame separately from each other.
- the result from each encoding scheme is provided to a selector 85.
- a fidelity measurement means 83 determines a fidelity measure for each of the encoded signals.
- the fidelity measure is an objective quality value, preferably a signal-to-noise measure or a weighted signal-to-noise ratio.
- the fidelity measures associated with each encoding scheme are compared and the result controls a switching means 87 to select the encoding parameters representing the side signal from the encoding scheme giving the best fidelity measure as the output signal p side from the side signal encoder unit 30.
- Fig. 3c another embodiment of a side signal encoder unit 30 according to the present invention is illustrated.
- the frame length decision is an open loop decision, based on the statistics of the signal.
- the spectral characteristics of the side signal will be used as a base for deciding which encoding scheme that is going to be used.
- different encoding schemes characterised by different sets of sub-frames are available.
- the selector 85 is placed before the actual encoding.
- the input side signal x side enters the selector 85 and a signal analysing unit 84.
- the result of the analysis becomes the input of a switch 86, in which only one of the encoding schemes 81 are utilised.
- the output from that encoding scheme will also be the output signal p side from the side signal encoder unit 30.
- the advantage with an open loop decision is that only one actual encoding has to be performed.
- the disadvantage is, however, that the analysis of the signal characteristics may be very complicated indeed and it may be difficult to predict possible behaviours in advance to be able to give an appropriate choice in the switch 86.
- a lot of statistical analysis of sound has to be performed and included in the signal analysing unit 84. Any small change in the encoding schemes may turn upside down on the statistical behaviour.
- variable frame length coding for the side signal is that one can select between a fine temporal resolution and coarse frequency resolution on one side and coarse temporal resolution and fine frequency resolution on the other.
- the above embodiments will preserve the stereo image in the best possible manner.
- the method presented in US 5,434,948 uses a filtered version of the mono (main) signal to resemble the side or difference signal.
- the filter parameters are optimised and allowed to vary in time.
- the filter parameters are then transmitted representing an encoding of the side signal.
- a residual side signal is transmitted.
- Such an approach would be possible to use as side signal encoding method within the scope of the present invention.
- This approach has, however, some disadvantages.
- the quantisation of the of the filter coefficients and any residual side signal often require relatively high bit rates for transmission, since the filter order has to be high to provide an accurate side signal estimate.
- the estimation of the filter itself may be problematic, especially in cases of transient rich music.
- Estimation errors will give a modified side signal that is sometimes larger in magnitude than the unmodified signal. This will lead to higher bit rate demands. Moreover, if a new set of filter coefficients are computed every N samples, the filter coefficients need to be interpolated to yield a smooth transition from one set of filter coefficients to another, as discussed above. Interpolation of filter coefficients is a complex task and errors in the interpolation will manifest itself in large side error signals leading to higher bit rates needed for the difference error signal encoder.
- a means to avoid the need for interpolation is to update the filter coefficients on a sample-by-sample basis and rely on backwards-adaptive analysis. For this to work well it is needed that the bit rate of the residual encoder is fairly high. This is therefore not a good alternative for low bit rate stereo coding.
- the encoding of the side signal is based on the idea to reduce the redundancy between the mono and side signal by using a simple balance factor instead of a complex bit rate consuming predictor filter.
- the residual of this operation is then encoded.
- the magnitude of such a residual is relatively small and does not call for very high bit rate need for transfer. This idea is very suitable indeed to combine with the variable frame set approach described earlier, since the computational complexity is low.
- the use of a balance factor combined with the variable frame length approach removes the need for complex interpolation and the associated problems that interpolation may cause. Moreover, the use of a simple balance factor instead of a complex filter gives fewer problems with estimation as possible estimation errors for the balance factor has less impact. The preferred solution will be able to reproduce both panned signals and diffuse sound fields with good quality and with limited bit rate requirements and computational resources.
- Fig. 4 illustrates a preferred embodiment of a stereo encoder according to the present invention.
- This embodiment is very similar to the one shown in Fig. 2a, however, with the details of the side signal encoder unit 30 revealed.
- the encoder 14 of this embodiment does not have any pre-processing unit, and the input signals are provided directly to the addition and subtraction units 34, 36.
- the mono signal x mono is multiplied with a certain balance factor g sm in a multiplier 33.
- a subtraction unit 35 the multiplied mono signal is subtracted from the side signal x side , i.e. essentially the difference between the two channels, to produce a side residual signal.
- the balance factor g sm is determined based on the content of the mono and side signals by the optimiser 37 in order to minimise the side residual signal according to a quality criterion.
- the quality criterion is preferably a least mean square criterion.
- the side residual signal is encoded in a side residual encoder 39 according to any encoder procedures.
- the side residual encoder 39 is a low bit rate transform encoder or a CELP (Codebook Excited Linear Prediction) encoder.
- the encoding parameters p side representing the side signal then comprises the encoding parameters p side residual representing the side residual signal and the optimised balance factor 49.
- the mono signal 42 used for synthesising the side signals is the target signal x mono for the mono encoder 38.
- the local synthesis signal of the mono encoder 38 can also be utilised. In the latter case, the total encoder delay may be increased and the computational complexity for the side signal may increase. On the other hand, the quality may be better as it is then possible to repair coding errors made in the mono encoder.
- x mono n ⁇ ⁇ a n + 1 - ⁇ ⁇ b n
- x side n ⁇ ⁇ a n - 1 - ⁇ ⁇ b n 0 ⁇ ⁇ ⁇ 1.0.
- the balance factor is used to minimise the residual side signal. In the special case where it is minimised in a mean square sense, this is equivalent to minimising the energy of the residual side signal x side residual .
- weighting in the frequency domain it is possible to add weighting in the frequency domain to the computation of the balance factor. This is done by convoluting the x side and x mono signals with the impulse response of a weighting filter. It is then possible to move the estimation error to a frequency range where they are less easy to hear. This is referred to as perceptual weighting.
- Q g (..) is a quantization function that is applied to the balance factor given by the function f ( x mono , x side ).
- the balance factor is transmitted on the transmission channel. In normal left-right panned signals the balance factor is limited to the interval [-1.0 1.0]. If on the other hand the channels are out of phase with regards to one another, the balance factor may extend beyond these limits.
- g Q Q g - 1 ( Q g
- E s is the encoding function (e.g. a transform encoder) of the residual side signal and E m is the encoding function of the mono signal
- E m is the encoding function of the mono signal
- One important benefit from computing the balance factor for each frame is that one avoids the use of interpolation. Instead, normally, as described above, the frame processing is performed with overlapping frames.
- the encoding principle using balance factors operates particularly well in the case of music signals, where fast changes typically are needed to track the stereo image.
- multi-channel coding has become popular.
- One example is 5.1 channel surround sound in DVD movies.
- the channels are there arranged as: front left, front centre, front right, rear left, rear right and subwoofer.
- Fig. 5 an embodiment of an encoder that encodes the three front channels in such an arrangement exploiting interchannel redundancies according to the present invention is shown.
- a centre signal encoder unit 130 is added, which receives the centre signal x centre .
- the mono signal 42 is in this embodiment the encoded and decoded mono signal x" mono , and is multiplied with a certain balance factor g Q in a multiplier 133.
- the multiplied mono signal is subtracted from the centre signal x centre , to produce a centre residual signal.
- the balance factor g Q is determined based on the content of the mono and centre signals by an optimiser 137 in order to minimise the centre residual signal according to the quality criterion.
- the centre residual signal is encoded in a centre residual encoder 139 according to any encoder procedures.
- the centre residual encoder 139 is a low bit rate transform encoder or a CELP encoder.
- the encoding parameters p centre representing the centre signal then comprises the encoding parameters p centre residual representing the centre residual signal and the optimised balance factor 149.
- the centre residual signal and the scaled mono signal are added in an addition unit 235, creating a modified centre signal 142 being compensated for encoding errors.
- the side signal x side i.e. the difference between the left L and right R channels is provided to the side signal encoder unit 30 as in earlier embodiments.
- the optimiser 37 also depends on the modified centre signal 142 provided by the centre signal encoder unit 130. The side residual signal will therefore be created as an optimum linear combination of the mono signal 42, the modified centre signal 142 and the side signal in the subtraction unit 35.
- variable frame length concept described above can be applied on either of the side and centre signals, or on both.
- Fig. 6 illustrates a decoder unit suitable for receiving encoded audio signals from the encoder unit of Fig. 5.
- the received signal 54 is divided into encoding parameters p mono representing the main signal, encoding parameters p centre representing the centre signal and encoding parameters p side representing the side signal.
- the encoding parameters p mono representing the main signal are used to generate a main signal x" mono .
- the encoding parameters p centre representing the centre signal are used to generate a centre signal x" centre , based on main signal x" mono .
- the encoding parameters p side representing the side signal are decoded, generating a side signal x" side , based on main signal x" mono and centre signal x" centre .
- ⁇ , ⁇ and ⁇ are in the remaining section set to 1.0 for simplicity, but they can be set to arbitrary values.
- the ⁇ , ⁇ and ⁇ values can be either constant or dependent of the signal contents in order to emphasise one or two channels in order to achieve an optimal quality.
- x centre is the centre signal and x mono is the mono signal.
- the mono signal comes from the mono target signal but it is possible to use the local synthesis of the mono encoder as well.
- Q g (..) is a quantization function that is applied to the balance factor.
- the balance factor is transmitted on the transmission channel.
- E c is the encoding function (e.g. a transform encoder) of the centre residual signal and E m is the encoding function of the mono signal
- E m is the encoding function of the mono signal
- ⁇ can for instance be equal to 2 for a least square minimisation of the error.
- the g sm and g sc parameters can be quantized jointly or separately.
- FIG. 7a-b diagrams are illustrating such an artefact.
- a signal component having the time development as shown by curve 100.
- the signal component is not present in the audio sample.
- the signal component suddenly appears.
- the signal component is encoded, using a frame length of t2-t1
- the occurrence of the signal component will be "smeared out” over the entire frame, as indicated in curve 101.
- the signal component appears a time ⁇ t before the intended appearance of the signal component, and a "pre-echo" is perceived.
- the pre-echoing artefacts become more accentuated if long encoding frames are used. By using shorter frames, the artefact is somewhat suppressed.
- Another way to deal with the pre-echoing problems described above is to utilise the fact that the mono signal is available at both the encoder and decoder end. This makes it possible to scale the side signal according to the energy contour of the mono signal. In the decoder end, the inverse scaling is performed and thus some of the pre-echo problems may be alleviated.
- the simplest windowing function is a rectangular window, but other window types such as a hamming window may be more desirable.
- x ⁇ side residual n x sideresidual n f ( E c n ) , frame start ⁇ n ⁇ frame end , where f (..) is a monotonic continuous function.
- this energy contour scaling in some sense is alternative to the use of shorter frame lengths, this concept is particularly well suited to be combined with the variable frame length concept, described further above.
- a more flexible set of encoding schemes may be provided.
- the different encoding schemes 81 comprise hatched sub-frames, representing encoding applying the energy contour scaling, and un-hatched sub-frames, representing encoding procedures not applying the energy contour scaling.
- the set of encoding schemes of Fig. 8 comprises schemes that handle e.g. pre-echoing artefacts in different ways. In some schemes, longer sub-frames with pre-echoing minimisation according to the energy contour principle are used. In other schemes, shorter sub-frames without energy contour scaling are utilised. Depending on the signal content, one of the alternatives may be more advantageous. For very severe pre-echoing cases, encoding schemes utilising short sub-frames with energy contour scaling may be necessary.
- the proposed solution can be used in the full frequency band or in one or more distinct sub bands.
- the use of sub-band can be applied either on both the main and side signals, or on one of them separately.
- a preferred embodiment comprises a split of the side signal in several frequency bands. The reason is simply that it is easier to remove the possible redundancy in an isolated frequency band than in the entire frequency band. This is particularly important when encoding music signals with rich spectral content.
- the pre-determined threshold can preferably be 2 kHz, or even more preferably 1 kHz.
- the diffuse sound fields generally have little energy content at high frequencies.
- the natural reason is that sound absorption typically increases with frequency.
- the diffuse sound field components seem to play a less important role for the human auditory system at higher frequencies. Therefore, it is beneficial to employ this solution at low frequencies (below 1 or 2 kHz) and rely on other, even more bit efficient coding schemes at higher frequencies.
- the fact that the scheme is only applied at low frequencies gives a large saving in bit rate as the necessary bit rate with the proposed method is proportional to the required bandwidth.
- the mono encoder can encode the entire frequency band, while the proposed side signal encoding is suggested to be performed only in the lower part of the frequency band, as schematically illustrated by Fig. 9.
- Reference number 301 refers to an encoding scheme according to the present invention of the side signal
- reference number 302 refers to any other encoding scheme of the side signal
- reference number 303 refers to an encoding scheme of the side signal.
- Fig. 10 the main steps of an embodiment of an encoding method according to the present invention are illustrated as a flow diagram.
- the procedure starts in step 200.
- a main signal deduced from the polyphonic signals is encoded.
- encoding schemes are provided, which comprise sub-frames with differing lengths and/or order.
- a side signal deduced in step 214 from the polyphonic signals is encoded by an encoding scheme selected dependent at least partly on the actual signal content of the present polyphonic signals.
- the procedure ends in step 299.
- Fig. 11 the main steps of an embodiment of a decoding method according to the present invention are illustrated as a flow diagram.
- the procedure starts in step 200.
- a received encoded main signal is decoded.
- encoding schemes are provided, which comprise sub-frames with differing lengths and/or order.
- a received side signal is decoded in step 224 by a selected encoding scheme.
- the decoded main and side signals are combined to a polyphonic signal.
- the procedure ends in step 299.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Mobile Radio Communication Systems (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
- Endoscopes (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
Abstract
Description
- The present invention relates in general to encoding of audio signals, and in particular to encoding of multi-channel audio signals.
- There is a high market need to transmit and store audio signals at low bit rate while maintaining high audio quality. Particularly, in cases where transmission resources or storage is limited low bit rate operation is an essential cost factor. This is typically the case, e.g. in streaming and messaging applications in mobile communication systems such as GSM, UMTS, or CDMA.
- Today, there are no standardised codecs available providing high stereophonic audio quality at bit rates that are economically interesting for use in mobile communication systems. What is possible with available codecs is monophonic transmission of the audio signals. To some extent also stereophonic transmission is available. However, bit rate limitations usually require limiting the stereo representation quite drastically.
- The simplest way of stereophonic or multi-channel coding of audio signals is to encode the signals of the different channels separately as individual and independent signals. Another basic way used in stereo FM radio transmission and which ensures compatibility with legacy mono radio receivers is to transmit a sum and a difference signal of the two involved channels.
- State-of-the-art audio codecs, such as MPEG-1/2 Layer III and MPEG-2/4 AAC make use of so-called joint stereo coding. According to this technique, the signals of the different channels are processed jointly, rather than separately and individually. The two most commonly used joint stereo coding techniques are known as "Mid/Side" (M/S) stereo coding and intensity stereo coding, which usually are applied on sub-bands of the stereo or multi-channel signals to be encoded.
- M/ S stereo coding is similar to the described procedure in stereo FM radio, in a sense that it encodes and transmits the sum and difference signals of the channel sub-bands and thereby exploits redundancy between the channel sub-bands. The structure and operation of an encoder based on M/S stereo coding is described, e.g. in
US patent 5,285,498 by J.D. Johnston . - Intensity stereo on the other hand is able to make use of stereo irrelevancy. It transmits the joint intensity of the channels (of the different sub-bands) along with some location information indicating how the intensity is distributed among the channels. Intensity stereo does only provide spectral magnitude information of the channels. Phase information is not conveyed. For this reason and since the temporal inter-channel information (more specifically the inter-channel time difference) is of major psycho-acoustical relevancy particularly at lower frequencies, intensity stereo can only be used at high frequencies above e.g. 2 kHz. An intensity stereo coding method is described, e.g. in the
European patent 0497413 by R. Veldhuis et al . - A recently developed stereo coding method is described, e.g. in a conference paper with the title "Binaural cue coding applied to stereo and multi-channel audio compression", 112th AES convention, May 2002, Munich, Germany by C. Faller et al. This method is a parametric multi-channel audio coding method. The basic principle is that at the encoding side, the input signals from N channels c1, c2, ... cN are combined to one mono signal m. The mono signal is audio encoded using any conventional monophonic audio codec. In parallel, parameters are derived from the channel signals, which describe the multi-channel image. The parameters are encoded and transmitted to the decoder, along with the audio bit stream. The decoder first decodes the mono signal m' and then regenerates the channel signals c1', c2',..., cN', based on the parametric description of the multi-channel image.
- The principle of the Binaural Cue Coding (BCC) method is that it transmits the encoded mono signal and so-called BCC parameters. The BCC parameters comprise coded inter-channel level differences and inter-channel time differences for sub-bands of the original multi-channel input signal. The decoder regenerates the different channel signals by applying sub-band-wise level and phase adjustments of the mono signal based on the BCC parameters. The advantage over e.g. M/S or intensity stereo is that stereo information comprising temporal inter-channel information is transmitted at much lower bit rates. However, this technique requires computational demanding time-frequency transforms on each of the channels, both at the encoder and the decoder.
- Moreover, BCC does not handle the fact that a lot of the stereo information, especially at low frequencies, is diffuse, i.e. it does not come from any specific direction. Diffuse sound fields exist in both channels of a stereo recording but they are to a great extent out of phase with respect to each other. If an algorithm such as BCC is subject to recordings with a great amount of diffuse sound fields the reproduced stereo image will become confused, jumping from left to right as the BCC algorithm can only pan the signal in specific frequency bands to the left or right.
- A possible means to encode the stereo signal and ensure good reproduction of diffuse sound fields is to use an encoding scheme very similar to the technique used in FM stereo radio broadcast, namely to encode the mono (Left+Right) and the difference (Left-Right) signals separately.
- A technique, described in
US patent 5,434,948 by C.E. Holt et al. uses a similar technique as in BCC for encoding the mono signal and side information. In this case, side information consists of predictor filters and optionally a residual signal. The predictor filters, estimated by a least-mean-square algorithm, when applied to the mono signal allow the prediction of the multi-channel audio signals. With this technique one is able to reach very low bit rate encoding of multi-channel audio sources, however, at the expense of a quality drop, discussed further below. - Finally, for completeness, a technique is to be mentioned that is used in 3D audio. This technique synthesises the right and left channel signals by filtering sound source signals with so-called head-related filters. However, this technique requires the different sound source signals to be separated and can thus not generally be applied for stereo or multi-channel coding.
- A problem with existing encoding schemes based on encoding of frames of signals, in particular a main signal and one or more side signals, is that the division of audio information into frames may introduce unattractive perceptual artefacts. Dividing the information into frames of relative long duration generally reduces the average requested bit rate. This may be beneficial e.g. for music containing a large amount of diffuse sound. However, for transient rich music or speech, the fast temporal variations will be smeared out over the frame duration, giving rise to ghost-like sounds or even pre-echoing problems. Encoding short frames will instead give a more accurate representation of the sound, minimising the energy, but requires higher transmission bit rates and higher computational resources. The coding efficiency as such may also decrease with very short frame lengths. The introduction of more frame boundaries may also introduce discontinuities in encoding parameters, which may appear as perceptual artefacts.
- A further problem with schemes based on encoding of a main and one or several side signals is that they often require relatively large computational resources. In particular when short frames are used, handling discontinuities in parameters from one frame to another is a complex task. When long frames are used, estimation errors of transient sound may cause very large side signals, in turn increasing the transmission rate demand.
- An object of the present invention is therefore to provide an encoding method and device improving the perception quality of multi-channel audio signals, in particular to avoid artefacts such as pre-echoing, ghost-like sounds or frame discontinuity artefacts. A further object of the present invention is to provide an encoding method and device requiring less processing power and having more constant transmission bit rate requirements.
- The above objects are achieved by methods and devices according to the enclosed patent claims. In general words, polyphonic signals are used to create a main signal, typically a mono signal, and a side signal. The main signal is encoded according to prior-art encoding principles. A number of encoding schemes for the side signal are provided. Each encoding scheme is characterised by a set of sub-frames of different lengths. The total length of the sub-frames corresponds to the length of the encoding frame of the encoding scheme. The sets of sub-frames comprise at least one sub-frame. The encoding scheme to be used on the side signal is selected at least partly dependent on the present signal content of the polyphonic signals.
- In one embodiment, the selection takes place, before the encoding, based on signal characteristics analysis. In another embodiment, the side signal is encoded by each of the encoding schemes, and based on measurements of the quality of the encoding, the best encoding scheme is selected.
- In a preferred embodiment, a side residual signal is created as the difference between the side signal and the main signal scaled with a balance factor. The balance factor is selected to minimise the side residual signal. The optimised side residual signal and the balance factor are encoded and provided as parameters representing the side signal. At the decoder side, the balance factor, the side residual signal and the man signal are used to recover the side signal.
- In a further preferred embodiment, the encoding of the side signal comprises an energy contour scaling in order to avoid pre-echoing effects. Furthermore, different encoding schemes may comprise different encoding procedures in the separate sub-frames.
- The main advantage with the present invention is that the preservation of the perception of the audio signals is improved. Furthermore, the present invention still allows multi-channel signal transmission at very low bit rates.
- The invention, together with further objects and advantages thereof, may best be understood by making reference to the following description taken together with the accompanying drawings, in which:
- FIG. 1 is a block scheme of a system for transmitting polyphonic signals;
- FIG. 2a is a block diagram of an encoder in a transmitter;
- FIG. 2b is a block diagram of a decoder in a receiver;
- FIG. 3a is a diagram illustrating encoding frames of different lengths;
- FIGS. 3b and 3c are block diagrams of embodiments of side signal encoder units according to the present invention;
- FIG. 4 is a block diagram of an embodiment of an encoder using balance factor encoding of side signal;
- FIG. 5 is a block diagram of an embodiment of an encoder for multi-signal systems;
- FIG. 6 is a block diagram of an embodiment of a decoder suitable for decoding signals from the device of Fig. 5;
- FIG. 7a and b are diagrams illustrating a pre-echo artefact;
- FIG. 8 is a block diagram of an embodiment of a side signal encoder unit according to the present invention, employing different encoding principles in different sub-frames;
- FIG. 9 illustrates the use of different encoding principles in different frequency sub-bands;
- FIG. 10 is a flow diagram of the basic steps of an embodiment of an encoding method according to the present invention; and
- FIG. 11 is a flow diagram of the basic steps of an embodiment of a decoding method according to the present invention.
- Fig. 1 illustrates a
typical system 1, in which the present invention advantageously can be utilised. Atransmitter 10 comprises anantenna 12 including associated hardware and software to be able to transmit radio signals 5 to areceiver 20. Thetransmitter 10 comprises among other parts amulti-channel encoder 14, which transforms signals of a number ofinput channels 16 into output signals suitable for radio transmission. Examples of suitablemulti-channel encoders 14 are described in detail further below. The signals of theinput channels 16 can be provided from e.g. anaudio signal storage 18, such as a data file of digital representation of audio recordings, magnetic tape or vinyl disc recordings of audio etc. The signals of theinput channels 16 can also be provided in "live", e.g. from a set ofmicrophones 19. The audio signals are digitised, if not already in digital form, before entering themulti-channel encoder 14. - At the
receiver 20 side, anantenna 22 with associated hardware and software handles the actual reception of radio signals 5 representing polyphonic audio signals. Here, typical functionalities, such as e.g. error correction, are performed. Adecoder 24 decodes the received radio signals 5 and transforms the audio data carried thereby into signals of a number ofoutput channels 26. The output signals can be provided toe.g. loudspeakers 29 for immediate presentation, or can be stored in anaudio signal storage 28 of any kind. - The
system 1 can for instance be a phone conference system, a system for supplying audio services or other audio applications. In some systems, such as e.g. the phone conference system, the communication has to be of a duplex type, while e.g. distribution of music from a service provider to a subscriber can be essentially of a one-way type. The transmission of signals from thetransmitter 10 to thereceiver 20 can also be performed by any other means, e.g. by different kinds of electromagnetic waves, cables or fibres as well as combinations thereof. - Fig. 2a illustrates an embodiment of an encoder according to the present invention. In this embodiment, the polyphonic signal is a stereo signal comprising two channels a and b, received at
input pre-processing unit 32, where different signal conditioning procedures may be performed. The (perhaps modified) signals from the output of thepre-processing unit 32 are summed in anaddition unit 34. Thisaddition unit 34 also divides the sum by a factor of two. The signal xmono produced in this way is a main signal of the stereo signals, since it basically comprises all data from both channels. In this embodiment the main signal thus represents a pure "mono" signal. The main signal xmono is provided to a mainsignal encoder unit 38, which encodes the main signal according to any suitable encoding principles. Such principles are available within prior-art and are thus not further discussed here. The mainsignal encoder unit 38 gives an output signal pmono, being encoding parameters representing a main signal. - In a
subtraction unit 36, a difference (divided by a factor of two) of the channel signals is provided as a side signal xside. In this embodiment, the side signal represents the difference between the two channels in the stereo signal. The side signal xside is provided to a sidesignal encoding unit 30. Preferred embodiments of the sidesignal encoding unit 30 will be discussed further below. According to a side signal encoding procedure, which will be described more in detail further below, the side signal xside is transferred into encoding parameters pside representing a side signal xside. In certain embodiments, this encoding takes place utilising also information of the main signal xmono. Thearrow 42 indicates such a provision, where the original uncoded main signal xmono is utilised. In further other embodiments, the main signal information that is used in the sidesignal encoding unit 30 can be deduced from the encoding parameters pmono representing the main signal, as indicated by thebroken line 44. - The encoding parameters pmono representing the main signal xmono is a first output signal, and the encoding parameters pside representing the side signal xside is a second output signal. In a typical case, these two output signals pmono, pside, together representing the full stereo sound, are multiplexed into one
transmission signal 52 in amultiplexor unit 40. However, in other embodiments, the transmission of the first and second output signals pmono, pside may take place separately. - In Fig. 2b, an embodiment of a
decoder 24 according to the present invention is illustrated as a block scheme. The receivedsignal 54, comprising encoding parameters representing the main and side signal information are provided to ademultiplexor unit 56, which separates a first and second input signal, respectively. The first input signal, corresponding to encoding parameters pmono of a main signal, is provided to a mainsignal decoder unit 64. In a conventional manner, the encoding parameters pmono representing the main signal are used to generate an decoded main signal x"mono, being as similar to the main signal xmono (Fig. 2a) of the encoder 14 (Fig. 2a) as possible. - Similarly, the second input signal, corresponding to a side signal, is provided to a side
signal decoder unit 60. Here, the encoding parameters pside representing the side signal are used to recover a decoded side signal x"side. In some embodiments, the decoding procedure utilises information about the main signal x"mono, as indicated by an arrow. - The decoded main and side signals x"mono, x"side are provided to an
addition unit 70, which provides an output signal that is a representation of the original signal of channel a. Similarly, a difference provided by asubtraction unit 68 provides an output signal that is a representation of the original signal of channel b. These channel signals may be post-processed in apost-processor unit 74 according to prior-art signal processing procedures. Finally, the channel signals a and b are provided at theoutputs - As mentioned in the summary, encoding is typically performed in one frame at a time. A frame comprises audio samples within a pre-defined time period. In the bottom part of Fig. 3a, a frame SF2 of time duration L is illustrated. The audio samples within the unhatched portion are to be encoded together. The preceding samples and the subsequent samples are encoded in other frames. The division of the samples into frames will in any case introduce some discontinuities at the frame borders. Shifting sounds will give shifting encoding parameters, changing basically at each frame border. This will give rise to perceptible errors. One way to compensate somewhat for this is to base the encoding, not only on the samples that are to be encoded, but also on samples in the absolute vicinity of the frame, as indicated by the hatched portions. In such a way, there will be a softer transfer between the different frames. As an alternative, or complement, interpolation techniques are sometimes also utilised for reducing perception artefacts caused by frame borders. However, all such procedures require large additional computational resources, and for certain specific encoding techniques, it might also be difficult to provide it with any resources.
- In this view, it is beneficial to utilise as long frames as possible, since the number of frame borders will be small. Also the coding efficiency typically becomes high and the necessary transmission bit-rate will typically be minimised. However, long frames give problems with pre-echo artefacts and ghost-like sounds.
- By instead utilising shorter frames, such as SF1 or even SF0, having the durations of L/2 and L/4, respectively, anyone skilled in the art realises that the coding efficiency may be decreased, the transmission bit-rate may have to be higher and the problems with frame border artefacts will increase. However, shorter frames suffer less from e.g. other perception artefacts, such as ghost-like sounds and pre-echoing. In order to be able to minimise the coding error as much as possible, one should use an as short frame length as possible.
- According to the present invention, the audio perception will be improved by using a frame length for encoding of the side signal that is dependent on the present signal content. Since the influence of different frame lengths on the audio perception will differ depending on the nature of the sound to be encoded, an improvement can be obtained by letting the nature of the signal itself affect the frame length that is used. The encoding of the main signal is not the object of the present invention and is therefore not described in detail. However, the frame lengths used for the main signal may or may not be equal to the frame lengths used for the side signal.
- Due to small temporal variations, it may e.g. in some cases be beneficial to encode the side signal with use of relatively long frames. This may be the case with recordings with a great amount of diffuse sound field such as concert recordings. In other cases, such as stereo speech conversation, short frames are probably to prefer. The decision which frame length is to prefer can be performed in two basic ways.
- One embodiment of a side
signal encoder unit 30 according to the present invention is illustrated in Fig. 3b, in which a closed loop decision is utilised. A basic encoding frame of length L is used here. A number ofencoding schemes 81, characterised by aseparate set 80 of sub-frames, are created. Each set 80 of sub-frames comprises one or more sub-frames of equal or differing lengths. The total length of theset 80 of sub-frames is, however, always equal to the basic encoding frame length L. With references to Fig. 3b, the top encoding scheme is characterised by a set of sub-frames comprising only one sub-frame of length L. The next set of frames comprises two frames of length L/2. The third set comprises two frames of length L/4 followed by a L/2 frame. - The signal xside provided to the side
signal encoder unit 30 is encoded by all encodingschemes 81. In the top encoding scheme, the entire basic encoding frame is encoded in one piece. However, in the other encoding schemes, the signal xside is encoded in each sub-frame separately from each other. The result from each encoding scheme is provided to aselector 85. A fidelity measurement means 83 determines a fidelity measure for each of the encoded signals. The fidelity measure is an objective quality value, preferably a signal-to-noise measure or a weighted signal-to-noise ratio. The fidelity measures associated with each encoding scheme are compared and the result controls a switching means 87 to select the encoding parameters representing the side signal from the encoding scheme giving the best fidelity measure as the output signal pside from the sidesignal encoder unit 30. - Preferably, all possible combinations of frame lengths are tested and the set of sub-frames that gives the best objective quality, e.g. signal-to-noise ratio is selected.
- In the present embodiment, the lengths of the sub-frames used are selected according to:
where lsf are the lengths of the sub-frames, lf is the length of the encoding frame and n is an integer. In the present embodiment, n is selected between 0 and 3. However, any frame lengths will be possible to use as long as the total length of the set is kept constant. - In Fig. 3c, another embodiment of a side
signal encoder unit 30 according to the present invention is illustrated. Here, the frame length decision is an open loop decision, based on the statistics of the signal. In other words, the spectral characteristics of the side signal will be used as a base for deciding which encoding scheme that is going to be used. As before, different encoding schemes characterised by different sets of sub-frames are available. However, in this embodiment, theselector 85 is placed before the actual encoding. The input side signal xside enters theselector 85 and a signal analysing unit 84. The result of the analysis becomes the input of aswitch 86, in which only one of theencoding schemes 81 are utilised. The output from that encoding scheme will also be the output signal pside from the sidesignal encoder unit 30. - The advantage with an open loop decision is that only one actual encoding has to be performed. The disadvantage is, however, that the analysis of the signal characteristics may be very complicated indeed and it may be difficult to predict possible behaviours in advance to be able to give an appropriate choice in the
switch 86. A lot of statistical analysis of sound has to be performed and included in the signal analysing unit 84. Any small change in the encoding schemes may turn upside down on the statistical behaviour. - By using closed loop selection (Fig. 3b), encoding schemes may be exchanged without making any changes in the rest of the unit. On the other hand, if many encoding schemes are to be investigated, the computational requirements will be high.
- The benefit with such a variable frame length coding for the side signal is that one can select between a fine temporal resolution and coarse frequency resolution on one side and coarse temporal resolution and fine frequency resolution on the other. The above embodiments will preserve the stereo image in the best possible manner.
- There are also some requirements on the actual encoding utilised in the different encoding schemes. In particular when the closed loop selection is used, the computational resources to perform a number of more or less simultaneous encoding have to be large. The more complicated the encoding process is, the more computational power is needed. Furthermore, a low bit rate at transmission is also to prefer.
- The method presented in
US 5,434,948 , uses a filtered version of the mono (main) signal to resemble the side or difference signal. The filter parameters are optimised and allowed to vary in time. The filter parameters are then transmitted representing an encoding of the side signal. In one embodiment, also a residual side signal is transmitted. In many cases, such an approach would be possible to use as side signal encoding method within the scope of the present invention. This approach has, however, some disadvantages. The quantisation of the of the filter coefficients and any residual side signal often require relatively high bit rates for transmission, since the filter order has to be high to provide an accurate side signal estimate. The estimation of the filter itself may be problematic, especially in cases of transient rich music. Estimation errors will give a modified side signal that is sometimes larger in magnitude than the unmodified signal. This will lead to higher bit rate demands. Moreover, if a new set of filter coefficients are computed every N samples, the filter coefficients need to be interpolated to yield a smooth transition from one set of filter coefficients to another, as discussed above. Interpolation of filter coefficients is a complex task and errors in the interpolation will manifest itself in large side error signals leading to higher bit rates needed for the difference error signal encoder. - A means to avoid the need for interpolation is to update the filter coefficients on a sample-by-sample basis and rely on backwards-adaptive analysis. For this to work well it is needed that the bit rate of the residual encoder is fairly high. This is therefore not a good alternative for low bit rate stereo coding.
- There exist cases, e.g. quite common with music, where the mono and the difference signals are almost un-correlated. The filter estimation then becomes very troublesome with the added risk of just making things worse for the difference error signal encoder.
- The solution according to
US 5,434,948 can work pretty well in cases where the filter coefficients vary very slowly in time, e.g. conference telephony systems. In the case of music signals, this approach does not work very well as the filters need to change very fast to track the stereo image. This means that sub-frame lengths of very differing magnitude has to be utilised, which means that the number of combinations to test increases rapidly. This in turn means that the requirements for computing all possible encoding schemes becomes impracticably high. - Therefore, in a preferred embodiment, the encoding of the side signal is based on the idea to reduce the redundancy between the mono and side signal by using a simple balance factor instead of a complex bit rate consuming predictor filter. The residual of this operation is then encoded. The magnitude of such a residual is relatively small and does not call for very high bit rate need for transfer. This idea is very suitable indeed to combine with the variable frame set approach described earlier, since the computational complexity is low.
- The use of a balance factor combined with the variable frame length approach removes the need for complex interpolation and the associated problems that interpolation may cause. Moreover, the use of a simple balance factor instead of a complex filter gives fewer problems with estimation as possible estimation errors for the balance factor has less impact. The preferred solution will be able to reproduce both panned signals and diffuse sound fields with good quality and with limited bit rate requirements and computational resources.
- Fig. 4 illustrates a preferred embodiment of a stereo encoder according to the present invention. This embodiment is very similar to the one shown in Fig. 2a, however, with the details of the side
signal encoder unit 30 revealed. Theencoder 14 of this embodiment does not have any pre-processing unit, and the input signals are provided directly to the addition andsubtraction units multiplier 33. In asubtraction unit 35, the multiplied mono signal is subtracted from the side signal xside, i.e. essentially the difference between the two channels, to produce a side residual signal. The balance factor gsm is determined based on the content of the mono and side signals by theoptimiser 37 in order to minimise the side residual signal according to a quality criterion. The quality criterion is preferably a least mean square criterion. The side residual signal is encoded in a sideresidual encoder 39 according to any encoder procedures. Preferably, the sideresidual encoder 39 is a low bit rate transform encoder or a CELP (Codebook Excited Linear Prediction) encoder. The encoding parameters pside representing the side signal then comprises the encoding parameters pside residual representing the side residual signal and the optimisedbalance factor 49. - In the embodiment of Fig. 4, the
mono signal 42 used for synthesising the side signals is the target signal xmono for themono encoder 38. As mentioned above (in connection with Fig. 2a), the local synthesis signal of themono encoder 38 can also be utilised. In the latter case, the total encoder delay may be increased and the computational complexity for the side signal may increase. On the other hand, the quality may be better as it is then possible to repair coding errors made in the mono encoder. - In a more mathematical way, the basic encoding scheme can be described as follows. Denote the two channel signals as a and b, which may be the left and right channel of a stereo pair. The channel signals are combined into a mono signal by addition and to a side signal by a subtraction. In equation form, the operations are described as:
-
- On blocks of the input signals, a modified or residual side signal is computed according to:
where f(xmono, xside) is a balance factor function that based on the block on N samples, i.e. a sub-frame, from the side and mono signals strive to remove as much as possible from the side signal. In other words, the balance factor is used to minimise the residual side signal. In the special case where it is minimised in a mean square sense, this is equivalent to minimising the energy of the residual side signal xside residual. -
- It is possible to add weighting in the frequency domain to the computation of the balance factor. This is done by convoluting the xside and xmono signals with the impulse response of a weighting filter. It is then possible to move the estimation error to a frequency range where they are less easy to hear. This is referred to as perceptual weighting.
-
- Qg (..) is a quantization function that is applied to the balance factor given by the function f(xmono , xside ). The balance factor is transmitted on the transmission channel. In normal left-right panned signals the balance factor is limited to the interval [-1.0 1.0]. If on the other hand the channels are out of phase with regards to one another, the balance factor may extend beyond these limits.
-
- These situations occur quite frequently with e.g. classical music or studio music with a great amount of diffuse sounds, where in some cases the a and b channels might almost cancel out one another on occasions when a mono signal is created. The effect on the balance factor is that is can jump rapidly, causing a confused stereo image. The fix above alleviates this problem.
- The filter-based approach in
US 5,434,948 has the similar problems, but in that case the solution is not so simple. -
- One important benefit from computing the balance factor for each frame is that one avoids the use of interpolation. Instead, normally, as described above, the frame processing is performed with overlapping frames.
- The encoding principle using balance factors operates particularly well in the case of music signals, where fast changes typically are needed to track the stereo image.
- Lately, multi-channel coding has become popular. One example is 5.1 channel surround sound in DVD movies. The channels are there arranged as: front left, front centre, front right, rear left, rear right and subwoofer. In Fig. 5, an embodiment of an encoder that encodes the three front channels in such an arrangement exploiting interchannel redundancies according to the present invention is shown.
- Three channel signals L, C, R are provided on three
inputs 16A-C, and the mono signal xmono is created by a sum of all three signals. A centresignal encoder unit 130 is added, which receives the centre signal xcentre. Themono signal 42 is in this embodiment the encoded and decoded mono signal x"mono, and is multiplied with a certain balance factor gQ in amultiplier 133. In asubtraction unit 135, the multiplied mono signal is subtracted from the centre signal xcentre, to produce a centre residual signal. The balance factor gQ is determined based on the content of the mono and centre signals by anoptimiser 137 in order to minimise the centre residual signal according to the quality criterion. The centre residual signal is encoded in a centreresidual encoder 139 according to any encoder procedures. Preferably, the centreresidual encoder 139 is a low bit rate transform encoder or a CELP encoder. The encoding parameters pcentre representing the centre signal then comprises the encoding parameters pcentre residual representing the centre residual signal and the optimisedbalance factor 149. The centre residual signal and the scaled mono signal are added in anaddition unit 235, creating a modifiedcentre signal 142 being compensated for encoding errors. - The side signal xside, i.e. the difference between the left L and right R channels is provided to the side
signal encoder unit 30 as in earlier embodiments. However, here, theoptimiser 37 also depends on the modifiedcentre signal 142 provided by the centresignal encoder unit 130. The side residual signal will therefore be created as an optimum linear combination of themono signal 42, the modifiedcentre signal 142 and the side signal in thesubtraction unit 35. - The variable frame length concept described above can be applied on either of the side and centre signals, or on both.
- Fig. 6 illustrates a decoder unit suitable for receiving encoded audio signals from the encoder unit of Fig. 5. The received
signal 54 is divided into encoding parameters pmono representing the main signal, encoding parameters pcentre representing the centre signal and encoding parameters pside representing the side signal. In thedecoder 64, the encoding parameters pmono representing the main signal are used to generate a main signal x"mono. In thedecoder 160, the encoding parameters pcentre representing the centre signal are used to generate a centre signal x"centre, based on main signal x"mono. In thedecoder 60, the encoding parameters pside representing the side signal are decoded, generating a side signal x"side, based on main signal x"mono and centre signal x"centre. - The procedure can be mathematically expressed as follows:
-
- α, β and χ are in the remaining section set to 1.0 for simplicity, but they can be set to arbitrary values. The α, β and χ values can be either constant or dependent of the signal contents in order to emphasise one or two channels in order to achieve an optimal quality.
-
- xcentre is the centre signal and x mono is the mono signal. The mono signal comes from the mono target signal but it is possible to use the local synthesis of the mono encoder as well.
-
- Qg (..) is a quantization function that is applied to the balance factor. The balance factor is transmitted on the transmission channel.
-
-
- η can for instance be equal to 2 for a least square minimisation of the error. The gsm and gsc parameters can be quantized jointly or separately.
-
- One of the perception artefacts that are most annoying is the pre-echo effect. In Fig. 7a-b, diagrams are illustrating such an artefact. Assume a signal component having the time development as shown by
curve 100. In the beginning, starting from t0, the signal component is not present in the audio sample. At a time t between t1 and t2, the signal component suddenly appears. When the signal component is encoded, using a frame length of t2-t1, the occurrence of the signal component will be "smeared out" over the entire frame, as indicated incurve 101. If a decoding takes place of thecurve 101, the signal component appears a time Δt before the intended appearance of the signal component, and a "pre-echo" is perceived. - The pre-echoing artefacts become more accentuated if long encoding frames are used. By using shorter frames, the artefact is somewhat suppressed. Another way to deal with the pre-echoing problems described above is to utilise the fact that the mono signal is available at both the encoder and decoder end. This makes it possible to scale the side signal according to the energy contour of the mono signal. In the decoder end, the inverse scaling is performed and thus some of the pre-echo problems may be alleviated.
-
-
-
- Since this energy contour scaling in some sense is alternative to the use of shorter frame lengths, this concept is particularly well suited to be combined with the variable frame length concept, described further above. By having some encoding schemes that applies energy contour scaling, some that do not and some that applies energy contour scaling only during certain sub-frames, a more flexible set of encoding schemes may be provided. In Fig. 8, an embodiment of a
signal encoder unit 30 according to the present invention is illustrated. Here, thedifferent encoding schemes 81 comprise hatched sub-frames, representing encoding applying the energy contour scaling, and un-hatched sub-frames, representing encoding procedures not applying the energy contour scaling. In this manner, combinations not only of sub-frames of differing lengths, but sub-frames also of differing encoding principles are available. In the present explanatory example, the application of energy contour scaling differs between different encoding schemes. In a more general case, any encoding principles can be combined with the variable length concept in an analogous manner. - The set of encoding schemes of Fig. 8 comprises schemes that handle e.g. pre-echoing artefacts in different ways. In some schemes, longer sub-frames with pre-echoing minimisation according to the energy contour principle are used. In other schemes, shorter sub-frames without energy contour scaling are utilised. Depending on the signal content, one of the alternatives may be more advantageous. For very severe pre-echoing cases, encoding schemes utilising short sub-frames with energy contour scaling may be necessary.
- The proposed solution can be used in the full frequency band or in one or more distinct sub bands. The use of sub-band can be applied either on both the main and side signals, or on one of them separately. A preferred embodiment comprises a split of the side signal in several frequency bands. The reason is simply that it is easier to remove the possible redundancy in an isolated frequency band than in the entire frequency band. This is particularly important when encoding music signals with rich spectral content.
- One possible use is to encode the frequency band below a pre-determined threshold with the above method. The pre-determined threshold can preferably be 2 kHz, or even more preferably 1 kHz. For the remaining part of the frequency range of interest, one can either encode another additional frequency band with the above method, or use a completely different method.
- One motivation to use the above method preferably for low frequencies is that the diffuse sound fields generally have little energy content at high frequencies. The natural reason is that sound absorption typically increases with frequency. Also, the diffuse sound field components seem to play a less important role for the human auditory system at higher frequencies. Therefore, it is beneficial to employ this solution at low frequencies (below 1 or 2 kHz) and rely on other, even more bit efficient coding schemes at higher frequencies. The fact that the scheme is only applied at low frequencies gives a large saving in bit rate as the necessary bit rate with the proposed method is proportional to the required bandwidth. In most cases, the mono encoder can encode the entire frequency band, while the proposed side signal encoding is suggested to be performed only in the lower part of the frequency band, as schematically illustrated by Fig. 9.
Reference number 301 refers to an encoding scheme according to the present invention of the side signal,reference number 302 refers to any other encoding scheme of the side signal andreference number 303 refers to an encoding scheme of the side signal. - There also exist the possibility to use the proposed method for several distinct frequency bands.
- In Fig. 10, the main steps of an embodiment of an encoding method according to the present invention are illustrated as a flow diagram. The procedure starts in
step 200. Instep 210, a main signal deduced from the polyphonic signals is encoded. Instep 212, encoding schemes are provided, which comprise sub-frames with differing lengths and/or order. A side signal deduced instep 214 from the polyphonic signals is encoded by an encoding scheme selected dependent at least partly on the actual signal content of the present polyphonic signals. The procedure ends instep 299. - In Fig. 11, the main steps of an embodiment of a decoding method according to the present invention are illustrated as a flow diagram. The procedure starts in
step 200. Instep 220, a received encoded main signal is decoded. Instep 222, encoding schemes are provided, which comprise sub-frames with differing lengths and/or order. A received side signal is decoded instep 224 by a selected encoding scheme. Instep 226, the decoded main and side signals are combined to a polyphonic signal. The procedure ends instep 299. - The embodiments described above are to be understood as a few illustrative examples of the present invention. It will be understood by those skilled in the art that various modifications, combinations and changes may be made to the embodiments without departing from the scope of the present invention. In particular, different part solutions in the different embodiments can be combined in other configurations, where technically possible. The scope of the present invention is, however, defined by the appended claims.
-
-
European patent 0497413 -
US patent 5,285,498 -
US patent 5,434,948 - "Binaural cue coding applied to stereo and multi-channel audio compression", 112th AES convention, May 2002, Munich, Germany by C. Faller et al.
Claims (26)
- A method of encoding multi-channel audio signals, comprising the steps of:generating (210) a first output signal (pmono) being encoding parameters representing a main signal (xmono);said main signal (xmono) being a first linear combination of signals of at least a first and a second channel (a, b; L, R); andgenerating (214) a second output signal (pside) being encoding parameters representing a side signal (xside);said side (xside) signal being a second linear combination of signals of at least the first and the second channel (a, b; L, R) within an encoding frame (80),characterised by the further step of:providing (212) at least two encoding schemes (81), each of the at least two encoding schemes being characterised by a respective set of sub-frames (90), each set of sub-frames constituting the encoding frame (80);sub-frames (90) of said sets of sub-frames having different lengths;the sum of the lengths of the sub-frames (90) in each encoding scheme (81) being equal to the length of the encoding frame (80);each set of sub-frames (90) comprising at least one sub-frame (90);whereby the step of generating (214) the second output signal (pside) comprises the step of selecting an encoding scheme (81) at least to a part dependent of a present signal content of the side signal (xside);the second output signal (pside) being encoded in each of the sub-frames (90) of the selected set of sub-frames (90) separately.
- A method according to claim 1, characterised in that the step of generating (214) the second output signal (pside) in turn comprises the steps of:generating encoding parameters representing the side signal (xside) within all sub-frames (90) of each of the at least two sets of sub-frames (90) separately;calculating a total fidelity measure for each of the at least two encoding schemes (81); andselecting the encoded signal from the encoding scheme (81) having the best fidelity measure as the encoding parameters (pside) representing the side signal.
- A method according to claim 2, characterised in that the fidelity measure is based on a signal-to-noise measure.
- A method according to claim 4, characterised in that n is smaller than a predetermined value.
- A method according to claim 5, characterised in that the at least two encoding schemes (81) comprise all permutations of sub-frame (90) lengths.
- A method according to any of the claims 1 to 6, characterised in that the step of generating (210) the first output signal (pmono) in turn comprises the steps of:creating the main signal (xmono); andencoding the main signal (xmono) into encoding parameters (pmono) representing the main signal,the step of generating (214) said second output signal in turn comprising the steps of:creating a side residual signal (xside residual) as a difference between the side signal (xside) and the main signal (xmono) scaled by a balance factor (gsm);the balance factor (gsm) being determined as the factor minimising the side residual signal according to a quality criterion;encoding the side residual signal and the balance factor (gsm) into the encoding parameters (pside) representing the side signal.
- A method according to claim 7, characterised in that the quality criterion is based on a least-mean-square measure.
- A method according to any of the claims 1 to 8, characterised in that the step of encoding the side signal further comprises the step of:scaling the side signal (xside) to an energy contour of the main signal (xmono).
- A method according to claim 9, characterised in that the scaling of the side signal (xside) is a division by a factor being a monotonic continuous function of the energy contour of the main signal (xmono).
- A method according to claim 10, characterised in that the monotonic continuous function is a square root function.
- A method according to claim 12, characterised in that the windowing function is a rectangular windowing function.
- A method according to claim 12, characterised in that the windowing function is a hamming window function.
- A method according to any of the claims 1 to 14, characterised in that the at least two encoding schemes (81) comprise different encoding principles of the side signal (xside).
- A method according to claim 15, characterised in that at least a first encoding scheme of the at least two encoding schemes (81) comprises a first encoding principle for the side signal (xside) for all sub-frames (90) and at least a second encoding scheme of the at least two encoding schemes (81) comprises a second encoding principle for the side signal (xside) for all sub-frames (90).
- A method according to claim 15 or 16, characterised in that at least one encoding scheme of the at least two encoding schemes (81) comprises the first encoding principle for the side signal (xside) for one sub-frame and the second encoding principle for the side signal (xside) for another sub-frame.
- A method according to claim 1, characterised in that the step of generating (214) the second output signal (pside) in turn comprises the steps of:analysing spectral characteristics of the side signal (xside) ;selecting a set of sub-frames (90) based on the analysed spectral characteristics; andencoding the side signal (xside) within all sub-frames (90) of the selected set of sub-frames (90) separately.
- A method according to any of the claims 1 to 18, characterised in that the step of generating (214) a second output signal (pside) is applied in a limited frequency band.
- A method according to claim 19, characterised in that the step of generating (214) a second output signal (pside) is applied only for frequencies below 2 kHz.
- A method according to claim 20, characterised in that the step of generating (214) a second output signal (pside) is applied only for frequencies below 1 kHz.
- A method according to any of the claims 1 to 21, characterised in that the multi-channel audio signals represent music signals.
- A method of decoding multi-channel audio signals, comprising the steps of:decoding (220) encoding parameters (pmono) representing a main signal (xmono) into a decoded main signal (x"mono);said main signal (xmono) being a first linear combination of signals of at least a first and a second channel (a, b; L, R);decoding (224) encoding parameters (pside) representing a side signal (xside) into a decoded side signal (x"side);said side signal (xside) being a second linear combination of signals of at least a first and a second channel (a, b; L, R), within an encoding frame (80); andcombining (226) at least the decoded main signal (x"mono) and the decoded side signal (x"side) into signals of at least said first and said second channel (a, b; L, R),characterised by the step of:providing (222) at least two encoding schemes (81), each of the at least two encoding schemes (81) being characterised by a respective set of sub-frames (90), each set of sub-frames constituting the encoding frame (80);sub-frames (90) of said sets of sub-frames having different lengths;the sum of the lengths of the sub-frames (90) in each encoding scheme (81) being equal to the length of the encoding frame (80);each set of sub-frames (90) comprising at least one sub-frame (90),whereby the step of decoding (224) the encoding parameters (pside) representing the side signal in turn comprises the step of decoding the encoding parameters (pside) representing the side signal separately in the sub-frames (90) of one of the at least two encoding schemes (81).
- Encoder apparatus (14), comprising:input means (16; 16A-C) for multi-channel audio signals (a, b; L, R, C) comprising at least a first and a second channel (a, b; L, R),means (38) for generating a first output signal (pmono) being encoding parameters representing a main signal (xmono);said main signal (xmono) being a first linear combination of signals of at least the first and the second channel (a, b; L, R);means (30) for generating a second output signal (pside) being encoding parameters representing a side signal (xside);said side signal (xside) being a second linear combination of signals of at least the first and the second channel (a, b; L, R), within an encoding frame (80); andoutput means (52);characterised bymeans for providing at least two encoding schemes (81), each of the at least two encoding schemes (81) being characterised by a respective set of sub-frames (90), each set of sub-frames constituting the encoding frame (80);sub-frames (90) of said sets of sub-frames having different lengths;the sum of the lengths of the sub-frames (90) in each encoding scheme (81) being equal to the length of the encoding frame (80);each set of sub-frames (90) comprising at least one sub-frame (90);whereby the means (30) for generating the second output signal (pside) in turn comprises means (86; 87) for selecting an encoding scheme at least to a part dependent of a present signal content of the side signal (xside);means for encoding the side signal (xside) in each of the sub-frames (90) of the selected encoding scheme separately.
- Decoder apparatus (24), comprising:input means (54) for encoding parameters (pmono) representing a main signal and encoding parameters (pside) representing a side signal;said main signal (xmono) being a first linear combination of a first and a second channel (a, b; L, R);said side signal (xside) being a second linear combination of a first and a second channel (a, b; L, R);means (64) for decoding the encoding parameters (pmono) representing the main signal into a decoded main signal (x"mono);means (60) for decoding the encoding parameters (pside) representing the side signal within an encoding frame (80) into a decoded side signal (x"side);means (68, 70) for combining at least the decoded main signal (x"mono) and the decoded side signal (x"side) into signals of at least a first and a second channel (a, b; L, R); andoutput means (26; 26A-C),characterised in that the means (60) for decoding the encoding parameters (pside) representing the side signal in turn comprises:means for providing at least two encoding schemes (81), each of the at least two encoding schemes (81) being characterised by a respective set of sub-frames (90), each set of sub-frames constituting the encoding frame (80);sub-frames (90) of said sets of sub-frames having different lengths;the sum of the lengths of the sub-frames (90) in each encoding scheme being equal to the length of the encoding frame (80);each set of sub-frames (90) comprising at least one sub-frame (90); andsaid means for decoding the encoding parameters (pside) representing the side signal being arranged for decoding the encoding parameters (pside) representing the side signal separately in the sub-frames (90) of one of the at least two encoding schemes (81).
- Audio system (1) comprising at least one of:an encoder apparatus (14) according to claim 24, anda decoder apparatus (24) according to claim 25.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP07109801A EP1845519B1 (en) | 2003-12-19 | 2004-12-15 | Encoding and decoding of multi-channel audio signals based on a main and side signal representation |
PL04820553T PL1623411T3 (en) | 2003-12-19 | 2004-12-15 | Fidelity-optimised variable frame length encoding |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE0303501A SE0303501D0 (en) | 2003-12-19 | 2003-12-19 | Filter-based parametric multi-channel coding |
SE0400417A SE527670C2 (en) | 2003-12-19 | 2004-02-20 | Natural fidelity optimized coding with variable frame length |
PCT/SE2004/001867 WO2005059899A1 (en) | 2003-12-19 | 2004-12-15 | Fidelity-optimised variable frame length encoding |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07109801A Division EP1845519B1 (en) | 2003-12-19 | 2004-12-15 | Encoding and decoding of multi-channel audio signals based on a main and side signal representation |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1623411A1 EP1623411A1 (en) | 2006-02-08 |
EP1623411B1 true EP1623411B1 (en) | 2007-08-29 |
Family
ID=31996354
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP04820553A Ceased EP1623411B1 (en) | 2003-12-19 | 2004-12-15 | Fidelity-optimised variable frame length encoding |
EP07109801A Active EP1845519B1 (en) | 2003-12-19 | 2004-12-15 | Encoding and decoding of multi-channel audio signals based on a main and side signal representation |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07109801A Active EP1845519B1 (en) | 2003-12-19 | 2004-12-15 | Encoding and decoding of multi-channel audio signals based on a main and side signal representation |
Country Status (15)
Country | Link |
---|---|
EP (2) | EP1623411B1 (en) |
JP (2) | JP4335917B2 (en) |
CN (2) | CN100559465C (en) |
AT (2) | ATE371924T1 (en) |
AU (1) | AU2004298708B2 (en) |
BR (2) | BRPI0410856B8 (en) |
CA (2) | CA2690885C (en) |
DE (2) | DE602004008613T2 (en) |
HK (2) | HK1091585A1 (en) |
MX (1) | MXPA05012230A (en) |
PL (1) | PL1623411T3 (en) |
RU (2) | RU2305870C2 (en) |
SE (1) | SE527670C2 (en) |
WO (1) | WO2005059899A1 (en) |
ZA (1) | ZA200508980B (en) |
Families Citing this family (52)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BR0305434A (en) * | 2002-07-12 | 2004-09-28 | Koninkl Philips Electronics Nv | Methods and arrangements for encoding and decoding a multichannel audio signal, apparatus for providing an encoded audio signal and a decoded audio signal, encoded multichannel audio signal, and storage medium |
WO2006126856A2 (en) | 2005-05-26 | 2006-11-30 | Lg Electronics Inc. | Method of encoding and decoding an audio signal |
JP4639966B2 (en) * | 2005-05-31 | 2011-02-23 | ヤマハ株式会社 | Audio data compression method, audio data compression circuit, and audio data expansion circuit |
EP1913578B1 (en) | 2005-06-30 | 2012-08-01 | LG Electronics Inc. | Method and apparatus for decoding an audio signal |
US8082157B2 (en) | 2005-06-30 | 2011-12-20 | Lg Electronics Inc. | Apparatus for encoding and decoding audio signal and method thereof |
AU2006266655B2 (en) | 2005-06-30 | 2009-08-20 | Lg Electronics Inc. | Apparatus for encoding and decoding audio signal and method thereof |
US8050915B2 (en) * | 2005-07-11 | 2011-11-01 | Lg Electronics Inc. | Apparatus and method of encoding and decoding audio signals using hierarchical block switching and linear prediction coding |
US7788107B2 (en) | 2005-08-30 | 2010-08-31 | Lg Electronics Inc. | Method for decoding an audio signal |
KR101169280B1 (en) | 2005-08-30 | 2012-08-02 | 엘지전자 주식회사 | Method and apparatus for decoding an audio signal |
EP1920635B1 (en) | 2005-08-30 | 2010-01-13 | LG Electronics Inc. | Apparatus and method for decoding an audio signal |
JP4859925B2 (en) | 2005-08-30 | 2012-01-25 | エルジー エレクトロニクス インコーポレイティド | Audio signal decoding method and apparatus |
US7646319B2 (en) | 2005-10-05 | 2010-01-12 | Lg Electronics Inc. | Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor |
US8068569B2 (en) | 2005-10-05 | 2011-11-29 | Lg Electronics, Inc. | Method and apparatus for signal processing and encoding and decoding |
US7696907B2 (en) | 2005-10-05 | 2010-04-13 | Lg Electronics Inc. | Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor |
US7751485B2 (en) | 2005-10-05 | 2010-07-06 | Lg Electronics Inc. | Signal processing using pilot based coding |
US7672379B2 (en) | 2005-10-05 | 2010-03-02 | Lg Electronics Inc. | Audio signal processing, encoding, and decoding |
WO2007040353A1 (en) | 2005-10-05 | 2007-04-12 | Lg Electronics Inc. | Method and apparatus for signal processing |
KR100857114B1 (en) | 2005-10-05 | 2008-09-08 | 엘지전자 주식회사 | Method and apparatus for signal processing and encoding and decoding method, and apparatus therefor |
US7761289B2 (en) | 2005-10-24 | 2010-07-20 | Lg Electronics Inc. | Removing time delays in signal paths |
WO2007080211A1 (en) * | 2006-01-09 | 2007-07-19 | Nokia Corporation | Decoding of binaural audio signals |
CN101366082B (en) * | 2006-02-06 | 2012-10-03 | 艾利森电话股份有限公司 | Variable frame shifting code method, codec and wireless communication device |
US7461106B2 (en) | 2006-09-12 | 2008-12-02 | Motorola, Inc. | Apparatus and method for low complexity combinatorial coding of signals |
US8576096B2 (en) | 2007-10-11 | 2013-11-05 | Motorola Mobility Llc | Apparatus and method for low complexity combinatorial coding of signals |
US8209190B2 (en) | 2007-10-25 | 2012-06-26 | Motorola Mobility, Inc. | Method and apparatus for generating an enhancement layer within an audio coding system |
US7889103B2 (en) | 2008-03-13 | 2011-02-15 | Motorola Mobility, Inc. | Method and apparatus for low complexity combinatorial coding of signals |
US8639519B2 (en) | 2008-04-09 | 2014-01-28 | Motorola Mobility Llc | Method and apparatus for selective signal coding based on core encoder performance |
EP2124486A1 (en) * | 2008-05-13 | 2009-11-25 | Clemens Par | Angle-dependent operating device or method for generating a pseudo-stereophonic audio signal |
EP2283483B1 (en) | 2008-05-23 | 2013-03-13 | Koninklijke Philips Electronics N.V. | A parametric stereo upmix apparatus, a parametric stereo decoder, a parametric stereo downmix apparatus, a parametric stereo encoder |
WO2010016270A1 (en) * | 2008-08-08 | 2010-02-11 | パナソニック株式会社 | Quantizing device, encoding device, quantizing method, and encoding method |
CN102160114B (en) * | 2008-09-17 | 2012-08-29 | 法国电信公司 | Method and device of pre-echo attenuation in a digital audio signal |
JP5309944B2 (en) | 2008-12-11 | 2013-10-09 | 富士通株式会社 | Audio decoding apparatus, method, and program |
US8219408B2 (en) | 2008-12-29 | 2012-07-10 | Motorola Mobility, Inc. | Audio signal decoder and method for producing a scaled reconstructed audio signal |
US8140342B2 (en) | 2008-12-29 | 2012-03-20 | Motorola Mobility, Inc. | Selective scaling mask computation based on peak detection |
US8175888B2 (en) | 2008-12-29 | 2012-05-08 | Motorola Mobility, Inc. | Enhanced layered gain factor balancing within a multiple-channel audio coding system |
US8200496B2 (en) | 2008-12-29 | 2012-06-12 | Motorola Mobility, Inc. | Audio signal decoder and method for producing a scaled reconstructed audio signal |
EP2461321B1 (en) | 2009-07-31 | 2018-05-16 | Panasonic Intellectual Property Management Co., Ltd. | Coding device and decoding device |
US8977546B2 (en) * | 2009-10-20 | 2015-03-10 | Panasonic Intellectual Property Corporation Of America | Encoding device, decoding device and method for both |
EP2346028A1 (en) * | 2009-12-17 | 2011-07-20 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | An apparatus and a method for converting a first parametric spatial audio signal into a second parametric spatial audio signal |
EP2517201B1 (en) * | 2009-12-23 | 2015-11-04 | Nokia Technologies Oy | Sparse audio processing |
US8442837B2 (en) | 2009-12-31 | 2013-05-14 | Motorola Mobility Llc | Embedded speech and audio coding using a switchable model core |
US8423355B2 (en) | 2010-03-05 | 2013-04-16 | Motorola Mobility Llc | Encoder for audio signal including generic audio and speech frames |
US8428936B2 (en) | 2010-03-05 | 2013-04-23 | Motorola Mobility Llc | Decoder for audio signal including generic audio and speech frames |
EP2544466A1 (en) | 2011-07-05 | 2013-01-09 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Method and apparatus for decomposing a stereo recording using frequency-domain processing employing a spectral subtractor |
US9129600B2 (en) | 2012-09-26 | 2015-09-08 | Google Technology Holdings LLC | Method and apparatus for encoding an audio signal |
CA3210225A1 (en) * | 2012-11-15 | 2014-05-22 | Ntt Docomo, Inc. | Audio coding device, audio coding method, audio coding program, audio decoding device, audio decoding method, and audio decoding program |
US10060955B2 (en) * | 2014-06-25 | 2018-08-28 | Advanced Micro Devices, Inc. | Calibrating power supply voltages using reference measurements from code loop executions |
US12125492B2 (en) | 2015-09-25 | 2024-10-22 | Voiceage Coproration | Method and system for decoding left and right channels of a stereo sound signal |
JP6887995B2 (en) | 2015-09-25 | 2021-06-16 | ヴォイスエイジ・コーポレーション | Methods and systems for encoding stereo audio signals that use the coding parameters of the primary channel to encode the secondary channel |
CN107742521B (en) | 2016-08-10 | 2021-08-13 | 华为技术有限公司 | Coding method and coder for multi-channel signal |
CN109215668B (en) * | 2017-06-30 | 2021-01-05 | 华为技术有限公司 | Method and device for encoding inter-channel phase difference parameters |
CN110728986B (en) | 2018-06-29 | 2022-10-18 | 华为技术有限公司 | Coding method, decoding method, coding device and decoding device for stereo signal |
CN112233682B (en) * | 2019-06-29 | 2024-07-16 | 华为技术有限公司 | Stereo encoding method, stereo decoding method and device |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5434948A (en) * | 1989-06-15 | 1995-07-18 | British Telecommunications Public Limited Company | Polyphonic coding |
NL9100173A (en) * | 1991-02-01 | 1992-09-01 | Philips Nv | SUBBAND CODING DEVICE, AND A TRANSMITTER EQUIPPED WITH THE CODING DEVICE. |
US5285498A (en) * | 1992-03-02 | 1994-02-08 | At&T Bell Laboratories | Method and apparatus for coding audio signals based on perceptual model |
US5694332A (en) * | 1994-12-13 | 1997-12-02 | Lsi Logic Corporation | MPEG audio decoding system with subframe input buffering |
US5956674A (en) * | 1995-12-01 | 1999-09-21 | Digital Theater Systems, Inc. | Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels |
US5812971A (en) * | 1996-03-22 | 1998-09-22 | Lucent Technologies Inc. | Enhanced joint stereo coding method using temporal envelope shaping |
US5796842A (en) * | 1996-06-07 | 1998-08-18 | That Corporation | BTSC encoder |
US6463410B1 (en) * | 1998-10-13 | 2002-10-08 | Victor Company Of Japan, Ltd. | Audio signal processing apparatus |
US6226616B1 (en) * | 1999-06-21 | 2001-05-01 | Digital Theater Systems, Inc. | Sound quality of established low bit-rate audio coding systems without loss of decoder compatibility |
JP3335605B2 (en) * | 2000-03-13 | 2002-10-21 | 日本電信電話株式会社 | Stereo signal encoding method |
WO2002091363A1 (en) * | 2001-05-08 | 2002-11-14 | Koninklijke Philips Electronics N.V. | Audio coding |
JP2003084790A (en) * | 2001-09-17 | 2003-03-19 | Matsushita Electric Ind Co Ltd | Speech component emphasizing device |
CN1219415C (en) * | 2002-07-23 | 2005-09-14 | 华南理工大学 | 5.1 path surround sound earphone repeat signal processing method |
-
2004
- 2004-02-20 SE SE0400417A patent/SE527670C2/en unknown
- 2004-12-15 DE DE602004008613T patent/DE602004008613T2/en active Active
- 2004-12-15 CN CNB2004800186630A patent/CN100559465C/en active Active
- 2004-12-15 DE DE602004023240T patent/DE602004023240D1/en active Active
- 2004-12-15 WO PCT/SE2004/001867 patent/WO2005059899A1/en active IP Right Grant
- 2004-12-15 CA CA2690885A patent/CA2690885C/en active Active
- 2004-12-15 AT AT04820553T patent/ATE371924T1/en not_active IP Right Cessation
- 2004-12-15 CA CA2527971A patent/CA2527971C/en active Active
- 2004-12-15 PL PL04820553T patent/PL1623411T3/en unknown
- 2004-12-15 BR BRPI0410856A patent/BRPI0410856B8/en not_active IP Right Cessation
- 2004-12-15 MX MXPA05012230A patent/MXPA05012230A/en active IP Right Grant
- 2004-12-15 CN CN200710138487XA patent/CN101118747B/en not_active Expired - Fee Related
- 2004-12-15 RU RU2005134365/09A patent/RU2305870C2/en active
- 2004-12-15 EP EP04820553A patent/EP1623411B1/en not_active Ceased
- 2004-12-15 AU AU2004298708A patent/AU2004298708B2/en not_active Ceased
- 2004-12-15 AT AT07109801T patent/ATE443317T1/en not_active IP Right Cessation
- 2004-12-15 ZA ZA200508980A patent/ZA200508980B/en unknown
- 2004-12-15 BR BRPI0419281-8A patent/BRPI0419281B1/en not_active IP Right Cessation
- 2004-12-15 JP JP2006518596A patent/JP4335917B2/en not_active Expired - Fee Related
- 2004-12-15 EP EP07109801A patent/EP1845519B1/en active Active
-
2006
- 2006-11-01 HK HK06112026.7A patent/HK1091585A1/en not_active IP Right Cessation
- 2006-11-01 HK HK08106066.8A patent/HK1115665A1/en not_active IP Right Cessation
-
2007
- 2007-06-05 RU RU2007121143/09A patent/RU2425340C2/en active
- 2007-08-22 JP JP2007216374A patent/JP4589366B2/en not_active Expired - Fee Related
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1623411B1 (en) | Fidelity-optimised variable frame length encoding | |
US7809579B2 (en) | Fidelity-optimized variable frame length encoding | |
JP5171269B2 (en) | Optimizing fidelity and reducing signal transmission in multi-channel audio coding | |
JP5179881B2 (en) | Parametric joint coding of audio sources | |
US9626973B2 (en) | Adaptive bit allocation for multi-channel audio encoding | |
CN103119647B (en) | Based on the plural number prediction stereo coding of MDCT | |
EP2109861B1 (en) | Audio decoder | |
US20160247515A1 (en) | Bitstream syntax for multi-process audio decoding | |
US7725324B2 (en) | Constrained filter encoding of polyphonic signals | |
AU2007237227B2 (en) | Fidelity-optimised pre-echo suppressing encoding | |
EP1639580B1 (en) | Coding of multi-channel signals |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20051108 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA HR LV MK YU |
|
17Q | First examination report despatched |
Effective date: 20060703 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
DAX | Request for extension of the european patent (deleted) | ||
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D |
|
REF | Corresponds to: |
Ref document number: 602004008613 Country of ref document: DE Date of ref document: 20071011 Kind code of ref document: P |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071229 Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071210 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 |
|
REG | Reference to a national code |
Ref country code: PL Ref legal event code: T3 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CH Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 Ref country code: LI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 |
|
EN | Fr: translation not filed | ||
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071130 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20080129 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071129 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20071231 |
|
26N | No opposition filed |
Effective date: 20080530 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20071217 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20071231 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070829 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071129 Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20071215 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20080301 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20151215 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20151215 |
|
PGRI | Patent reinstated in contracting state [announced from national office to epo] |
Ref country code: IT Effective date: 20170710 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: TR Payment date: 20201127 Year of fee payment: 17 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: PL Payment date: 20201118 Year of fee payment: 17 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: IT Payment date: 20201221 Year of fee payment: 17 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: NL Payment date: 20221226 Year of fee payment: 19 Ref country code: GB Payment date: 20221227 Year of fee payment: 19 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PL Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20211215 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20221228 Year of fee payment: 19 |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230517 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20211231 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 602004008613 Country of ref document: DE |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: MM Effective date: 20240101 |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20231215 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20240101 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20211215 Ref country code: NL Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20240101 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20240702 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20231215 |