JP3342996B2

JP3342996B2 - Multi-channel audio encoder and encoding method

Info

Publication number: JP3342996B2
Application number: JP21182895A
Authority: JP
Inventors: 丁權許; 永南呉
Original assignee: Samsung Electronics Co Ltd
Current assignee: Samsung Electronics Co Ltd
Priority date: 1995-08-21
Filing date: 1995-08-21
Publication date: 2002-11-11
Anticipated expiration: 2015-08-21
Also published as: JPH0964751A

Abstract

PROBLEM TO BE SOLVED: To efficiently encode the audio signals of multiple channels with a small quantity of bit by APCM-quantizing the multiple channels audio signals of a frequency area through the use of the result of bit allocation in a bit a location unit. SOLUTION: An APCM pre-processing part 120 inputting the stereo audio signals, and the multiple channel audio signals of the frequency area and calculating information on the size of respective frequency identification signals and a multiple channel processor 130 allocating the channels through the use of a correlation among the multiple channels and signals size information are provided. The bit allocation unit 140 allocating the bits to the respective audio signals by using the result of the multiple channel processor 130 and the result of a psychological acoustic part 170 and a quantizer 150 quantizing the multiple channel audio signals in the frequency area by using the bit allocated to the bit allocation unit 140 are provided. Thus, the audio signals of the multiple channels can be transmitted by encoding the audio signals of the multiple channels by using APCM and the correlation among the channels.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は多チャネルオーディ
オ符号化器及び符号化方法に係り、特にＡＰＣＭ（Ａｄ
ａｐｔｉｖｅＰｌｕｓｅＣｏｄｅｄＭｏｄｕｌａ
ｔｉｏｎ）と各チャネル間の相関関係を用いて多チャネ
ルのオーディオ信号を少量のビットで効率的に符号化す
るための符号化器及び符号化方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a multi-channel audio encoder and an encoding method, and more particularly to an APCM (Ad).
active Plus Coded Modula
The present invention relates to an encoder and an encoding method for efficiently encoding a multi-channel audio signal with a small number of bits using the correlation between each channel and each channel.

【０００２】[0002]

【従来の技術】従来の多チャネルオーディオの符号化器
と復号化器の場合、与えられる伝送チャネルの数が制限
されている反面、元の信号チャネルは増える趨勢なので
十分な音質を有して伝送することが困難になる。このよ
うな理由により一部オーディオコデックアルゴリズムは
音の正しい伝送より再生時の音の方向性に重点を置く場
合もある。2. Description of the Related Art In conventional encoders and decoders for multi-channel audio, although the number of transmission channels provided is limited, the number of original signal channels tends to increase, so that transmission with sufficient sound quality is performed. It becomes difficult to do. For this reason, some audio codec algorithms may place more emphasis on the directionality of the sound during playback than on the correct transmission of the sound.

【０００３】従来のオーディオコデックアルゴリズムは
既存のステレオシステムとの交換性を有するため元の信
号を混ぜてステレオチャネルを作るが、この場合復号化
時ステレオチャネルで再び多チャネル信号を作ると、ス
テレオ再生時には聞こえなかった量子化歪曲が生じる場
合がある。これは伝送された多チャネル信号をさらに混
合して使用すると、深刻な問題となる場合がある。The conventional audio codec algorithm has the interchangeability with the existing stereo system, so that the original signal is mixed to create a stereo channel. In this case, when a multi-channel signal is created again by the stereo channel at the time of decoding, the stereo reproduction is performed. Sometimes, inaudible quantization distortion may occur. This can be a serious problem if the transmitted multi-channel signal is further mixed and used.

【０００４】一方、ＭＰＥＧー２オーディオ標準案であ
るＩＳＯ／ＩＥＣ１３８１８ー３によるオーディオコ
ーデックアルゴリズムは、サブバンドフィルタリング過
程、マトリックシング過程、第１スケールファクタ算出
過程、第１伝送パターン算出過程、心理音響モデルによ
るＳＭＲ（Ｓｉｇｎａｌ−ｔｏ−ＭａｓｋｉｎｇＲａ
ｔｉｏ）値の算出過程と、伝送チャネル割当過程、ダイ
ナミック漏話抑制過程、遅延補償値、予測係数及び予測
利得値算出過程、予測選択情報算出過程、修正されたＳ
ＭＲ値の算出過程、予測係数量子化過程、初期予測エラ
ー信号算出過程、第２スケールファクタ算出過程、第２
伝送パターン算出過程、修正されたＳＭＲ値を使用した
ビット割当過程、第１サブバンドサンプル量子化過程、
サブバンドサンプルの逆量子化過程、逆量子化されたサ
ブバンドサンプルを利用した最終予測エラー信号算出過
程、第３スケールファクタ算出過程、第３伝送パターン
算出過程、第２サブバンドサンプル量子化過程及びビッ
トストリームフォーマット過程よりなる。前記のオーデ
ィオコーデックアルゴリズムは極めて複雑であるだけで
なく、スケールファクタ算出過程と伝送パターン算出過
程を３回繰り返して行うなどのようにルーチンを数回繰
り返すべきであり、初期に入力された多チャネルオーデ
ィオ信号に対してのみ心理音響モデルが適用できる短所
があった。On the other hand, an audio codec algorithm according to ISO / IEC 13818-3, which is an MPEG-2 audio standard, includes a sub-band filtering process, a matrixing process, a first scale factor calculating process, a first transmission pattern calculating process, a psychoacoustic process. SMR (Signal-to-Masking Ra) by Model
tio) value calculation process, transmission channel allocation process, dynamic crosstalk suppression process, delay compensation value, prediction coefficient and prediction gain value calculation process, prediction selection information calculation process, modified S
MR value calculation process, prediction coefficient quantization process, initial prediction error signal calculation process, second scale factor calculation process, second
A transmission pattern calculation process, a bit allocation process using the corrected SMR value, a first subband sample quantization process,
Dequantizing the sub-band samples, calculating the final prediction error signal using the de-quantized sub-band samples, calculating the third scale factor, calculating the third transmission pattern, quantizing the second sub-band samples, It consists of a bit stream formatting process. The above-mentioned audio codec algorithm is not only very complicated, but also it is necessary to repeat the routine several times, such as repeating the scale factor calculation process and the transmission pattern calculation process three times. There is a disadvantage that the psychoacoustic model can be applied only to the signal.

【０００５】[0005]

【発明が解決しようとする課題】したがって、本発明の
目的は前述した問題点を解決するためにＡＰＣＭと各チ
ャネル間の相関関係を利用して多チャネルのオーディオ
信号を少量のビットで効率的に符号化するための多チャ
ネルオーディオ符号化方法を提供するにある。SUMMARY OF THE INVENTION Accordingly, an object of the present invention is to solve the above-mentioned problems by efficiently using multi-channel audio signals with a small number of bits by utilizing the correlation between the APCM and each channel. An object of the present invention is to provide a multi-channel audio encoding method for encoding.

【０００６】本発明の他の目的は前記他チャネルオーデ
ィオ符号化方法を実現するに最も適合な符号化器を提供
するにある。It is another object of the present invention to provide an encoder most suitable for implementing the above-mentioned other channel audio encoding method.

【０００７】[0007]

【課題を解決するための手段】前記の目的を達成するた
めに他チャネルオーディオ符号化方法は、多チャネルに
入力されるオーディオ信号を符号化するための多チャネ
ルオーディオ符号化器において、時間領域のステレオオ
ーディオ信号と多チャネルオーディオ信号に対して計算
された各チャネルのマスキングレベルと、周波数領域の
ステレオオーディオ信号と多チャネルオーディオ信号に
対する各周波数別信号の大きさ情報と、各チャネル間の
相関関係に応じて前記各チャネルのオーディオ信号に対
して伝送するか否かを判断し、チャネルを割当てる多チ
ャネル処理過程と、前記多チャネル処理過程のチャネル
割当の結果と前記マスキングレベルに応じて前記各オー
ディオ信号に対するビットを割当てるビット割当過程
と、前記ビット割当過程でのビット割当の結果を利用し
て前記周波数領域の多チャネルオーディオ信号をＡＰＣ
Ｍ量子化する量子化過程とを含むことを特徴とする。According to another aspect of the present invention, there is provided a multi-channel audio encoder for encoding an audio signal input to multiple channels. The masking level of each channel calculated for the stereo audio signal and the multi-channel audio signal, the magnitude information of each frequency signal for the stereo audio signal and the multi-channel audio signal in the frequency domain, and the correlation between the channels. A multi-channel processing step of deciding whether or not to transmit the audio signal of each channel, and allocating a channel; and a result of channel allocation in the multi-channel processing step and the masking level. A bit allocation step of allocating bits to The multi-channel audio signal APC in the frequency domain by using the results of the bit allocation in extent
And a quantization step of performing M quantization.

【０００８】前記他の目的を達成するために本発明によ
る多チャネルオーディオ符号化器は、時間領域に存する
多チャネルオーディオ信号を入力してステレオオーディ
オ信号を形成するための第１ミキサーと、周波数領域に
変換された多チャネルオーディオ信号を入力してステレ
オオーディオ信号を形成するための第２ミキサーと、前
記第１ミキサーから出力される時間領域のステレオオー
ディオ信号と前記時間領域の多チャネルオーディオ信号
を入力して各チャネルのマスキングレベルを計算するた
めの心理音響部と、前記第２ミキサーから出力される周
波数領域のステレオオーディオ信号と周波数領域の多チ
ャネルオーディオ信号に対する各周波数別信号の大きさ
情報と、前記心理音響部で計算されたマスキングレベル
を入力して前記各チャネルのオーディオ信号に対して伝
送するか否かを判断し、チャネルを割当てるための多チ
ャネル処理器と、前記多チャネル処理器でのチャネル割
当の結果と前記マスキングレベルを入力して前記各オー
ディオ信号に対するビットを割当てるためのビット割当
器と、前記ビット割当器でのビット割当の結果を用いて
前記周波数領域の多チャネルオーディオ信号をＡＰＣＭ
量子化するための量子化器とを含むことを特徴とする。According to another aspect of the present invention, there is provided a multi-channel audio encoder according to the present invention, comprising: a first mixer for receiving a multi-channel audio signal in a time domain to form a stereo audio signal; A second mixer for forming a stereo audio signal by inputting the converted multi-channel audio signal, a time-domain stereo audio signal output from the first mixer, and a time-domain multi-channel audio signal And a psychoacoustic unit for calculating a masking level of each channel, and magnitude information of each frequency-dependent signal with respect to a frequency domain stereo audio signal and a frequency domain multi-channel audio signal output from the second mixer; Enter the masking level calculated by the psychoacoustic unit and A multi-channel processor for deciding whether or not to transmit the audio signal of the channel, and allocating a channel allocation result in the multi-channel processor and the masking level to each of the audio signals; And a bit allocator for allocating bits to the multi-channel audio signal in the frequency domain using the bit allocation result of the bit allocator.
And a quantizer for performing quantization.

【０００９】[0009]

【発明の実施の形態】以下、添付した図面に基づき本発
明を詳細に説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, the present invention will be described in detail with reference to the accompanying drawings.

【００１０】図１は本発明による多チャネルオーディオ
符号化器の一実施例によるブロック図である。FIG. 1 is a block diagram showing an embodiment of a multi-channel audio encoder according to the present invention.

【００１１】図１に示されたブロック図の構成は、多チ
ャネルオーディオ信号を周波数領域の信号に変えるフィ
ルタ１００と、フィルタ１００から出力される周波数領
域の多チャネルオーディオ信号をスレテオ信号とする第
２ミキサー１１０と、第２ミキサー１１０から出力され
る周波数領域のステレオオーディオ信号と多チャネルオ
ーディオ信号を入力して各周波数別信号の大きさ情報を
計算するＡＰＣＭ前処理部１２０と、多チャネル間の相
関関係と信号の大きさ情報とを利用してチャネルを割当
てる多チャネル処理器１３０と、時間領域の多チャネル
オーディオ信号をステレオ信号とする第１ミキサー１６
０と、時間領域の多チャネルオーディオ信号とスレテオ
信号を入力して各チャネルのマスキングレベルを計算す
る心理音響部１７０と、多チャネル処理器１３０の結果
と心理音響部１７０の結果を利用して各オーディオ信号
にビットを割当てるビット割当器１４０と、ビット割当
器１４０で割当てられたビットを利用して周波数領域の
多チャネルオーディオ信号を量子化する量子化器１５０
よりなる。The configuration of the block diagram shown in FIG. 1 includes a filter 100 for converting a multi-channel audio signal into a frequency-domain signal, and a second multi-channel audio signal output from the filter 100 as a stereo signal. A mixer 110, an APCM preprocessing unit 120 that receives a stereo audio signal in a frequency domain and a multi-channel audio signal output from the second mixer 110 and calculates magnitude information of a signal for each frequency, and a correlation between the multi-channels A multi-channel processor 130 for allocating channels using the relationship and signal size information, and a first mixer 16 for converting a multi-channel audio signal in the time domain into a stereo signal
0, a psycho-acoustic unit 170 that receives a multi-channel audio signal and a stereo signal in the time domain to calculate a masking level of each channel, and uses the result of the multi-channel processor 130 and the result of the psycho-acoustic unit 170 A bit allocator 140 for allocating bits to the audio signal, and a quantizer 150 for quantizing a frequency-domain multi-channel audio signal using the bits allocated by the bit allocator 140
Consisting of

【００１２】図２は図１において多チャネル処理器１３
０の詳細ブロック図であって、周波数領域の多チャネル
オーディオ信号の大きさを各周波数別に比較して最も小
さい信号を伝送する信号として定めてチャネルを割当て
るチャネル割当器１３１と、各チャネル間の相関係数に
応じて伝送するか否かを定めてチャネルを割当てる相関
係数処理器１３２より構成される。FIG. 2 shows a multi-channel processor 13 shown in FIG.
FIG. 7 is a detailed block diagram illustrating a multi-channel audio signal in the frequency domain for each frequency, comparing the magnitude of the multi-channel audio signal for each frequency, assigning the smallest signal, and assigning a channel; It comprises a correlation coefficient processor 132 which determines whether or not to transmit according to the number of relations and allocates a channel.

【００１３】図３は図１において量子化器１５０の詳細
ブロック図であって、ビット割当器１４０でのビット割
当の結果を用いて周波数領域の多チャネルオーディオ信
号を量子化する第１量子化器１５１と、第１量子化器１
５１で量子化された信号を量子化以前の信号として復元
する逆量子化器１５２と、逆量子化器１５２で復元され
たオーディオ信号と周波数領域の多チャネルオーディオ
信号とを混合してステレオ信号を形成するステレオ信号
形成器１５３と、ビット割当器１４０でのビット割当の
結果を用いてステレオ信号形成器１５３から出力される
周波数領域のステレオオーディオ信号を量子化する第２
量子化器１５４と、第１量子化器１５１で量子化された
信号と第２量子化器１５４で量子化された信号とを用い
てビットストリームを形成するビットストリーム形成器
１５５より構成される。FIG. 3 is a detailed block diagram of the quantizer 150 shown in FIG. 1. The first quantizer quantizes a multi-channel audio signal in the frequency domain using the result of bit allocation by the bit allocator 140. 151 and the first quantizer 1
An inverse quantizer 152 for restoring the signal quantized at 51 as a signal before quantization, and an audio signal restored by the inverse quantizer 152 and a multi-channel audio signal in the frequency domain are mixed to form a stereo signal. A second stereo signal generator 153 for quantizing a frequency-domain stereo audio signal output from the stereo signal generator 153 using the bit allocation result of the bit allocator 140;
It comprises a quantizer 154 and a bit stream former 155 for forming a bit stream using the signal quantized by the first quantizer 151 and the signal quantized by the second quantizer 154.

【００１４】図４は図２において相関係数処理器１３２
の動作を説明するためのフローチャートであり、４００
段階ないし４０３段階は与えられた多チャネルオーディ
オ信号のうち最も高い周波数で信号間の相関係数を求
め、相関係数を相互比較して最も大きい相関係数を有す
るチャネルを求める過程であり、４０４段階ないし４０
６段階は最も大きい相関係数を有するチャネルがステレ
オチャネルであるかを判断して当たる処理を行う過程で
あり、４０７段階ないし４１０段階は予想ビット発生量
が使用可能なビット量より多い場合、選ばれた周波数領
域の全てのオーディオ信号に対して前記過程を繰り返す
過程である。FIG. 4 is a block diagram of the correlation coefficient processor 132 shown in FIG.
4 is a flowchart for explaining the operation of FIG.
Steps 403 to 403 are steps of obtaining a correlation coefficient between signals at the highest frequency among the given multi-channel audio signals, and comparing the correlation coefficients with each other to obtain a channel having the largest correlation coefficient. Stage or 40
Step 6 is a step of determining whether the channel having the largest correlation coefficient is a stereo channel and performing a corresponding process. Steps 407 to 410 are selected when the expected bit generation amount is larger than the available bit amount. This is a process of repeating the above process for all audio signals in the obtained frequency domain.

【００１５】さて、本発明の動作を図１乃至図４を参照
して説明する。The operation of the present invention will be described with reference to FIGS.

【００１６】まず、図１を参照すれば、入力される多チ
ャネルオーディオ信号はフィルタ１００を通じて周波数
領域の信号に変わるようになる。Referring to FIG. 1, an input multi-channel audio signal is converted into a signal in a frequency domain through a filter 100.

【００１７】与えられた時間領域のオーディオ信号は既
存のステレオシステムで再生するに適合な形態に第１ミ
キサー１６０で混合される。一方、フィルタ１００で周
波数領域に変わったオーディオ信号も第２ミキサー１１
０で混合されてステレオシステムに当たる周波数領域の
オーディオ信号が形成される。The given time-domain audio signal is mixed by the first mixer 160 in a form suitable for reproduction on an existing stereo system. On the other hand, the audio signal changed to the frequency domain by the filter 100 is also transmitted to the second mixer 11.
The audio signals in the frequency domain corresponding to the stereo system are formed by mixing at 0.

【００１８】第１ミキサー１６０から出力される時間領
域のステレオオーディオ信号と元の時間領域の多チャネ
ルオーディオ信号は心理音響部１７０に入力されてそれ
ぞれのマスキングレベルが計算される。The stereo audio signal in the time domain output from the first mixer 160 and the multi-channel audio signal in the original time domain are input to the psychoacoustic unit 170 and their masking levels are calculated.

【００１９】周波数領域での元の多チャネルオーディオ
信号とステレオオーディオ信号はＡＰＣＭ前処理部１２
０に入力されてＡＰＣＭに必要な各周波数別オーディオ
信号の大きさ情報が計算される。The original multi-channel audio signal and stereo audio signal in the frequency domain are combined with the APCM pre-processing unit 12.
0 is inputted to calculate the magnitude information of the audio signal for each frequency required for the APCM.

【００２０】ＡＰＣＭ前処理部１２０から出力されるＡ
ＰＣＭに必要な各周波数別オーディオ信号の大きさ情報
と、心理音響部１７０から出力されるマスキングレベル
値は多チャネル処理器１３０に入力されてチャネル間の
相関関係とチャネル間の信号レベルの大きさの違いに応
じて伝送する信号と伝送しなくても良い信号とを区別す
る。この区別が不要な理由は伝送チャネルのチャネル容
量が与えられた多チャネル信号と新たに形成されたステ
レオ信号の両方を伝送するほど大きくないからである。
多チャネル処理器１３０の動作を図２を参照してさらに
詳細に説明すると次の通りである。A output from APCM preprocessing unit 120
The magnitude information of the audio signal for each frequency required for the PCM and the masking level value output from the psychoacoustic unit 170 are input to the multi-channel processor 130, and the correlation between channels and the magnitude of the signal level between channels are input. The signal to be transmitted is distinguished from the signal that does not need to be transmitted according to the difference. This distinction is unnecessary because the channel capacity of the transmission channel is not large enough to transmit both a given multi-channel signal and a newly formed stereo signal.
The operation of the multi-channel processor 130 will be described in more detail with reference to FIG.

【００２１】図２において、チャネル割当器１３１はＡ
ＰＣＭ前処理部（図１の１２０）から与えられた信号間
の大きさ情報を利用してオーディオ信号を伝送するか否
かを定めるが、即ち、多チャネルオーディオ信号を各周
波数別に大きさを比較して最も小さい信号を伝送する信
号として定める。一方、一定な時間区間で平均大きさが
最も小さい信号を伝送する信号として定める場合もあ
る。In FIG. 2, the channel allocator 131
It is determined whether to transmit the audio signal using the magnitude information between the signals given from the PCM preprocessing unit (120 in FIG. 1), that is, the magnitude of the multi-channel audio signal is compared for each frequency. And determine the smallest signal to be transmitted. On the other hand, there is a case where the signal having the smallest average magnitude is determined as a signal to be transmitted in a certain time section.

【００２２】相関係数処理器１３２は各チャネル間の相
関係数を利用してオーディオ信号を伝送するか否かを定
める。これを図４に示されたフローチャートを参照して
さらに詳細に説明すれば次の通りである。The correlation coefficient processor 132 determines whether to transmit an audio signal using the correlation coefficient between channels. This will be described in more detail with reference to the flowchart shown in FIG.

【００２３】図４において、４００段階では時間領域の
多チャネル信号から変換された周波数周波数領域の多チ
ャネル信号と周波数領域の多チャネル信号とを混合して
作った周波数領域のステレオ信号を入力し、４０１段階
〜４０３段階では最も高い周波数領域で最も大きさ相関
係数を有するチャネルを選択した後、４０４段階〜４０
５段階ではこのチャネルのうち一つがステレオチャネル
である場合、これに合うチャネルは大きさ成分のみ残し
て伝送しないと定める。４０６段階では若し元の多チャ
ネル中の二つが最も大きい相関係数を有すると、二つの
チャネルの平均値を一チャネルに入れ、残りチャネルは
大きさ成分のみ残して伝送しない。In FIG. 4, at step 400, a frequency-domain stereo signal formed by mixing a frequency-domain multi-channel signal converted from a time-domain multi-channel signal and a frequency-domain multi-channel signal is input. In steps 401 to 403, after selecting a channel having the largest correlation coefficient in the highest frequency region, steps 404 to 40 are performed.
In step 5, when one of the channels is a stereo channel, it is determined that the channel corresponding to the stereo channel is not transmitted except for the size component. In step 406, if two of the original multi-channels have the largest correlation coefficient, the average value of the two channels is put into one channel, and the other channels do not transmit, leaving only the magnitude component.

【００２４】４０７段階〜４０８段階では予想ビット発
生量を計算して使用可能なビット量より少ない場合本フ
ローチャートを終了し、大きい場合選ばれた周波数領域
の全てのオーディオ信号が処理されたかを判断する（４
０９段階）。In steps 407 to 408, the expected bit generation amount is calculated and, if less than the available bit amount, this flowchart is terminated. If it is larger, it is determined whether all audio signals in the selected frequency domain have been processed. (4
09 stage).

【００２５】４０９段階での判断の結果、選ばれた周波
数領域の信号の全てが処理された場合、４１０段階で最
も高い周波数領域より低い周波数領域の信号を選択して
４０２段階に復帰し、全部処理されない場合４１１段階
でその次の大きい相関係数を有するチャネルを選んで前
述したように同一な方法で処理する。As a result of the determination in step 409, if all the signals in the selected frequency domain have been processed, a signal in a frequency domain lower than the highest frequency domain is selected in step 410, and the process returns to step 402. If not, in step 411, the channel having the next larger correlation coefficient is selected and processed in the same manner as described above.

【００２６】このように過程を繰り返して全てのチャネ
ルを使用しても生成されるデータの量が多ければ、その
下の周波数領域に行って全体過程を繰り返す。この際、
生成されるデータ量は伝送される信号のマスキングレベ
ルより所定値ほど低い水準に量子化歪曲が生じるように
するビット数から算出することができる。If the amount of data to be generated is large even when all the channels are used by repeating the above-described process, the process is repeated in the frequency domain below the generated data. On this occasion,
The amount of data to be generated can be calculated from the number of bits that causes quantization distortion to occur at a level lower than the masking level of the transmitted signal by a predetermined value.

【００２７】心理音響部１７０で計算された各チャネル
のマスキングレベル、多チャネル処理器１３０の処理結
果、ＡＰＣＭ前処理部１２０の処理結果とを利用してビ
ット割当器１４０では各オーディオ信号にビットを割当
てる。即ち、多チャネル処理器１３０で伝送しないと定
められた信号にはビットを割り当てず、伝送されると定
められた信号とステレオ信号に対してのみビットを割当
てる。Using the masking level of each channel calculated by the psychoacoustic unit 170, the processing result of the multi-channel processor 130, and the processing result of the APCM preprocessing unit 120, the bit allocator 140 assigns bits to each audio signal. Assign. That is, bits are not assigned to a signal determined not to be transmitted by the multi-channel processor 130, but bits are assigned only to a signal determined to be transmitted and a stereo signal.

【００２８】量子化器１５０ではビット割当器１４０の
結果を用いてフィルタ１００の出力信号に対してＡＰＣ
Ｍ量子化を遂行し、この結果を他の情報と共に伝送す
る。これを図３を参照して、さらに詳細に説明すれば、
量子化器１５０は先ず割当てられたビットを利用してフ
ィルタ１００から出力される元の他チャネル信号を符号
化した後、この信号を復号化して伝送しない信号と共に
混合して更にステレオ信号を作る。このステレオ信号を
割当てられたビットを利用して符号化する。The quantizer 150 uses the result of the bit allocator 140 to apply APC to the output signal of the filter 100.
Perform M quantization and transmit the result together with other information. This will be described in more detail with reference to FIG.
The quantizer 150 first encodes the original other channel signal output from the filter 100 using the allocated bits, and then decodes this signal and mixes it with a signal not to be transmitted to produce a further stereo signal. The stereo signal is encoded using the allocated bits.

【００２９】[0029]

【発明の効果】前述したように本発明による多チャネル
オーディオ符号化器及び符号化方法ではＡＰＣＭと各チ
ャネル間の相関関係を利用して多チャネルのオーディオ
信号を符号化することにより、少ない数の伝送チャネル
でも多くのチャネルのオーディオ信号を伝送することが
でき、既存のＡＰＣＭ復号化器と互換されるように伝送
することが可能である。As described above, in the multi-channel audio encoder and the encoding method according to the present invention, a multi-channel audio signal is encoded using the correlation between the APCM and each channel, so that a small number of audio signals are encoded. Audio signals of many channels can be transmitted through the transmission channel, and can be transmitted in a manner compatible with existing APCM decoders.

【００３０】また、復号化してステレオシステムで再生
することも可能であり、伝送されたステレオ信号で更に
多チャネルオーディオ信号を作ったり、この信号を再び
混合してスレテオ信号とする場合にも符号化歪曲が耳に
は聞こえなくなる。It is also possible to decode the data and reproduce it on a stereo system. When a multi-channel audio signal is further formed from the transmitted stereo signal, or when this signal is mixed again to form a stereo signal, the encoding is performed. Distortion becomes inaudible.

[Brief description of the drawings]

【図１】本発明による多チャネルオーディオ符号化器の
一実施例によるブロック図である。FIG. 1 is a block diagram of a multi-channel audio encoder according to an embodiment of the present invention.

【図２】図１において多チャネル処理器の詳細ブロック
図である。FIG. 2 is a detailed block diagram of a multi-channel processor in FIG.

【図３】図１において量子化器の詳細ブロック図であ
る。FIG. 3 is a detailed block diagram of a quantizer in FIG. 1;

【図４】図２において相関係数処理器の動作を説明する
ためのフローチャートである。FIG. 4 is a flowchart for explaining the operation of the correlation coefficient processor in FIG. 2;

フロントページの続き (56)参考文献特開平１−318327（ＪＰ，Ａ) 特開平４−360331（ＪＰ，Ａ) 特開平５−227039（ＪＰ，Ａ) 特表平８−501666（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) H03M 7/38 G10L 19/00 H04B 14/04 Continuation of front page (56) References JP-A-1-318327 (JP, A) JP-A-4-360331 (JP, A) JP-A-5-227039 (JP, A) , A) (58) Fields investigated (Int. Cl. ⁷ , DB name) H03M 7/38 G10L 19/00 H04B 14/04

Claims

(57) [Claims]

1. A first mixer for inputting a multi-channel audio signal existing in a time domain to form a stereo audio signal, and inputting a multi-channel audio signal converted to a frequency domain to form a stereo audio signal. A psycho-acoustic unit for calculating a masking level of each channel by inputting a stereo audio signal in the time domain and a multi-channel audio signal in the time domain output from the first mixer; Inputting the stereo audio signal in the frequency domain output from the second mixer and the magnitude information of each frequency signal for the multi-channel audio signal in the frequency domain, and the masking level calculated from the psychoacoustic unit, and Determine whether to transmit audio signals and assign channels. A multi-channel processor for assigning, a bit allocation unit for inputting a result of channel allocation of the multi-channel processor and the masking level and allocating bits for each audio signal; and a bit allocation in the bit allocator. And a quantizer for APCM-quantizing the frequency-domain multi-channel audio signal using the result of the multi-channel audio signal.

2. The multi-channel processor, comprising: a channel allocator that compares the magnitude of the multi-channel audio signal in the frequency domain for each frequency, determines the signal having the smallest magnitude as a signal to be transmitted, and allocates a channel; 2. The multi-channel system according to claim 1, further comprising a correlation coefficient processor for calculating a correlation coefficient between the respective channels, and determining whether or not to transmit according to the calculated correlation coefficient. Audio encoder.

3. The channel allocator according to claim 2, wherein a signal having the smallest average magnitude for a predetermined time in each frequency band of the multi-channel audio signal is determined as a transmission signal, and a channel is allocated. A multi-channel audio encoder as described.

4. The correlation coefficient processor determines that when a stereo channel is selected as a channel having the largest correlation coefficient in the highest frequency band, a signal corresponding to the stereo channel is not transmitted while leaving only a magnitude component. 3. The multi-channel audio encoder according to claim 2, wherein:

5. The correlation coefficient processor calculates an average value of audio signals transmitted to the two channels when one of the two channels is selected as a channel having the largest correlation coefficient in the highest frequency band. 3. The multi-channel audio encoder according to claim 2, wherein the remaining channel is determined not to be transmitted while leaving only the magnitude component.

6. The correlation coefficient processor selects a channel having the largest correlation coefficient for the highest frequency band, determines whether or not to transmit according to the characteristics of the channel, and determines the highest frequency band. 3. The multi-channel audio encoder according to claim 2, wherein when all the multi-channel audio signals are processed for the band, the processing is repeated for a frequency region below the multi-channel audio signal.

7. The correlation coefficient processor calculates an expected bit generation amount using the masking level, and when the expected bit generation amount is larger than an available bit amount, the correlation coefficient processor calculates the expected bit generation amount with respect to the lower frequency domain. 7. The multi-channel audio encoder according to claim 6, wherein the processing steps are repeated.

8. The first quantizer, wherein the quantizer quantizes the multi-channel audio signal in the frequency domain using a result of bit allocation in the bit allocator; An inverse quantizer for restoring the quantized signal as a signal before quantization; and a stereo for forming a stereo signal by mixing the audio signal restored by the inverse quantizer with the multi-channel audio signal in the frequency domain. A signal quantizer; a second quantizer for quantizing a frequency-domain stereo audio signal output from the stereo signal generator using a result of the bit allocation in the bit allocator; and the first quantizer. And a bit stream generator for forming a bit stream using the signal quantized in step (a) and the signal quantized in the second quantizer. Multi-channel audio encoder according.

9. A multi-channel audio encoder for encoding an audio signal input to a multi-channel, comprising: a masking level of each channel calculated according to the stereo audio signal in the time domain and the multi-channel audio signal; The size information of each frequency signal for the stereo audio signal and the multi-channel audio signal in the frequency domain, and whether or not to transmit the audio signal of each channel is determined based on the correlation between the channels, and the channel is determined. A multi-channel processing step of allocating, a bit allocation step of allocating bits to each audio signal according to a result of channel allocation in the multi-channel processing step and the masking level, and a result of bit allocation in the bit allocation step. The multi-channel audio signal in the frequency domain A multi-channel audio encoding method, comprising: a quantization step of performing APCM quantization.

10. The quantization step includes: a first quantization step of APCM-quantizing a frequency-domain multi-channel audio signal using bits allocated in the bit allocation step; and a quantization step in the first quantization step. An inverse quantization step of inversely quantizing the decoded signal to restore the decoded signal; and a stereo for forming a stereo signal by mixing the audio signal restored in the inverse quantization step and the multi-channel audio signal in the frequency domain. A signal forming step; and a second quantization step of further APCM-quantizing a frequency-domain stereo audio signal formed in the stereo signal forming step using the bits allocated in the bit allocation step. The multi-channel audio encoding method according to claim 9.