200939207 ; 六、發明說明: • 【發明所屬之技術領域】 本發明係關於一種音頻訊號處理技術,特別係關於一種音頻 訊號之編碼及解碼方法及其裝置。 【先前技術】 通常’音頻訊號編碼裝置壓縮一音頻訊號為一單聲道(mono) 或身歷聲(stereo)類型之縮混(d〇Wnmix)訊號,而不是壓縮多通道音 ❹頻訊號之各個通道。音頻訊號編碼裝置傳輸壓縮之縮混訊號連同 一空間資訊訊號(或者辅助資料訊號)至一解碼裝置,或者儲存此壓 縮縮混訊號及此空間資訊訊號於一儲存媒體中。 這種情況下’在縮混多通道音頻訊號提取之空間資訊訊號, 係用於自一壓縮縮混訊號復原一初始多通道音頻訊號。 空間-貝訊號包含一標頭(^ea(jer)及一空間資訊。並且,組態 資訊係包含於此標頭中。標頭為用以解釋空間資訊之資訊。 一音頻afl號解碼裝置利用包含於標頭中之組態資訊解碼空間 身訊’包含於標頭巾之組態資訊被傳輸至—解碼裝置,或者連同 空間資訊一起儲存於一儲存媒體中。 一音頻戒號編碼裝置多工(multiplex) —編碼縮混訊號及此空 間資訊訊號-起為一位元流形式,並接著傳輸此多工訊號至一解 .馬裝1_由於組態資訊通常為不變的,因此包含有組態資訊之標 •頭被插入-位7L流中一次。由於傳輸之組態資訊係初始插入音頻 訊號-人因此叙如自一任意時間點複製此音頻訊號時,由於不 4 200939207 存在、、:aHfl貞彳音頻訊號解碼裝置在解碼空間資訊時存在一問 題也就疋-兒’假如在廣播、隨選視訊或類似情況下,由於音頻 訊號係經由者之請求自—特定時間點複製 ,而不是由初始部 稷製’因此音頻訊號解碼裝置不能夠使用包含於音頻訊號中傳輸 之組態資訊。由此,不能夠解竭空間資訊。 【發明内容】 #於以上的問題,本發明的主要目的在於提供-種音頻訊號 之編碼及解碼方献其裝置,其透過使得標縣擇性地包含於空 間資訊訊號之框(frame)中,藉以能夠解碼此音頻訊號。 本發明之另-目的在於提供—種音頻峨之編碼及解碼方法 及其震置,其透過使得複數個標頭包含於—朗資織號中,進 而即使音頻訊號自-隨機點複製,也能夠解碼此音頻訊號。 人因此’為達上述目的’本發明所揭露之音頻訊號解碼方法包 ❺ W下步驟:接收包含有-縮混訊號及-空間資訊訊號之音頻訊 如果-標頭係包含於空間資訊訊號中,則自標頭中提取組態 =訊;提取空間資訊訊號包含之空_訊;以及藉由組態資訊及 空間資訊轉換縮混訊號為一多通道訊號。 【實施方式】 有關本發明的特徵與實作,茲配合圖式作最佳實施例詳細說 明如下。 11 0 為了理解本發明,於音頻訊號解碼裝置及方法之前,首先對 、音頻訊號之編碼裝置及方法作出解釋。然而,本發明之解碣裝置 5 200939207 及方法並不侷限於下文之解碼裝置及其方法。並且,本發明適用 於利用空間肓訊產生多通道之音頻編褐方案,以及 1/2-層瓜)及進階音頻編碼(AAC)等。 「第1圖」為本發明-實關之音頻訊狀構造示意圖其 中音頻訊號自s頻訊號編碼裝置傳輸至音頻訊號解碼裝置。 請參考「第1圖」’ -音頻訊號包含—音頻描述符1G1,一縮 混訊被103以及'空間貢訊訊號105。 © 假如使用編碼方案用以複製廣播或類似物之音頻訊號,音頻 頻描㈣1G1及縮混訊號1〇3。本 發明可以包含空間資訊訊號1〇5為輔助資料。為了使音頻訊號解 碼裝置不需分析音頻訊號而瞭解音頻之編碼解碼器(c〇dec)之基本 資訊,音頻訊號可以選擇性地包含音頻描述符1〇1。音頻描述符 101由音頻解碼必須之少量基本資驗成,例如傳輪音頻訊號之傳 輸率,通道數量,壓縮資料之取樣解,指示當前使用編碼解碼 〇 器之識別符,及其它類似物等等。 音頻訊號解碼裝置藉由音健述符1G1,能_解音頻訊號所 使用之編碼譯碼器之類型。特別地,利用音頻描述符101,音頻“訊 號解碼裝置能夠瞭解接收之音頻訊號是否是利用空間資1訊: 105及縮混訊號103復原一多通道之訊號。這時,多通首了、勹人 一虛擬三維環繞(surround)以及一實際多通道。 3 稽由二維環繞技 術’透過一或多工通道使得具有結合一起之空 呤又工間資訊訊號105及 縮混訊號103之音頻訊號可聽見。 6 200939207 ; I健述符1G1之定⑽社於音頻減所包含之縮混訊號 • 103或空間資訊訊號105,例如,音頻描述符1〇1定位於一顯示音 頻訊號之獨立區域中。 如果;^頭/又有提供至縮混訊號103,音頻訊號解碼裝置則能夠 利用音頻描述符101來解碼縮混訊號1〇3。 縮混訊號103為產生於多通道縮混的訊號。縮混訊號1〇3能 夠產生自音頻訊號解碼裝置(圖中未顯示)所包含之一縮混單元(圖 © 中未顯示)’或者人工產生。 縮混訊號103能夠分類至包含空間資訊訊號1〇5之一組内, 或者不包含標頭之一組内。 如果縮混贿1G3包含標頭,標酬藉由—轉元包含於各 個框中。如果航訊號1G3不包♦標頭,如上文所描述,縮混訊 號103能夠透過一音頻訊號解碼裝置使用音頻描述符1〇1進行解 碼。縮混訊號103或者具有包含標頭用於各個框之形式,或者具 ❹有不包含標頭之形式。並且,縮混訊號1〇3以相同方式包含於一 音頻訊號中,直至内容結束。 空間資訊訊號105也可以分類至包含有標頭及空間資訊之一 組内,以及僅包含空間資訊不包含標頭之一組内。空間資訊訊號 1〇5之標頭區別於縮混訊號103之標頭,由於其不必要相同地插入 各個框中。特別地,空間資訊訊號1〇5能夠同時使用包含有標頭 -之框及不包含標頭之框。包含於空間資訊訊號105之標頭中的大 多數資訊為組態資訊,其透過解釋空間資訊來解碼空間資訊。 7 200939207 : 「第2圖」為本發明之另一實施例之音頻訊號之構造示意圖, ‘ 其中此音頻訊號自一音頻訊號編碼裝置傳輸至一音頻訊號解碼裝 置。 請參考「第2圖」’一音頻訊號包含縮混訊號1〇3及空間資訊 訊號105。並且,音頻訊號以框排列之基本位元流(ES)形式存在。 各個縮混訊號103及空間資訊訊號1〇5偶爾以一獨立基本位 元流形式傳輸至一音頻訊號解碼裝置。並且如「第2圖」顯示, ❹縮混訊號1〇3及空間資訊信號1〇5能夠結合為一基本位元流形 式’以傳輸至音頻訊號解碼裝置。 假如縮混訊號103及空間資訊訊號1〇5結合為一基本位元流 形式’且傳輸至音頻訊號解碼裝置,則空間資訊訊號1〇5能夠包 含於縮混訊號103之辅助資料或附加資料(延伸資料)之位置。 並且’音頻訊號可包含訊號識別資訊,以顯示空間資訊訊號 105是否結合於縮混訊號103。 © 空間資訊訊號1〇5之框能夠被分類為包含有標頭201及空間 資訊203之一組,以及僅包含空間資訊203之一組。特別地,空 間資訊訊號105能夠同時使用包含標頭2〇1之框以及不包含標頭 201之框。 在本發明中’標頭201插入空間資訊訊號1〇5中至少一次。 特別地’音頻訊號編碼裝置可以插入標頭201於空間資訊訊號105 - 之各個框中’週期性地插入標頭201於空間資訊訊號105之框的 各個固定間隔中,或者非週期性地插入標頭201於空間資訊訊號 8 200939207 :105之框的各個任意間隔中。 • 音頻訊號可包含用以指示標頭201是否包含於框中之資訊。 如果標頭201係包含於空間資訊訊號105中,音頻訊號解碼 裝置則自標頭201中提取組態資訊205,並接著依照此組態資訊 205解碼於標頭2〇1之後傳輸之空間資訊203。由於標頭2〇1為透 過解釋空間資訊203用來解碼之資訊,因此標頭201則於音頻訊 號傳輸之較早階段被傳輸。 ❹ 如果標頭201不包含於空間資訊訊號1〇5中,音頻訊號解碼 裝置則利用較早階段傳輸之標頭201來解碼空間資訊2〇3。 如果標頭201在音頻訊號由音頻編碼裝置傳輸至音頻解碼裝 置之同時丟失,或者如果以位元流形式傳輸之音頻訊號自其中間 部解碼以用於廣播或類似物等,音頻訊號解碼裝置則不能夠使用 先前傳輸之標頭20b這種情況下,音頻訊號解碼裝置自標頭2〇1 中提取組態資訊205’其中此標頭2〇1係區別於最初插入音頻訊號 中之先⑴t頭201 ’並且然、後能夠利用提取之組態資訊挪解碼此 曰頻《。這時,自插入音頻訊號中之標頭2〇1提取之組態資訊 2〇5可以等同於或區別於先前之組態資訊挪,其中此先前組態資 訊205提取自較早階段傳輸之標頭2〇1。 如果^頭2〇1為變化的,組態資訊2〇5可提取自一新的標頭 , 提取的組心訊205被解碼,並且然後於標頭201之後傳輪 ,^工間貝訊2〇3被解碼。如果標頭2〇ι為不變的,則需要判斷新 杯頭2〇ι疋否等同於之前傳輸之舊標頭2⑽。如果上述兩種標頭 9 200939207 :2〇1係互相不同的’則能夠偵測-錯誤出現於音頻棘傳輸路徑上 * 之音頻訊號中。 自空間資観號1〇5之標頭201提取之組態資訊2〇5為用於 解釋空間資訊203之資訊。 空間資訊訊號105能夠包含下面資訊(下文中稱作為,,時間對 準資訊(time align information)")’用以在透過音頻訊號解碼裝置 使用縮混訊號103及空間資訊訊號105產生多通道之過程中區別 〇 兩訊號之時間延遲偏差。 自音頻訊號編碼裝置傳輸至音頻訊號解碼裝置之音頻訊號透 過一解多工單元(demultiplexing unit)(圖中未顯示)剖析,並接著被 分離至縮混訊號103及空間資訊訊號1〇5中。 由解多工單元分離之縮混訊號103被解碼,解碼之縮混訊號 103利用空間資訊訊號1〇5產生一多通道。在透過結合縮混訊號 103及空間資訊訊號105產生多通道時,音頻訊號解碼裝置能夠藉 © 由包含於組態資訊205中之時間對準資訊(圖中未顯示),以調整兩 訊號之同步,結合兩訊號之起點位置及類似物,其中組態資訊2〇5 係提取自空間資訊訊號105之標頭201。 時間槽之位置資訊207係包含於空間資訊訊號1〇5之空間資 訊203中’其中一參數將應用於此時間槽。作為一空間參數(空間 尾接(spatial cue)) ’其中存在有指示音頻訊號間之能源偏差之通道 ' 級別(channel level)偏差,指示音頻訊號之親密性及相似度之通道 ’ 間相關(interchannel correlations),指示利用其它訊號預測音頻訊號 200939207 值之係數之通道預測係數(channel predicti〇n c〇棚⑵她)。下文中, 各個空間尾接或郎尾接束紐稱作〃參數、200939207; VI. Description of the invention: • Technical field to which the invention pertains The present invention relates to an audio signal processing technique, and more particularly to a method and apparatus for encoding and decoding an audio signal. [Prior Art] Usually, the 'audio signal encoding device compresses an audio signal into a mono (mono) or stereo type (d〇Wnmix) signal instead of compressing each of the multi-channel audio and video signals. aisle. The audio signal encoding device transmits the compressed downmix signal together with a spatial information signal (or auxiliary data signal) to a decoding device, or stores the compressed downmix signal and the spatial information signal in a storage medium. In this case, the spatial information signal extracted in the downmix multi-channel audio signal is used to recover an initial multi-channel audio signal from a compressed downmix signal. The space-bein signal contains a header (^ea(jer) and a spatial information. And the configuration information is included in this header. The header is information for interpreting spatial information. An audio afl decoding device is utilized The configuration information decoding space contained in the header is configured to transmit the configuration information contained in the header to the decoding device, or stored together with the spatial information in a storage medium. An audio ring coding device is multiplexed ( Multiplex — The coded downmix signal and this spatial information signal are in the form of a one-bit stream, and then the multiplex signal is transmitted to a solution. The horse is installed 1_ Since the configuration information is usually unchanged, it contains the group. The header of the state information is inserted into the bit 7L stream once. Since the configuration information of the transmission is the initial insertion of the audio signal - the person thus copies the audio signal from an arbitrary time point, since there is no 4 200939207, : The aHfl贞彳 audio signal decoding device has a problem in decoding spatial information. If it is in the broadcast, on-demand video or the like, the audio signal is requested by the user. Copying, rather than being initialized by the initial part', the audio signal decoding apparatus cannot use the configuration information contained in the audio signal. Therefore, the spatial information cannot be exhausted. [Summary of the Invention] The main object of the invention is to provide an apparatus for encoding and decoding audio signals, which can be selectively included in a frame of a spatial information signal, thereby enabling decoding of the audio signal. - The purpose is to provide an audio coding and decoding method and its impact, which can be decoded by causing a plurality of headers to be included in the Langzi weaving number, so that even if the audio signal is copied from the random point, the audio signal can be decoded. Therefore, the audio signal decoding method disclosed in the present invention includes the following steps: receiving an audio signal including a down-mixed signal and a spatial information signal, if the header is included in the spatial information signal. , extracting the configuration = message from the header; extracting the space information signal including the space__; and converting the downmix signal by configuring the information and the spatial information A multi-channel signal. [Embodiment] The features and implementations of the present invention are described in detail below with reference to the preferred embodiment. 11 0 In order to understand the present invention, prior to the audio signal decoding apparatus and method, first, The audio signal encoding apparatus and method are explained. However, the decoding apparatus 5 200939207 and the method of the present invention are not limited to the following decoding apparatus and method thereof. Moreover, the present invention is applicable to the generation of multi-channel audio coding by using spatial communication. Brown scheme, as well as 1/2-layer melon) and Advanced Audio Coding (AAC). The "Fig. 1" is a schematic diagram of the audio signal structure of the present invention - the audio signal is transmitted from the s-frequency signal encoding device to the audio signal decoding device. Please refer to "Figure 1" - the audio signal contains - audio descriptor 1G1, a reduced mix 103 and a 'space tribute signal 105. © If you use a coding scheme to copy audio signals from radio or similar, audio frequency (4) 1G1 and downmix signal 1〇3. The present invention may include spatial information signal 1〇5 as auxiliary material. In order to enable the audio signal decoding device to understand the basic information of the audio codec without analyzing the audio signal, the audio signal may optionally include the audio descriptor 1〇1. The audio descriptor 101 is verified by a small amount of basic information necessary for audio decoding, such as the transmission rate of the transmission audio signal, the number of channels, the sampling solution of the compressed data, the identifier of the currently used codec, and the like. . The audio signal decoding apparatus can decode the type of the codec used by the audio signal by the tone descriptor 1G1. In particular, with the audio descriptor 101, the audio "signal decoding device can know whether the received audio signal is using the spatial resource: 105 and the downmix signal 103 to recover a multi-channel signal. At this time, the multi-channel first, deaf A virtual three-dimensional surround and an actual multi-channel. 3 The two-dimensional surround technology 'through one or more channels makes the audio signal with the combined space and the inter-work information signal 105 and the downmix signal 103 audible. 6 200939207 ; I statement 1G1 (10) in the audio subtraction included in the downmix signal • 103 or spatial information signal 105, for example, the audio descriptor 1〇1 is located in a separate area of the display audio signal. The ^ head/ is provided to the downmix signal 103, and the audio signal decoding device can decode the downmix signal 1〇3 by using the audio descriptor 101. The downmix signal 103 is a signal generated by multi-channel downmixing. 1〇3 can be generated from one of the downmixing units (not shown in the figure) included in the audio signal decoding device (not shown) or manually generated. The downmix signal 103 can be classified to include empty In the group of information signals 1〇5, or not included in one of the headers. If the 1G3 is included in the header, the counterfeit is included in the box. If the flight number 1G3 is not included Header, as described above, the downmix signal 103 can be decoded by an audio signal decoding device using the audio descriptor 1-1. The downmix signal 103 either has a form including a header for each frame, or does not include The form of the header. Moreover, the downmix signal 1〇3 is included in an audio signal in the same manner until the end of the content. The spatial information signal 105 can also be classified into one of the groups including the header and the spatial information, and only includes The spatial information does not include a group of headers. The header of the spatial information signal 1〇5 is different from the header of the downmix signal 103, since it is not necessarily inserted into the respective frames in the same manner. In particular, the spatial information signal 1〇5 The box containing the header - and the header without the header can be used at the same time. Most of the information contained in the header of the spatial information signal 105 is configuration information, which decodes spatial information by interpreting spatial information. 7200939207: a schematic configuration of another embodiment of an audio signal "FIG. 2" embodiment of the present invention, 'in which the signal from the audio encoding apparatus of an audio signal is transmitted to an audio signal decoding apparatus. Please refer to "Figure 2". An audio signal contains a downmix signal 1〇3 and a spatial information signal 105. Also, the audio signal exists in the form of a basic bit stream (ES) arranged in a frame. Each of the downmix signal 103 and the spatial information signal 1〇5 is occasionally transmitted to an audio signal decoding apparatus in the form of a separate basic bit stream. And as shown in Fig. 2, the collapsed mixed signal 1〇3 and the spatial information signal 1〇5 can be combined into a basic bit stream form for transmission to the audio signal decoding apparatus. If the downmix signal 103 and the spatial information signal 1〇5 are combined into a basic bit stream form and transmitted to the audio signal decoding device, the spatial information signal 1〇5 can be included in the auxiliary data or additional data of the downmix signal 103 ( Extend the location of the data). And the audio signal may include signal identification information to indicate whether the spatial information signal 105 is combined with the downmix signal 103. The frame of the spatial information signal 1〇5 can be classified into a group including the header 201 and the spatial information 203, and only one of the spatial information 203. In particular, the spatial information signal 105 can simultaneously use a frame containing the header 2〇1 and a frame containing no header 201. In the present invention, the header 201 is inserted into the spatial information signal 1〇5 at least once. In particular, the 'audio signal encoding device can insert the header 201 in each frame of the spatial information signal 105 - periodically inserting the header 201 into each fixed interval of the frame of the spatial information signal 105, or inserting the label non-periodically. The header 201 is in any arbitrary interval of the frame of the spatial information signal 8 200939207:105. • The audio signal may contain information indicating whether the header 201 is included in the box. If the header 201 is included in the spatial information signal 105, the audio signal decoding device extracts the configuration information 205 from the header 201, and then decodes the spatial information 203 transmitted after the header 2〇1 according to the configuration information 205. . Since the header 2〇1 is the information used for decoding by interpreting the spatial information 203, the header 201 is transmitted at an earlier stage of the audio signal transmission. ❹ If the header 201 is not included in the spatial information signal 1〇5, the audio signal decoding apparatus decodes the spatial information 2〇3 by using the header 201 transmitted at an earlier stage. If the header 201 is lost while the audio signal is transmitted from the audio encoding device to the audio decoding device, or if the audio signal transmitted in the form of a bit stream is decoded from the middle portion for broadcasting or the like, the audio signal decoding device In the case where the previously transmitted header 20b cannot be used, the audio signal decoding apparatus extracts the configuration information 205' from the header 2〇1, wherein the header 2〇1 is different from the first (1)t header of the originally inserted audio signal. 201 'and then, can use the extracted configuration information to read and decode this frequency." At this time, the configuration information 2〇5 extracted from the header 2〇1 in the inserted audio signal may be identical to or different from the previous configuration information, wherein the previous configuration information 205 is extracted from the header transmitted at an earlier stage. 2〇1. If the header 2〇1 is changed, the configuration information 2〇5 can be extracted from a new header, the extracted group heart 205 is decoded, and then transmitted after the header 201, ^Works Beixun 2 〇3 is decoded. If the header 2〇ι is constant, it is necessary to determine whether the new cup 2〇ι疋 is equivalent to the old header 2(10) previously transmitted. If the above two headers 9 200939207 : 2 〇 1 are different from each other, then it is possible to detect - an error occurs in the audio signal of the audio spline transmission path *. The configuration information 2〇5 extracted from the header 201 of the space resource number 1〇5 is information for interpreting the spatial information 203. The spatial information signal 105 can include the following information (hereinafter referred to as "time align information") for generating a multi-channel using the downmix signal 103 and the spatial information signal 105 through the audio signal decoding device. In the process, the time delay deviation between the two signals is distinguished. The audio signal transmitted from the audio signal encoding device to the audio signal decoding device is parsed through a demultiplexing unit (not shown) and then separated into the downmix signal 103 and the spatial information signal 1〇5. The downmix signal 103 separated by the demultiplexing unit is decoded, and the decoded downmix signal 103 generates a multi-channel using the spatial information signal 1〇5. When the multi-channel is generated by combining the downmix signal 103 and the spatial information signal 105, the audio signal decoding device can adjust the synchronization of the two signals by using the time alignment information (not shown) included in the configuration information 205. , combining the starting position of the two signals and the like, wherein the configuration information 2〇5 is extracted from the header 201 of the spatial information signal 105. The time slot location information 207 is included in the spatial information signal 203 of the spatial information signal 〇5. One of the parameters will be applied to this time slot. As a spatial parameter (spatial cue), there is a channel's channel level deviation indicating the energy deviation between audio signals, indicating the intimacy and similarity of audio signals. Correlations), indicating channel prediction coefficients (channel predicti〇nc〇(2) she) that use other signals to predict the coefficient of the audio signal 200939207. In the following, each space tail or Lang tail bundle is called the 〃 parameter,
假如N個參數存在於空間資訊訊號105所包含之框中,則N 個參數被77別應用於各框之特定時間槽位置。如果用以指示參數 即將應用於框中包含之其中—時間槽之#訊被命名為時間槽之位 置資訊207,則音頻訊號解碼裝置利用此將應用有參數之時間槽之 位置貝4 207,以解碼空間資訊2〇3。這時,此參數係包含於空間 ❹資訊203中。 「第3圖」為本發明一實施例之音頻訊號解碼裝置之方塊圖。 請參考「第3 ®」’依照本發明之實施例之音観號解碼裝置 係包含一接收單元301及一提取單元303。 音頻訊號解碼裝置之接收單元3〇1藉由一輸入終端四丨,並透 過音頻訊號解碼裝置來接收以基本位元流形式傳輸之音頻訊號。 由音頻訊號解碼裝置接收之音頻訊號包含一音頻描述符l〇i © 與一縮混訊號103 ’並可以更包含空間資訊訊號105作為輔助資料 或附加資料(延伸資料)。 音頻訊號解碼裝置之提取單元303提取組態資訊205自接收 音頻訊號包含之標頭201 ’並隨後藉由一輸出終端〇UT1輸出提取 之組態資訊205。 音頻訊號可包含標頭識別資訊,用以識別標頭2〇1是否包含 \ 於一框中。 '· 音頻訊號解碼裝置藉由包含於音頻訊號中之標頭識別資訊, 200939207 以識別標頭201是否包含於此框中。如果標頭2〇ι包含其中則 ‘音頻訊號解碼裝置自標頭加提取組態資訊暮在本發明中,至 少一標頭201包含於空間資訊訊號1〇5中。 「第4圖」為本發明之另—實施例之音頻訊號解碼裝置之方 塊圖。 明參考「第4圖」,本發明之另一實施例之音頻訊號解碼裝置 下包含接收單元301,解多工單元401,核心解碼單元403,多通 © 道產生單元405,空間資訊解碼單元4〇7及提取單元3〇3。 音頻訊號解碼裝置之接收單元301藉由一輸出終端取2,以接 收以位元流形式傳輸之音頻訊號自一音頻訊號編碼裝置。並且, 接收單元301發送此接收音頻訊號至解多工單元4〇1。 解多工單元401分離接收單元301發送之音頻訊號為一編碼 縮混訊號103及一編碼空間資訊訊號1〇5。解多工單元401傳輸自 一位元流分離之編碼縮混訊號1〇3至核心解碼單元403,並傳輸自 ® 位元流分離之編碼空間資訊訊號105至提取單元303。 編碼縮混訊號103由核心解碼單元403解碼,並然後傳輸至 多通道產生單元405。編碼空間資訊訊號1〇5包含標頭201及空間 資訊203。 如果標頭201包含於編碼空間資訊訊號1〇5中,提取單元303 則提取組態資訊205自標頭201。提取單元303藉由音頻訊號包含 ' 之標頭識別資訊’能夠區別標頭201之存在。特別地,標頭識別 ' 資訊可以表示標頭201是否包含於空間資訊訊號1〇5所包含之框 12 200939207 -中。標頭識別資訊可以指示框的順序或者音頻訊號之位元序列, •如果制201包含於框中,則自標頭2〇1提取之組態資訊2〇5包 含於此框順序或位元序列中。 藉由標頭識別資訊,如果判斷標頭2〇1包含於框中,則提取 單元303自包含於框中之標頭201提取組態資訊2〇5。然後,解碼 提取之組態資訊205。 空間資訊解碼單元407依照解碼之組態資訊2〇5,以解碼包含 ❹ 於框中之空間資訊203。 並且,多通道產生單元405利用解碼縮混訊號1〇3及解碼空 間資訊203 ’以產生-多通道訊號,並且然後藉由一輸出終端〇UT2 輸出產生之多通道訊號。 「第5圖」為本發明一實施例之音頻訊號解碼方法之流程圖。 請參考「第5圖」,-音頻訊號解碼裝置接收空間資訊訊號 105,其中空間資訊訊號105係由一音頻訊號編碼裝置以位元流形 〇 式傳輸(步驟501)。 如上文所述,空間資訊訊號1〇5能夠分類為以獨立於縮混訊 號103之位元流傳輸之一組,或者分類為結合縮混訊號1〇3 一起 傳輸之一組。 曰頻訊號之解多工單元401係分離所接收之音頻訊號為編碼 縮此訊號103及編碼空間資訊訊號1〇5。編碼空間資訊訊號1〇5 '包含標頭201及空間資訊203。如果標頭201包含於空間資訊訊號 105之框中,音頻訊號解碼裝置則識別標頭2〇1(步驟5〇3)。 13 200939207 音頻訊號解碼裝置提取組態資訊205自標頭201(步驟505)。 並且’音頻訊號解碼裝置利用提取之組態資訊205,以解碼空 間資訊203(步驟507)。 「第6圖」為本發明之另一實施例之音頻訊號解碼方法之流 程圖。 請參考「第6圖」,一音頻訊號解碼裝置接收空間資訊訊號 105,其中空間資訊訊號1〇5係由一音頻訊號編碼裝置以位元流形 © 式傳輸(步驟501)。 如上文所述,空間資訊訊號105能夠被分類為以獨立於縮混 訊號103之位元流傳輸之一組,或者分類為包含於縮混訊號1〇3 之輔助資料或延伸資料中一起傳輸之一組。 音頻訊號之解多工單元401分離接收之音頻訊號為編碼縮混 訊號103及編碼空間資訊訊號1〇5。編碼空間資訊訊號1〇5包含標 頭201及空間資訊203。音頻訊號解碼裝置判斷標頭2〇1是否包含 ❹ 於框中(步驟601)。 如果標頭201包含於框中,音頻訊號解碼裝置則識別標頭 2〇1(步驟 503)。 音頻訊號解碼裝置然後提取組態資訊205自標頭201(步驟 505) ° 音頻訊號解碼裝置判斷自標頭2〇1提取之組態資訊205是否 為從包含於空間資訊訊號1〇5中之第一標頭2〇1提取的組態資訊 • 205(步驟 603)。 200939207 : 如果組態資訊205提取自首先由音頻訊號提取的標頭201,音 • 頻訊號解碼裝置則解碼組態資訊205(步驟611),並依照此解碼組 態資訊205,以解碼於組態資訊2〇5之後傳輸之空間資訊203。 如果自音頻訊號提取之標頭201不是首先從空間資訊訊號 105提取之標頭201,音頻訊號解碼裝置則判斷自標頭201提取之 組態資訊205是否等同於自第一標頭2〇1提取之組態資訊2〇5(步 驟 605)。 ❹ 如果此提取組態資訊205等同於提取自第一標頭201之組態 資205’音頻訊號解碼裝置則利用此提取自標頭2〇1之解碼組態 資訊205,以解碼空間資訊2〇3。 如果此提取組態資訊205不同於從第一標頭2〇1提取之組態 >訊205, θ頻矾號解碼裝置則判斷是否一錯誤出現於由音頻訊號 編碼裝置至音親聽碼裝置之傳輸路徑上之音頻訊號(步驟 607) ° ❿ 如果此、、且態> 祝205為變化的,即使組態資訊2〇5不同於自 第-標頭2〇1提取之組態資訊2〇5,上述錯誤也不會出現。因此, 曰頻訊號解碼裝置更新標頭201為新的標頭2叫步驟609)。音頻 訊號解碼裝置然後解碼提取自此更新標頭201之組態資訊(步驟 611)。 依照此解碼組態資訊205,音頻訊號解碼裝置解碼於此組態資 訊205之後傳輪之空間資訊2Q3。 ,如果組態資訊挪為不變的,且不同於提取自第一標頭2〇1 15 200939207 : 之組態資訊205,則表示錯誤會出現於音頻訊號傳輸路徑上。因 . 此’音頻訊號解碼裝置除去包含於框中之空間資訊203,其中此空 間資訊203中包含有錯誤組態資訊205,或者修正空間資訊203 之錯誤(步驟613)。 「第7圖」為本發明之又一實施例之音頻訊號解碼方法之流 程圖。 請參考「第7圖」,一音頻訊號解碼装置接收空間資訊訊號 €> 1〇5,其中空間資訊訊號105係由一音頻訊號編碼裝置以位元流形 式傳輸(步驟501)。 音頻訊號之解多工單元401分離所接收音頻訊號為編碼縮混 訊號103及解碼空間資訊訊號105。這時,即將參數表示之時間槽 之位置資訊207係包含於此空間資訊訊號1〇5中。 音頻訊號解碼裝置從空間資訊203中提取時間槽之位置資訊 207(步驟 701)。 ® 音頻訊號解碼裝置利用提取之時間槽之位置資訊,透過調整 即將應用有一參數之時間槽位置’以應用此參數至對靡時間槽(步 驟 703)。 「第8圖」為本發明一實施例之獲得表示時間槽數目之位置 =貝訊之方法流程圖。表不時間槽數目之位置資訊為分配用以表示 此時間槽之位置資訊207之位元數目。 表示時間槽數目之位置資訊能夠透過以下步驟而發現,其中 ’一第-參數係應用於此時間槽:自此時間槽數目減去此參數數 16 200939207 •’目’增加1域減法絲,雜增讀_ 2為紅聽,並應 用-⑽1函數至此對數值。_地,麵翻有第—參數之時間 槽數目之位置資訊能夠透過公式ceil(10雜·i+1))而獲得,其中Y 與ί分別表示時間槽數目及參數數目。 假设NU然數,則表示應用有_产參數之時間槽 數目之位置資訊被表示為制有Nth參數之時間槽之位置資訊 07這時應用有N參數之時間槽之位置資訊2〇7能夠透過以 ©下步驟發現’即對存在應用有Nth參數之時間槽及應用有_产參 數之時間槽之間之時間槽,增加其數目至由㈣ώ參數應用之時 間槽之位置資§〖’以及增加1至此增加值(步驟8G1)。特別地, (N+1)參數應用之時間槽之位置資訊可透過公式』⑼+明+1)+1 獲得’其+ r(N+l)係指示存在於應用有阶#參數之時間槽及應 用有Ν&參數之時間槽之間之時間槽數目。 如果Νώ參數應用之時間槽之位置資訊207被發現,則能夠獲 得表不數目之時間槽位置資訊,以表示應用有⑼+丨产參數之時間 撕立置。特別地’透過自時間槽數目中減去應用於一框之參數數 目及應用有參數之時間槽位置資訊,以及增加料)至此減法 值’能夠發_示數目之日夺間槽位置資訊,以表示@+1产參數應 用之時間槽位置。特別地,透過公式ceil(1〇g2(k_i+N+H(N))),能 夠發現應用有(^1产參數之時間槽數目之位置資訊,其中"f、夕 及j(N)刀別表不時間槽數目、參數數目及參數表不之時 間槽之位置資訊205。 17 200939207 ' 如果以上述方式獲得表科㈣數目之位置資訊,則表示應 •用有_产參數之時間槽數目之位置資訊具有與„ n„成反比之 刀配位兀數目。即’依據夕N//,表示應用有阶#參數之時間槽 數目之位置資訊為一變化值。 第9圖」為本發明之再_實施例之音頻訊號解碼方法之流 程圖。 日頻减解碼裂置接收一音頻訊號自-音頻訊號編碼裝置 (v驟901) a頻錢包含音頻描述符⑻,縮混訊號⑽及空間 資訊訊號105。 音頻訊號解规取音頻錢包含之音頻描述符 101(步驟 9〇3)。指示音頻編碼解(⑺㈣之—識別符係包含於此音頻描述 101 中。 音頻訊號解碼裝置藉由錢描述符跡咖翻音頻訊號包 ^含有縮混訊號⑽及空間資訊訊號105。特別地,音頻訊號解碼裝 置利用空間資訊訊號105,能夠區別傳輸之音頻訊號為用以產生多 通道之訊號(步驟905)。 並且,音頻訊號解碼裝置藉由空間資訊訊號1〇5哺換縮混 訊號103為一多通道訊號。如上文所描述,標頭2〇1能夠以各個 預定間隔包含於空間資訊訊號105中。 工業應用 ’ 丨如上文所述’本發狀音舰號編碼及解碼方法及其裝置 能夠使得一標頭選擇性地包含於一空間資訊訊號中。 18 200939207 並且如果複數個標頭包含於空間資訊訊號中,本發明之立 頻訊號編碼及解碼方法及魏置錢解碼空間資訊,即使是^ 音頻訊號解碼裝置自任意點複製此音頻訊號。 〜雖然本發明以前述之較佳實施例揭露如上,然:其並非用以限 疋本發明,任何熟習相像技藝者,在不脫離本發明之精神和範圍 Ο 〇田可作些許之更動與潤飾,因此本發明之專利保護範圍須視 本祝明書所附之申請專利細所界定者為準。 【圖式簡單說明】 第1圖為本發明—實施例之音頻訊號之構造示意圖; 第2圖為本發明之另—實施例之音頻訊號之構造示意圖; 第3圖為本發明—實施例之音頻訊號解碼裝置之方塊圖; 第4圖為本發明之另一實施例之音頻訊號解碼裝置之方塊 Β ; 第5圖為本發明—實關之音舰贿碼方法之流麵; 第6圖為本發明之另一實施例之音頻訊號解碼方法之流程 第7圖為本發明之又一實施例之音頻訊號解碼方法之流程If N parameters exist in the frame contained in the spatial information signal 105, the N parameters are applied to the specific time slot position of each frame by 77. If it is used to indicate that the parameter is to be applied to the frame contained therein, the time slot is named as the time slot position information 207, the audio signal decoding device uses the position of the time slot in which the parameter is applied. Decode spatial information 2〇3. At this time, this parameter is included in the space ❹ information 203. Fig. 3 is a block diagram of an audio signal decoding apparatus according to an embodiment of the present invention. Please refer to "Third 3"". The audio signal decoding apparatus according to the embodiment of the present invention includes a receiving unit 301 and an extracting unit 303. The receiving unit 301 of the audio signal decoding device receives the audio signal transmitted in the form of a basic bit stream through an input terminal and through the audio signal decoding device. The audio signal received by the audio signal decoding device includes an audio descriptor l〇i © and a downmix signal 103 ′ and may further include a spatial information signal 105 as auxiliary data or additional data (extended data). The extracting unit 303 of the audio signal decoding device extracts the configuration information 205 from the header 201' included in the received audio signal and then outputs the extracted configuration information 205 through an output terminal 〇UT1. The audio signal may include header identification information to identify whether the header 2〇1 contains a \ in a box. '· The audio signal decoding device uses the header identification information included in the audio signal, 200939207 to identify whether the header 201 is included in the frame. If the header 2〇ι contains therein, then the 'audio signal decoding device extracts the configuration information from the header. In the present invention, at least one header 201 is included in the spatial information signal 1〇5. Fig. 4 is a block diagram of an audio signal decoding apparatus according to another embodiment of the present invention. Referring to FIG. 4, an audio signal decoding apparatus according to another embodiment of the present invention includes a receiving unit 301, a demultiplexing unit 401, a core decoding unit 403, a multi-channel source generating unit 405, and a spatial information decoding unit 4. 〇7 and extraction unit 3〇3. The receiving unit 301 of the audio signal decoding device takes 2 by an output terminal to receive the audio signal transmitted in the form of a bit stream from an audio signal encoding device. And, the receiving unit 301 sends the received audio signal to the demultiplexing unit 4〇1. The audio signal transmitted by the demultiplexing unit 401 and the receiving unit 301 is a coded downmix signal 103 and an encoded spatial information signal 1〇5. The demultiplexing unit 401 transmits the coded downmix signal 1〇3 separated from the bit stream to the core decoding unit 403, and transmits the coded spatial information signal 105 separated from the bit stream to the extracting unit 303. The code downmix signal 103 is decoded by the core decoding unit 403 and then transmitted to the multi-channel generating unit 405. The code space information signal 1〇5 includes a header 201 and spatial information 203. If the header 201 is included in the code space information signal 1〇5, the extraction unit 303 extracts the configuration information 205 from the header 201. The extracting unit 303 can distinguish the existence of the header 201 by the audio signal containing the 'header identification information'. In particular, the header identification 'information can indicate whether the header 201 is included in the frame 12 200939207 - contained in the spatial information signal 1〇5. The header identification information may indicate the order of the frames or the sequence of bits of the audio signal. • If the system 201 is included in the frame, the configuration information 2〇5 extracted from the header 2〇1 is included in the frame order or the bit sequence. in. By the header identifying information, if it is judged that the header 2〇1 is included in the frame, the extracting unit 303 extracts the configuration information 2〇5 from the header 201 included in the frame. The extracted configuration information 205 is then decoded. The spatial information decoding unit 407 decodes the spatial information 203 contained in the frame in accordance with the decoded configuration information 2〇5. Moreover, the multi-channel generating unit 405 uses the decoded downmix signal 1〇3 and the decoded spatial information 203' to generate a multi-channel signal, and then outputs the generated multi-channel signal through an output terminal 〇UT2. FIG. 5 is a flowchart of an audio signal decoding method according to an embodiment of the present invention. Please refer to Fig. 5, the audio signal decoding device receives the spatial information signal 105, wherein the spatial information signal 105 is transmitted by the audio signal encoding device in a bit stream (step 501). As described above, the spatial information signal 1〇5 can be classified as a group of bit stream transmissions independent of the downmix signal 103, or classified as a group of transmissions combined with the downmix signal 1〇3. The multiplexed signal 401 of the 曰 frequency signal is separated from the received audio signal by the encoded reduced signal 103 and the encoded spatial information signal 〇5. The coded space information signal 1〇5' includes a header 201 and spatial information 203. If the header 201 is included in the frame of the spatial information signal 105, the audio signal decoding device recognizes the header 2〇1 (step 5〇3). 13 200939207 The audio signal decoding device extracts the configuration information 205 from the header 201 (step 505). And the 'audio signal decoding apparatus uses the extracted configuration information 205 to decode the spatial information 203 (step 507). Fig. 6 is a flow chart showing an audio signal decoding method according to another embodiment of the present invention. Referring to FIG. 6, an audio signal decoding device receives a spatial information signal 105, wherein the spatial information signal 〇5 is transmitted by a voice signal encoding device in a bit stream format (step 501). As described above, the spatial information signal 105 can be classified as one of a group stream transmitted independently of the downmix signal 103, or classified as an auxiliary material or extension data included in the downmix signal 1〇3. A group. The audio signal separated by the audio signal multiplexing unit 401 is the encoded downmix signal 103 and the encoded spatial information signal 1〇5. The coded space information signal 1〇5 includes a header 201 and spatial information 203. The audio signal decoding means judges whether or not the header 2 〇 1 is included in the frame (step 601). If the header 201 is included in the frame, the audio signal decoding means recognizes the header 2〇1 (step 503). The audio signal decoding device then extracts the configuration information 205 from the header 201 (step 505). The audio signal decoding device determines whether the configuration information 205 extracted from the header 2〇1 is from the space information signal 1〇5. A header 2〇1 extracted configuration information • 205 (step 603). 200939207: If the configuration information 205 is extracted from the header 201 first extracted by the audio signal, the audio/frequency signal decoding device decodes the configuration information 205 (step 611), and decodes the configuration information 205 according to the decoding configuration information. Spatial information 203 transmitted after the information 2〇5. If the header 201 extracted from the audio signal is not the header 201 first extracted from the spatial information signal 105, the audio signal decoding apparatus determines whether the configuration information 205 extracted from the header 201 is equivalent to being extracted from the first header 2〇1. The configuration information 2〇5 (step 605). ❹ If the extracted configuration information 205 is equivalent to the configuration resource 205' audio signal decoding device extracted from the first header 201, the decoding configuration information 205 extracted from the header 2〇1 is used to decode the spatial information. 3. If the extracted configuration information 205 is different from the configuration > 205 extracted from the first header 2〇1, the θ-frequency decoding device determines whether an error occurs in the audio signal encoding device to the audiophile encoding device. The audio signal on the transmission path (step 607) ° ❿ If this, and the state > 205 is changed, even if the configuration information 2〇5 is different from the configuration information extracted from the first-header 2〇1 2 〇5, the above error will not appear. Therefore, the chirp signal decoding device updates the header 201 to a new header 2 as step 609). The audio signal decoding device then decodes the configuration information extracted from this update header 201 (step 611). According to the decoding configuration information 205, the audio signal decoding device decodes the spatial information 2Q3 of the transmission after the configuration information 205. If the configuration information is unchanged and different from the configuration information 205 extracted from the first header 2〇1 15 200939207 :, an error will appear on the audio signal transmission path. The 'audio signal decoding device' removes the spatial information 203 contained in the frame, wherein the spatial information 203 contains the error configuration information 205, or corrects the error of the spatial information 203 (step 613). Fig. 7 is a flow chart showing an audio signal decoding method according to still another embodiment of the present invention. Referring to Fig. 7, an audio signal decoding apparatus receives a spatial information signal € > 1, wherein the spatial information signal 105 is transmitted by a voice signal encoding apparatus in a bit stream (step 501). The audio signal demultiplexing unit 401 separates the received audio signals into a coded downmix signal 103 and a decoded spatial information signal 105. At this time, the position information 207 of the time slot indicated by the parameter is included in the space information signal 1〇5. The audio signal decoding means extracts the position information 207 of the time slot from the space information 203 (step 701). The audio signal decoding device uses the position information of the extracted time slot to adjust the time slot position to which a parameter is to be applied to apply this parameter to the time slot (step 703). Fig. 8 is a flow chart showing a method for obtaining the position of the number of time slots = Beixun according to an embodiment of the present invention. The location information indicating the number of time slots is the number of bits allocated to indicate the location information 207 of the time slot. The location information indicating the number of time slots can be found by the following steps, where the 'one-parameter parameter is applied to this time slot: since the number of time slots minus this parameter number 16 200939207 • 'mesh' increases 1 domain subtraction method, miscellaneous Add _ 2 for red listening and apply the -(10)1 function to this logarithmic value. _ Ground, the surface has the first parameter time. The position information of the number of slots can be obtained by the formula ceil (10 · · i +1)), where Y and ί respectively represent the number of time slots and the number of parameters. Assuming that the NU number is used, the position information indicating the number of time slots to which the parameter is applied is expressed as the position information of the time slot in which the Nth parameter is generated. At this time, the position information of the time slot to which the N parameter is applied can be transmitted through © The next step is to find the time slot between the time slot in which the Nth parameter is applied and the time slot in which the application has the parameter. Increase the number to the position of the time slot applied by the (4) parameter. § ' and increase 1 The value is added so far (step 8G1). In particular, the position information of the time slot of the (N+1) parameter application can be obtained by the formula 』(9)+明+1)+1, and its + r(N+l) indicates that the time slot exists in the application with the order # parameter. And the number of time slots between time slots with Ν & parameters. If the location information 207 of the time slot to which the parameter is applied is found, it is possible to obtain a time slot location information indicating the number of times the application has the (9) + production parameter tearing. Specifically, 'by subtracting the number of parameters applied to a frame from the number of time slots and applying the time slot position information of the parameter, and adding the material to the subtraction value can send the number of the day of the slot position information to Indicates the time slot location of the @+1 production parameter application. In particular, by using the formula ceil (1〇g2(k_i+N+H(N))), it is possible to find the location information of the number of time slots in which the parameter is generated, where "f, eve, and j(N) The number of time slots, the number of parameters, and the position information of the time slot of the parameter table are not listed in the 205. 17 200939207 ' If the position information of the number of the table (4) is obtained in the above manner, it means that the number of time slots with the _ production parameter should be used. The position information has a number of knives that are inversely proportional to „n„, that is, 'based on the eve N//, indicating that the position information of the number of time slots in which the order # parameter is applied is a change value. FIG. 9 is the present invention. The flowchart of the audio signal decoding method of the embodiment _. The frequency-reduction decoding split receives an audio signal from the audio-audio signal encoding device (v 901). The frequency includes the audio descriptor (8), the downmix signal (10) and the space. Information signal 105. The audio signal is extracted to the audio descriptor 101 included in the audio money (step 9〇3). The audio coding solution ((7)(4)-identifier is included in the audio description 101. The audio signal decoding device is provided by the money Descriptor trace coffee audio signal package ^ The audio signal decoding device (10) and the spatial information signal 105. In particular, the audio signal decoding device uses the spatial information signal 105 to distinguish the transmitted audio signal into a signal for generating a plurality of channels (step 905). The spatial information signal 1〇5 feeds down the mixed signal 103 as a multi-channel signal. As described above, the header 2〇1 can be included in the spatial information signal 105 at predetermined intervals. Industrial Applications ' As described above' The sounding ship number encoding and decoding method and apparatus thereof enable a header to be selectively included in a spatial information signal. 18 200939207 and if a plurality of headers are included in the spatial information signal, the vertical frequency signal of the present invention The encoding and decoding method and the Wei Qian Qian decoding spatial information, even if the audio signal decoding device copies the audio signal from any point. - Although the present invention is disclosed above in the preferred embodiment, it is not limited to According to the invention, any skilled person can make some changes and refinements without departing from the spirit and scope of the present invention. The scope of patent protection shall be subject to the definition of the patent application attached to the present specification. [Simplified description of the drawings] Fig. 1 is a schematic view showing the construction of an audio signal according to the present invention; - FIG. 3 is a block diagram of an audio signal decoding apparatus according to an embodiment of the present invention; FIG. 4 is a block diagram of an audio signal decoding apparatus according to another embodiment of the present invention; FIG. 6 is a flow chart of a method for decoding an audio signal according to another embodiment of the present invention; FIG. 7 is a flow chart of an audio signal decoding method according to another embodiment of the present invention. FIG. 7 is an audio signal decoding according to still another embodiment of the present invention. Method flow
El · 園, 弟8圖為本發明一實施例之獲得表示數量之位置資訊之方法 流程圖;以及 第9圖為本發明之再一實施例之音頻訊號解碼方法之流程 19 200939207 . 圖。 【主要元件符號說明】 101 103 105 201 © 203 205 207 301 303 401 403 ❹ 405 407 IN1 IN2 音頻描述符 縮混訊號 空間資訊訊號 標頭 空間資訊 組態資訊 時間槽之位置資訊 接收單元 提取單元 解多工單元 核心解碼單元 多通道產生單元 空間資訊解碼單元 輸入終端 輸出終端 步驟501 接收空間資訊訊號 步驟503 識別標頭 步驟505 提取組態資訊 步驟507 解碼空間資訊 20 200939207 : 步驟601 • 步驟603 步驟605 組態資訊? 步驟607 步驟609 步驟611 ❹ 步驟701 步驟703 步驟801 步驟803 置資訊 步驟901 步驟903 © 步驟905 道 存在標頭? 組態資訊提取自第一標頭? 提取之組態資訊是否等同於自第一標頭提取之 偵測到錯誤出現? 更新標頭 解碼組態資訊 提取時間槽之位置資訊 應用參數至時間槽 獲得應用有Νώ參數之時間槽之位置資訊 獲得表示應用有(N+l)th參數之時間槽數目之位 接收音頻訊號 提取音頻描述符 藉由空間資訊訊號識別音頻訊號是否產生多通 21El. Garden, FIG. 8 is a flow chart showing a method for obtaining position information of a quantity according to an embodiment of the present invention; and FIG. 9 is a flow chart of an audio signal decoding method according to still another embodiment of the present invention. 19 200939207 . [Main component symbol description] 101 103 105 201 © 203 205 207 301 303 401 403 ❹ 405 407 IN1 IN2 Audio Descriptor Downmix Signal Space Information Signal Header Space Information Configuration Information Time Slot Position Information Receiving Unit Extraction Unit Solution Unit Core Decoding Unit Multi-Channel Generating Unit Spatial Information Decoding Unit Input Terminal Output Terminal Step 501 Receiving Spatial Information Signal Step 503 Identifying Header Step 505 Extracting Configuration Information Step 507 Decoding Spatial Information 20 200939207: Step 601 • Step 603 Step 605 Group Information? Step 607 Step 609 Step 611 ❹ Step 701 Step 703 Step 801 Step 803 Set information Step 901 Step 903 © Step 905 Road Is there a header? Is the configuration information extracted from the first header? Is the extracted configuration information equivalent to the detection of an error from the first header extraction? Update the header decoding configuration information extraction time slot location information application parameter to the time slot to obtain the location information of the time slot to which the parameter is applied, and obtain the bit-receiving audio signal extraction indicating the number of time slots in which the (N+l)th parameter is applied. The audio descriptor identifies whether the audio signal generates multi-pass by the spatial information signal.