JP5134048B2

JP5134048B2 - Decoding apparatus and method, recording medium, and program

Info

Publication number: JP5134048B2
Application number: JP2010166191A
Authority: JP
Inventors: 数史佐藤; 輝彦鈴木; 修春原; 陽一矢ヶ崎
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2010-07-23
Filing date: 2010-07-23
Publication date: 2013-01-30
Anticipated expiration: 2022-04-26
Also published as: JP2010239667A

Description

The present invention relates to a decoding apparatus and method, a recording medium, and a program, and in particular to a decoding apparatus and method, a recording medium, and a program suitable for use when, for example, an image signal is encoded at a higher compression ratio than before and then transmitted or stored.

In recent years, apparatuses compliant with schemes such as MPEG (Moving Picture Experts Group) have become widespread both for information distribution by broadcasting stations and for information reception in ordinary households. These schemes handle images as digital signals and, in order to transmit and store those signals efficiently, compress them by orthogonal transforms such as the discrete cosine transform and by motion compensation, exploiting the redundancy peculiar to image information.

In particular, MPEG2 (ISO/IEC 13818-2) is a standard defined as a general-purpose image compression scheme. It covers both interlaced and progressive scan images as well as both standard-resolution and high-definition images and, as typified by the DVD (Digital Versatile Disc) standard, it is now widely used in a broad range of professional and consumer applications.

With the MPEG2 compression scheme, a high compression ratio and good image quality can be achieved by allocating a code amount (bit rate) of, for example, 4 to 8 Mbps to a standard-resolution interlaced image of 720 × 480 pixels, or 18 to 22 Mbps to a high-resolution interlaced image of 1920 × 1088 pixels.
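To give a feel for the compression ratios implied by these bit rates, the following sketch compares them with an uncompressed bit rate. The frame rate (30 frames per second) and the sampling format (12 bits per pixel, as in 8-bit 4:2:0 video) are assumptions made for illustration and are not stated in the text; the function names are hypothetical.

```python
def raw_bitrate_bps(width, height, fps=30, bits_per_pixel=12):
    """Uncompressed bit rate in bits per second (fps and bits_per_pixel are assumed)."""
    return width * height * bits_per_pixel * fps


def compression_ratio(width, height, coded_mbps, fps=30, bits_per_pixel=12):
    """Ratio of the uncompressed bit rate to the coded bit rate given in Mbps."""
    return raw_bitrate_bps(width, height, fps, bits_per_pixel) / (coded_mbps * 1_000_000)


if __name__ == "__main__":
    for (w, h), mbps in [((720, 480), 4), ((720, 480), 8),
                         ((1920, 1088), 18), ((1920, 1088), 22)]:
        print(f"{w}x{h} at {mbps} Mbps: about {compression_ratio(w, h, mbps):.1f}:1")
```

Under these assumptions the quoted bit rates correspond to compression ratios of a few tens to one.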

MPEG2 was aimed mainly at high-quality coding suitable for broadcasting and did not support coding at higher compression ratios. The MPEG4 coding scheme was therefore standardized for that purpose. Its image coding part was approved as the international standard ISO/IEC 14496-2 in December 1998.

Furthermore, in recent years, standardization of a scheme called H.26L (ITU-T Q6/16 VCEG), initially intended for image coding for video conferencing, has been advanced by ITU-T (International Telecommunication Union - Telecommunication Standardization Sector), the telecommunication standardization sector of the International Telecommunication Union.

H.26L is known to achieve higher coding efficiency than conventional coding schemes such as MPEG2 and MPEG4, although it requires a larger amount of computation for encoding and decoding.

In addition, as part of the MPEG4 activities, standardization of a coding technology that is based on H.26L, incorporates functions not supported by H.26L, and achieves still higher coding efficiency is currently under way jointly with ITU-T as the Joint Model of Enhanced-Compression Video Coding.

A conventional image information encoding apparatus that uses an orthogonal transform, such as the discrete cosine transform or the Karhunen-Loève transform, together with motion compensation is described below with reference to FIG. 1. FIG. 1 shows an example of the configuration of the conventional image information encoding apparatus.

In this image information encoding apparatus, an input image signal, which is an analog signal, is converted into a digital signal by an A/D conversion unit 1 and then supplied to a screen rearrangement buffer 2. The screen rearrangement buffer 2 rearranges the frames of the image information from the A/D conversion unit 1 according to the GOP (Group of Pictures) structure of the compressed image information that the apparatus outputs.

First, an image to be intra (within-image) coded is described. For an image to be intra coded, the screen rearrangement buffer 2 supplies the image information to an orthogonal transform unit 4 via an adder 3.

The orthogonal transform unit 4 applies an orthogonal transform (such as the discrete cosine transform or the Karhunen-Loève transform) to the image information and supplies the resulting transform coefficients to a quantization unit 5. The quantization unit 5 quantizes the transform coefficients supplied from the orthogonal transform unit 4 under the control of a rate control unit 8, which acts on the basis of the amount of transform coefficient data accumulated in an accumulation buffer 7.

A lossless encoding unit 6 determines an encoding mode from the quantized transform coefficients, the quantization scale, and other information supplied from the quantization unit 5, and applies lossless coding (such as variable-length coding or arithmetic coding) to the determined encoding mode to form the information to be inserted into the header part of each image coding unit. The encoded encoding mode is supplied to and accumulated in the accumulation buffer 7, and is then output to the subsequent stage as compressed image information.

The lossless encoding unit 6 also applies lossless coding to the quantized transform coefficients and has the encoded transform coefficients accumulated in the accumulation buffer 7. The encoded transform coefficients accumulated in the accumulation buffer 7 are likewise output to the subsequent stage as compressed image information.

An inverse quantization unit 9 inversely quantizes the transform coefficients quantized by the quantization unit 5. An inverse orthogonal transform unit 10 applies an inverse orthogonal transform to the inversely quantized transform coefficients to generate decoded image information, which is stored in a frame memory 11.
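The quantization and inverse quantization steps described above can be illustrated with a minimal sketch. It assumes a uniform scalar quantizer with a single step size, which the text does not specify; the function names are hypothetical. The point it shows is that inverse quantization recovers only an approximation of the original coefficients, which is why the encoder stores its own decoded image in the frame memory rather than the original.

```python
def quantize(coeffs, step):
    """Map transform coefficients to integer levels (uniform quantizer, assumed)."""
    return [int(round(c / step)) for c in coeffs]


def dequantize(levels, step):
    """Map integer levels back to approximate coefficient values."""
    return [level * step for level in levels]


if __name__ == "__main__":
    coeffs = [101, -37, 13, 3]
    levels = quantize(coeffs, step=8)
    approx = dequantize(levels, step=8)
    print(levels, approx)  # the reconstruction differs from the input coefficients
```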

Next, an image to be inter (between-image) coded is described. For an image to be inter coded, the screen rearrangement buffer 2 supplies the image information to the adder 3 and to a motion prediction/compensation unit 12.

The motion prediction/compensation unit 12 reads from the frame memory 11 the image information to be referenced for the image to be inter coded from the screen rearrangement buffer 2, performs motion prediction and compensation to generate reference image information, and supplies it to the adder 3. The motion vector information obtained during the motion prediction and compensation is supplied to the lossless encoding unit 6.

The adder 3 converts the reference image information from the motion prediction/compensation unit 12 into a difference signal between it and the image information of the image to be inter coded from the screen rearrangement buffer 2.

When an image to be inter coded is processed, the orthogonal transform unit 4 applies an orthogonal transform to the difference signal and supplies the resulting transform coefficients to the quantization unit 5. The quantization unit 5 quantizes the transform coefficients supplied from the orthogonal transform unit 4 under the control of the rate control unit 8.

The lossless encoding unit 6 determines an encoding mode on the basis of the transform coefficients quantized by the quantization unit 5, the quantization scale, the motion vector information supplied from the motion prediction/compensation unit 12, and other information, and applies lossless coding to the determined encoding mode to generate the information to be inserted into the header part of each image coding unit. The encoded encoding mode is accumulated in the accumulation buffer 7 and then output as compressed image information.

The lossless encoding unit 6 also applies lossless coding to the motion vector information from the motion prediction/compensation unit 12 to generate information to be inserted into the header part of each image coding unit.

The processing from the inverse quantization unit 9 onward for an image to be inter coded is the same as for an image to be intra coded, so its description is omitted.

Next, a conventional image information decoding apparatus that receives the compressed image information output by the conventional image information encoding apparatus of FIG. 1 and restores the image signal is described with reference to FIG. 2. FIG. 2 shows an example of the configuration of the conventional image information decoding apparatus.

In this image information decoding apparatus, the input compressed image information is temporarily stored in an accumulation buffer 21 and then transferred to a lossless decoding unit 22. The lossless decoding unit 22 applies lossless decoding (such as variable-length decoding or arithmetic decoding) to the compressed image information according to its predetermined format, obtains the encoding mode information stored in the header part, and supplies it to an inverse quantization unit 23. Likewise, the lossless decoding unit 22 obtains the quantized transform coefficients and supplies them to the inverse quantization unit 23. In addition, when the frame to be decoded is inter coded, the lossless decoding unit 22 also decodes the motion vector information stored in the header part of the compressed image information and supplies it to a motion prediction/compensation unit 28.

The inverse quantization unit 23 inversely quantizes the quantized transform coefficients supplied from the lossless decoding unit 22 and supplies the resulting transform coefficients to an inverse orthogonal transform unit 24. The inverse orthogonal transform unit 24 applies an inverse orthogonal transform (such as the inverse discrete cosine transform or the inverse Karhunen-Loève transform) to the transform coefficients according to the predetermined format of the compressed image information.

If the frame being processed is intra coded, the image information that has undergone the inverse orthogonal transform is stored in a screen rearrangement buffer 26 via an adder 25, converted into an analog signal by a D/A conversion unit 27, and output to the subsequent stage. This image information is also stored in a frame memory 29.

If the frame being processed is inter coded, the motion prediction/compensation unit 28 generates a reference image from the motion vector information supplied by the lossless decoding unit 22 and the image information stored in the frame memory 29, and supplies it to the adder 25. The adder 25 combines the reference image from the motion prediction/compensation unit 28 with the output of the inverse orthogonal transform unit 24 to generate image information. The remaining processing is the same as for an intra-coded frame, so its description is omitted.
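The inter-frame reconstruction performed by the motion prediction/compensation unit and the adder can be sketched as follows. This is an illustration under stated assumptions, not the apparatus itself: motion vectors are integer-pel, the displaced block lies entirely inside the reference picture, samples are clipped to the 8-bit range, and all function names are hypothetical.

```python
def motion_compensate(ref, top, left, mv, size):
    """Fetch the size x size block of `ref` displaced from (top, left) by mv = (dy, dx)."""
    dy, dx = mv
    return [[ref[top + dy + r][left + dx + c] for c in range(size)]
            for r in range(size)]


def reconstruct(pred, residual):
    """Add the decoded residual to the prediction and clip to 0..255 (assumed range)."""
    return [[max(0, min(255, p + e)) for p, e in zip(p_row, e_row)]
            for p_row, e_row in zip(pred, residual)]


if __name__ == "__main__":
    ref = [[r * 10 + c for c in range(4)] for r in range(4)]  # toy reference picture
    pred = motion_compensate(ref, top=0, left=0, mv=(1, 1), size=2)
    print(reconstruct(pred, [[1, -2], [300, -100]]))
```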

H.26L defines two lossless coding schemes: UVLC (Universal Variable Length Code), a type of variable-length coding, and CABAC (Context-based Adaptive Binary Arithmetic Coding), a type of arithmetic coding. The user can select either UVLC or CABAC as the lossless coding scheme. Information indicating whether the lossless coding scheme is UVLC or CABAC is specified in a field called Entropy Coding, contained in the RTP Parameter Set Packet of the RTP layer in the compressed image information.

Arithmetic coding, the class of coding to which CABAC belongs, is described first. In arithmetic coding, an arbitrary message (composed of multiple alphabet symbols) is represented as a single point in the half-open interval 0.0 ≤ x < 1.0, and the code is generated from the coordinate of that point.

First, the half-open interval 0.0 ≤ x < 1.0 is divided into subintervals, one per symbol, on the basis of the occurrence probabilities of the symbols that make up the alphabet.

FIG. 3 shows an example of the occurrence probabilities of symbols s_1 to s_7 and the corresponding division into subintervals. In arithmetic coding, as shown in FIG. 3, the upper and lower limits of each subinterval are determined from the cumulative occurrence probabilities of the symbols. The lower limit of the subinterval for symbol s_i (i = 1, 2, ..., 7) is the upper limit of the subinterval for symbol s_(i-1) (0.0 for s_1), and the upper limit of the subinterval for s_i is its lower limit plus the occurrence probability of s_i.
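The rule for the subinterval limits can be sketched as a cumulative sum. The probabilities below are hypothetical placeholders, since the values of FIG. 3 are not reproduced in the text, and the function name is an assumption. Exact fractions are used to avoid rounding effects.

```python
from fractions import Fraction


def build_subintervals(probs):
    """Return {symbol: (lower, upper)} from an ordered {symbol: probability} mapping."""
    table = {}
    low = Fraction(0)
    for sym, p in probs.items():
        table[sym] = (low, low + p)  # upper limit = lower limit + probability
        low += p                     # next lower limit = this upper limit
    return table


if __name__ == "__main__":
    # Hypothetical probabilities, not those of FIG. 3.
    probs = {"s1": Fraction(1, 2), "s2": Fraction(1, 4), "s3": Fraction(1, 4)}
    for sym, (lo, hi) in build_subintervals(probs).items():
        print(sym, float(lo), float(hi))
```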

Suppose that the message (s_2 s_1 s_3 s_6 s_7) is input. Here s_7 is a terminal symbol indicating the end of the message, and the message ends when the terminal symbol appears. For the message (s_2 s_1 s_3 s_6 s_7), the arithmetic coding method computes, as shown in FIG. 4, the subinterval corresponding to each symbol of the message in turn. That is, the interval allocated in FIG. 3 is subdivided according to the cumulative occurrence probabilities for the next symbol. The subinterval finally obtained is the interval containing the values that represent the message, so any value within this interval allows the message to be restored uniquely. For coding efficiency, however, the message is represented by a number within that half-open interval that can be expressed as a sum of powers of 2.
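The interval subdivision described above can be sketched as follows. The subinterval table is a hypothetical three-symbol example, not the seven-symbol table of FIG. 3, and the function names are assumptions. The decoder shows that any value inside the final interval restores the message, provided the table covers the whole of 0.0 ≤ x < 1.0.

```python
from fractions import Fraction


def encode_interval(message, table):
    """Narrow [0, 1) once per symbol and return the final (low, high)."""
    low, high = Fraction(0), Fraction(1)
    for sym in message:
        s_low, s_high = table[sym]
        width = high - low
        low, high = low + width * s_low, low + width * s_high
    return low, high


def decode_value(value, table, end_symbol):
    """Recover the symbols from a value inside the final interval."""
    out = []
    low, high = Fraction(0), Fraction(1)
    while not out or out[-1] != end_symbol:
        width = high - low
        for sym, (s_low, s_high) in table.items():
            lo, hi = low + width * s_low, low + width * s_high
            if lo <= value < hi:
                out.append(sym)
                low, high = lo, hi
                break
        else:
            raise ValueError("value lies outside every subinterval")
    return out


# Hypothetical table; "end" plays the role of the terminal symbol.
TABLE = {
    "a": (Fraction(0), Fraction(1, 2)),
    "b": (Fraction(1, 2), Fraction(3, 4)),
    "end": (Fraction(3, 4), Fraction(1)),
}

if __name__ == "__main__":
    low, high = encode_interval(["b", "a", "end"], TABLE)
    print(low, high, decode_value((low + high) / 2, TABLE, "end"))
```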

In this example, given the values listed in expression (1), the value of expression (2) lies in the half-open interval 0.21164 ≤ x < 0.2117 and therefore represents the message.

2^-1 = 0.5
2^-2 = 0.25
2^-3 = 0.125
2^-4 = 0.0625
2^-5 = 0.03125
2^-6 = 0.015625
2^-7 = 0.0078125
2^-8 = 0.00390625
2^-9 = 0.001953125
2^-10 = 0.0009765625
2^-11 = 0.00048828125
2^-12 = 0.000244140625
...
... (1)

2^-3 + 2^-4 + 2^-6 + 2^-7 + 2^-11 + 2^-12 = 0.211669921875   ... (2)

Therefore, a code length of 12 bits, enough to express 2^-1 through 2^-12, suffices for the code corresponding to the message (s_2 s_1 s_3 s_6 s_7), and the message is encoded as (001101100011).
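The 12-bit code can be checked mechanically. The sketch below searches for the shortest binary fraction that falls inside the half-open interval 0.21164 ≤ x < 0.2117 given in the text; the function name and the search strategy are assumptions made for illustration, not the procedure of the original.

```python
import math
from fractions import Fraction


def shortest_binary_fraction(low, high, max_bits=64):
    """Return the shortest bit string b1..bn with low <= 0.b1..bn (binary) < high."""
    for n in range(1, max_bits + 1):
        k = math.ceil(low * 2 ** n)      # smallest multiple of 2^-n not below low
        if Fraction(k, 2 ** n) < high:
            return format(k, f"0{n}b")
    raise ValueError("no binary fraction found within max_bits")


if __name__ == "__main__":
    bits = shortest_binary_fraction(Fraction("0.21164"), Fraction("0.2117"))
    print(bits, float(Fraction(int(bits, 2), 2 ** len(bits))))
```

For this interval the search stops at 12 bits, and the resulting bit pattern has ones exactly at positions 3, 4, 6, 7, 11, and 12, matching expression (2).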

Next, CABAC as defined in H.26L is described. Details of CABAC are disclosed in Non-Patent Document 1. Compared with UVLC, which is also defined in H.26L, CABAC has the following three features.

The first feature is that redundancy between symbols can be eliminated by using an appropriate context model for each symbol to be encoded and performing arithmetic coding based on separate, independent probability models.

The second feature is that arithmetic coding can allocate a non-integer code amount (number of bits) to each symbol, so that a coding efficiency close to the entropy can be obtained.

The third feature is that adaptive coding makes it possible to follow changes in the statistics. The statistics of motion vectors, for example, are not constant: they differ not only with bit rate and sequence but also spatially and temporally.

FIG. 5 shows the general configuration of a CABAC encoder. In this CABAC encoder, for an arbitrary syntax element in the compressed image information, a context modeling unit 31 first converts the symbol of the syntax element into an appropriate context model according to the past history. This kind of modeling is called context modeling. The context models for the individual syntax elements in the compressed image information are described later.

A binarization unit 32 binarizes symbols that are not yet binary. In an adaptive binary arithmetic coding unit 33, a probability estimation unit 34 estimates the probability of each binarized symbol, and a coding engine 35 applies adaptive arithmetic coding based on that estimate. After the adaptive arithmetic coding has been performed, the related model is updated, so each model can code in accordance with the statistics of the actual compressed image information.
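The binarize, estimate, and update cycle described above can be illustrated with a deliberately simplified sketch. Both the unary binarization and the count-based probability estimator are assumptions chosen for brevity; they are not the binarization tables or the probability estimator defined for CABAC in H.26L, and all names are hypothetical.

```python
def unary_binarize(value):
    """Unary code (assumed binarization): `value` ones followed by a terminating zero."""
    return [1] * value + [0]


class AdaptiveBitModel:
    """Count-based estimate of P(bit = 1) for one context model (illustrative only)."""

    def __init__(self):
        self.counts = [1, 1]  # start from a uniform estimate

    def p_one(self):
        return self.counts[1] / sum(self.counts)

    def update(self, bit):
        # Called after each bin is coded so the model tracks the actual statistics.
        self.counts[bit] += 1


if __name__ == "__main__":
    model = AdaptiveBitModel()
    for bit in unary_binarize(3):
        print(f"bin={bit} P(1) before coding={model.p_one():.2f}")
        model.update(bit)
```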

The context models used to arithmetically code the macroblock type (MB_type), the motion vector information (MVD), and the reference frame parameter (Ref_frame), which are syntax elements in the compressed image information, are described below.

Context model generation for MB_type is described separately for intra frames and inter frames.

In an intra frame, when macroblocks A, B, and C are arranged as shown in FIG. 6, the context model ctx_mb_type_intra(C) corresponding to the MB_type of macroblock C is defined by expression (3). In an intra frame, the macroblock mode is either Intra4×4 or Intra16×16.
ctx_mb_type_intra(C) = A + B   ... (3)

In expression (3), A is 0 when macroblock A is Intra4×4 and 1 when it is Intra16×16. Likewise, B is 0 when macroblock B is Intra4×4 and 1 when it is Intra16×16. The context model ctx_mb_type_intra(C) therefore takes one of the values 0, 1, or 2.
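Expression (3) can be written as a small function. The string constants used to represent the macroblock modes are an assumption made for illustration.

```python
INTRA_4X4 = "Intra4x4"      # hypothetical mode labels
INTRA_16X16 = "Intra16x16"


def ctx_mb_type_intra(mode_a, mode_b):
    """Expression (3): each neighbor contributes 0 for Intra4x4 and 1 for Intra16x16."""
    return int(mode_a == INTRA_16X16) + int(mode_b == INTRA_16X16)


if __name__ == "__main__":
    for a in (INTRA_4X4, INTRA_16X16):
        for b in (INTRA_4X4, INTRA_16X16):
            print(a, b, ctx_mb_type_intra(a, b))
```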

In an inter frame, when macroblocks A, B, and C are arranged as shown in FIG. 6, the context model ctx_mb_type_inter(C) corresponding to the MB_type of macroblock C is defined by expression (4) when the inter frame is a P picture, and by expression (5) when it is a B picture.
ctx_mb_type_inter(C) = ((A==Skip)?0:1) + ((B==Skip)?0:1)   ... (4)
ctx_mb_type_inter(C) = ((A==Direct)?0:1) + ((B==Direct)?0:1)   ... (5)

In expression (4), the operator ((A==Skip)?0:1) yields 0 when macroblock A is in Skip mode and 1 when it is not. Likewise, the operator ((B==Skip)?0:1) yields 0 when macroblock B is in Skip mode and 1 when it is not.

In expression (5), the operator ((A==Direct)?0:1) yields 0 when macroblock A is in Direct mode and 1 when it is not. The operator ((B==Direct)?0:1) yields 0 when macroblock B is in Direct mode and 1 when it is not.

Therefore, the context model ctx_mb_type_inter(C) corresponding to the MB_type of macroblock C in an inter frame takes one of three values (0, 1, or 2) for a P picture, and likewise one of three values for a B picture.
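Expressions (4) and (5) differ only in the mode that is tested, so they can be sketched as one function. The string labels for picture types and macroblock modes are assumptions made for illustration.

```python
def ctx_mb_type_inter(mode_a, mode_b, picture_type):
    """Expression (4) for P pictures (tests Skip) and (5) for B pictures (tests Direct)."""
    tested = {"P": "Skip", "B": "Direct"}[picture_type]
    return (0 if mode_a == tested else 1) + (0 if mode_b == tested else 1)


if __name__ == "__main__":
    print(ctx_mb_type_inter("Skip", "Inter16x16", "P"))   # one neighbor is not Skip
    print(ctx_mb_type_inter("Direct", "Direct", "B"))     # both neighbors are Direct
```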

Next, context model generation for the motion vector information (MVD) is described.

The motion vector information for the macroblock of interest, contained in the compressed image information, is coded as a prediction error relative to the motion vectors of the adjacent macroblocks. For the macroblock of interest C among macroblocks A, B, and C arranged as shown in FIG. 7, an evaluation function e_k(C) is defined by expression (6), where k = 0 denotes the horizontal component and k = 1 the vertical component.
e_k(C) = |mvd_k(A)| + |mvd_k(B)|   ... (6)

In expression (6), mvd_k(A) and mvd_k(B) are the motion vector prediction errors for macroblocks A and B, respectively, which are adjacent to macroblock C.

Regarding expression (6), in a case such as when macroblock C lies at the left edge of the picture frame and one of macroblocks A and B does not exist, no information on the motion vector prediction error mvd_k(A) or mvd_k(B) can be obtained, so the corresponding term on the right-hand side of expression (6) is ignored. The context model ctx_mvd(C, k) corresponding to e_k(C) defined in this way is given by expressions (7-1) to (7-3).
ctx_mvd(C, k) = 0   if e_k(C) < 3   ... (7-1)
ctx_mvd(C, k) = 1   if 32 < e_k(C)   ... (7-2)
ctx_mvd(C, k) = 2   if 3 ≤ e_k(C) ≤ 32   ... (7-3)
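Expressions (6) and (7-1) to (7-3) can be sketched as follows. Representing a missing neighbor by `None` is an assumption made for illustration; its term is simply dropped from the sum, as the text describes. The function names are hypothetical.

```python
def evaluation(mvd_a, mvd_b):
    """Expression (6): sum of absolute prediction errors; a missing neighbor is None."""
    return sum(abs(v) for v in (mvd_a, mvd_b) if v is not None)


def ctx_mvd(e):
    """Expressions (7-1) to (7-3): map the evaluation value e_k(C) to context 0, 1, or 2."""
    if e < 3:
        return 0
    if e > 32:
        return 1
    return 2


if __name__ == "__main__":
    for a, b in [(1, -1), (2, -1), (16, 16), (-20, 13), (None, 5)]:
        e = evaluation(a, b)
        print(a, b, e, ctx_mvd(e))
```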

Context model generation for the motion vector information (MVD) is performed as shown in FIG. 8. The motion vector prediction error mvd_k(C) for macroblock C is separated into its absolute value |mvd_k(C)| and its sign, and the absolute value |mvd_k(C)| is binarized. The first bin (the leftmost value) of the binarized |mvd_k(C)| is coded using the context model ctx_mvd(C, k) described above. The second bin (the second value from the left) is coded using context model 3. Similarly, the third and fourth bins are coded using context models 4 and 5, respectively, and the fifth and subsequent bins are coded using context model 6. The sign of mvd_k(C) is coded using context model 7. The motion vector information (MVD) is thus coded using eight context models.
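The assignment of context models to the bins of |mvd_k(C)| can be sketched as a lookup. The function and constant names are hypothetical; the context numbers follow the text (0 to 2 for the first bin, 3 to 5 for the second to fourth bins, 6 for later bins, 7 for the sign).

```python
SIGN_CONTEXT = 7  # context model used for the sign of mvd_k(C)


def mvd_bin_contexts(num_bins, ctx_first):
    """Return the context model index for each bin of the binarized |mvd_k(C)|."""
    contexts = []
    for i in range(1, num_bins + 1):
        if i == 1:
            contexts.append(ctx_first)  # ctx_mvd(C, k), one of 0, 1, 2
        elif i <= 4:
            contexts.append(i + 1)      # bins 2, 3, 4 use models 3, 4, 5
        else:
            contexts.append(6)          # bins 5 and later share model 6
    return contexts


if __name__ == "__main__":
    print(mvd_bin_contexts(6, ctx_first=2), SIGN_CONTEXT)
```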

Next, the context models for coding the reference frame parameter (Ref_frame) are described.

When two or more reference frames are used for an inter frame, information on the reference frame is set for each macroblock of the inter frame. For macroblocks A, B, and C arranged as shown in FIG. 6, let A and B denote the reference frame parameters of macroblocks A and B, respectively. The context model ctx_ref_frame(C) for macroblock C is then defined by expression (8).
ctx_ref_frame(C) = ((A==0)?0:1) + 2((B==0)?0:1)   ... (8)

In expression (8), the operator ((A==0)?0:1) yields 0 when the reference frame parameter of macroblock A is 0 and 1 when it is not. Likewise, the operator ((B==0)?0:1) yields 0 when the reference frame parameter of macroblock B is 0 and 1 when it is not.

Expression (8) thus defines four context models for coding the reference frame parameter (Ref_frame). In addition, a context model for the second bin and a context model for the third and subsequent bins are defined.
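Expression (8) can be written as a small function; the weight of 2 on the second term is what yields four distinct context models. The function name is an assumption.

```python
def ctx_ref_frame(ref_a, ref_b):
    """Expression (8): ((A==0)?0:1) + 2*((B==0)?0:1)."""
    return (0 if ref_a == 0 else 1) + 2 * (0 if ref_b == 0 else 1)


if __name__ == "__main__":
    for a in (0, 1):
        for b in (0, 1):
            print(a, b, ctx_ref_frame(a, b))
```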

Next, the context models used to arithmetically code the coded block pattern (CBP), the intra prediction mode (IPRED), and the (RUN, LEVEL) information, which are syntax elements related to texture information contained in H.26L compressed image information, are described.

First, the context models for the coded block pattern are described. The handling of the coded block pattern for macroblocks other than Intra16×16 macroblocks is defined as follows.

For the luminance signal, the CBP contains four bits in total, one for each of the four 8×8 blocks contained in the Intra16×16 macroblock. When macroblocks A, B, and C are arranged as shown in FIG. 6, the context model ctx_cbp_luma(C) corresponding to the luminance signal of macroblock C is defined by expression (9).
ctx_cbp_luma(C) = A + 2B   ... (9)
In expression (9), A is the CBP bit of the luminance signal of macroblock A, and B is the CBP bit of the luminance signal of macroblock B.

CBPフィールドの残り２ビットは色差信号に関するものである。マクロブロックＣの色差信号に対応するコンテキストモデルctx_cbp_chroma_sig(C)は、次式（１０）によって定義される。
ctx_cbp_chroma_sig(C)＝Ａ＋２Ｂ
・・・（１０）
ただし、式（１０）において、Ａは、マクロブロックＡの色差信号のCBPビットであり、Ｂは、マクロブロックＢの色差信号のCBPビットである。 The remaining 2 bits of the CBP field relate to the color difference signal. The context model ctx_cbp_chroma_sig (C) corresponding to the color difference signal of the macroblock C is defined by the following equation (10).
ctx_cbp_chroma_sig (C) = A + 2B
... (10)
In Equation (10), A is the CBP bit of the color difference signal of the macroblock A, and B is the CBP bit of the color difference signal of the macroblock B.

ここで、マクロブロックＣの色差信号に対応するコンテキストモデルctx_cbp_chroma_sig(C)が０ではない場合、すなわち、色差信号のＡＣ成分が存在する場合、次式（１１）によって定義されるマクロブロックＣの色差信号のＡＣ成分に対応するコンテキストモデルctx_cbp_chroma_ac(C)が符号化される必要がある。
ctx_cbp_chroma_ac(C)＝Ａ＋２Ｂ
・・・（１１）ただし、式（１１）において、Ａは、マクロブロックＡに対応するcbp_chroma_ac decisionであり、Ｂは、マクロブロックＢに対応するcbp_chroma_ac decisionである。 Here, when the context model ctx_cbp_chroma_sig(C) corresponding to the color difference signal of macroblock C is not 0, that is, when an AC component of the color difference signal exists, the context model ctx_cbp_chroma_ac(C) corresponding to the AC component of the color difference signal of macroblock C, defined by the following equation (11), needs to be encoded.
ctx_cbp_chroma_ac(C) = A + 2B
... (11)
In Equation (11), A is the cbp_chroma_ac decision corresponding to macroblock A, and B is the cbp_chroma_ac decision corresponding to macroblock B.

式（９）乃至（１１）によって定義されるコンテキストモデルは、イントラマクロブロックとインターマクロブロックのそれぞれに対して別個に定義されるので、２４（＝２×３×４）種類のコンテキストモデルが定義されることになる。 Since the context models defined by Equations (9) to (11) are defined separately for intra macroblocks and inter macroblocks, a total of 24 (= 2 × 3 × 4) types of context models are defined.
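The count of 24 can be checked with a short sketch. Equations (9) to (11) share the form A + 2B, so one helper covers all three. The helper name and the string labels are illustrative, and each neighbor value is assumed here to be a single binary decision (0 or 1).

```python
from itertools import product


def ctx_cbp(bit_a: int, bit_b: int) -> int:
    """Shared form of equations (9) to (11): A + 2B with binary A and B."""
    return bit_a + 2 * bit_b


MB_CLASSES = ("intra", "inter")                              # 2 macroblock classes
ELEMENTS = ("cbp_luma", "cbp_chroma_sig", "cbp_chroma_ac")   # 3 equations

# Each (class, element) pair has 4 context indices, giving 2 x 3 x 4 models.
models = {
    (mb, elem, ctx_cbp(a, b))
    for mb, elem, a, b in product(MB_CLASSES, ELEMENTS, (0, 1), (0, 1))
}
```

Here `len(models)` evaluates to 24, matching the 2 × 3 × 4 breakdown in the text.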

さらに、Intra１６×１６マクロブロックに対しては、２値化されたAC decisionに対して１種類のコンテキストモデルが定義され、色差信号の各成分それぞれに対して１種のコンテキストモデルが定義されている。 Furthermore, for an Intra 16 × 16 macroblock, one type of context model is defined for the binarized AC decision, and one type of context model is defined for each component of the color difference signal.

次に、イントラ予測モード(IPRED)に関するコンテキストモデルについて説明する。ここで、Ｈ．２６Ｌにおいて定義されている６種類（ラベル０乃至５）のイントラ予測モードについて、図９および図１０を参照して説明する。図９は、マクロブロックを分割した４×４ブロックに存在する画素ａ乃至ｐと、隣接する各４×４ブロック内に存在する画素Ａ乃至Ｉを示している。図１０のラベル１乃至５は、それぞれラベル１乃至５のイントラ予測モードの方向を示している。ラベル０のイントラ予測モードは、ＤＣ予測モード(DC Prediction)である。 Next, the context models related to the intra prediction mode (IPRED) will be described. First, the six types (labels 0 to 5) of intra prediction modes defined in H.26L will be described with reference to FIGS. 9 and 10. FIG. 9 shows pixels a to p existing in a 4 × 4 block obtained by dividing a macroblock, and pixels A to I existing in the adjacent 4 × 4 blocks. Labels 1 to 5 in FIG. 10 indicate the directions of the intra prediction modes of labels 1 to 5, respectively. The intra prediction mode of label 0 is the DC prediction mode (DC Prediction).

ラベル０のイントラ予測モードにおいては、画素ａ乃至ｐが次式（１２）に従って予測される。
画素ａ乃至ｐ＝（Ａ＋Ｂ＋Ｃ＋Ｄ＋Ｅ＋Ｆ＋Ｇ＋Ｈ）//８
・・・（１２）ただし、式（１２）乃至次式（１５）において、Ａ乃至Ｉは、それぞれ画素Ａ乃至Ｉを示しており、記号”//”は、除算した結果を丸め込む演算を意味している。 In the intra prediction mode labeled 0, pixels a to p are predicted according to the following equation (12).
Pixels a to p = (A + B + C + D + E + F + G + H) // 8
... (12)
In Equations (12) to (15), A to I denote the pixels A to I, respectively, and the symbol "//" denotes division followed by rounding of the result.

なお、ラベル０のイントラ予測モードにおいて、８画素Ａ乃至Ｈのうち、４画素（例えば、画素Ａ乃至Ｄ）が画枠内に存在しない場合、式（１２）は用いられず、残りの４画素（いまの場合、画素Ｅ乃至Ｈ）の平均値が、画素ａ乃至ｐの予測値とされる。また、８画素Ａ乃至Ｈの全てが画枠内に存在しない場合も、式（１２）は用いられず、所定の値（例えば、１２８）が画素ａ乃至ｐの予測値とされる。 In the intra prediction mode of label 0, when four of the eight pixels A to H (for example, pixels A to D) do not exist within the image frame, Equation (12) is not used, and the average value of the remaining four pixels (in this case, pixels E to H) is used as the predicted value of pixels a to p. When none of the eight pixels A to H exists within the image frame, Equation (12) is likewise not used, and a predetermined value (for example, 128) is used as the predicted value of pixels a to p.
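The DC prediction rule and its two fallbacks can be sketched as follows. This is an illustration under assumptions: the function name is hypothetical, pixels A to D and E to H are passed as two groups that are either fully available or fully absent, and the rounding operator "//" of the text is interpreted as rounding to the nearest integer with halves rounded up, which the source does not spell out.

```python
from typing import Optional, Sequence


def dc_predict(
    pixels_a_to_d: Optional[Sequence[int]],
    pixels_e_to_h: Optional[Sequence[int]],
    default: int = 128,
) -> int:
    """Predicted value shared by pixels a to p in the label 0 (DC) mode.

    Pass None for a group of four pixels that lies outside the image frame.
    """
    available = [
        p
        for group in (pixels_a_to_d, pixels_e_to_h)
        if group is not None
        for p in group
    ]
    if not available:
        return default  # none of A to H is inside the image frame
    n = len(available)
    # Rounded division (assumed: round half up), n is 8 or 4.
    return (sum(available) + n // 2) // n
```

With both groups present this reduces to equation (12). With one group missing it averages the remaining four pixels.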

ラベル１のイントラ予測モードは、Vertical/Diagonal Predictionと称される。ラベル１のイントラ予測モードは、４画素Ａ乃至Ｄが画枠内に存在する場合にだけ用いられる。この場合、画素ａ乃至ｐのそれぞれが、次式（１３−１）乃至（１３−６）に従って予測される。
画素ａ
＝（Ａ＋Ｂ）//２
・・・（１３−１）
画素ｅ
＝Ｂ
・・・（１３−２）
画素ｂ，ｉ＝（Ｂ＋Ｃ）//２
・・・（１３−３）
画素ｆ，ｍ＝Ｃ
・・・（１３−４）
画素ｃ，ｊ＝（Ｃ＋Ｄ）//２
・・・（１３−５）
画素ｄ，ｇ，ｈ，ｋ，ｌ，ｎ，ｏ，ｐ
＝Ｄ
・・・（１３−６） The intra prediction mode of label 1 is called Vertical / Diagonal Prediction. The intra prediction mode of label 1 is used only when 4 pixels A to D are present in the image frame. In this case, each of the pixels a to p is predicted according to the following equations (13-1) to (13-6).
Pixel a = (A + B) // 2
... (13-1)
Pixel e = B
... (13-2)
Pixels b, i = (B + C) // 2
... (13-3)
Pixels f, m = C
... (13-4)
Pixels c, j = (C + D) // 2
... (13-5)
Pixels d, g, h, k, l, n, o, p = D
... (13-6)
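Equations (13-1) to (13-6) can be collected into a single 4 × 4 predictor, sketched below. The row-major layout (a b c d / e f g h / i j k l / m n o p) is inferred from the Vertical and Horizontal Prediction descriptions, the helper names are illustrative, and the rounding operator "//" is assumed to round halves up.

```python
def rdiv(num: int, den: int) -> int:
    """Division with rounding (assumed: round half up) for the "//" operator."""
    return (num + den // 2) // den


def predict_vertical_diagonal(A: int, B: int, C: int, D: int) -> list:
    """4x4 prediction block for the label 1 mode, rows ordered a-d, e-h, i-l, m-p."""
    a = rdiv(A + B, 2)     # (13-1)
    b_i = rdiv(B + C, 2)   # (13-3)
    c_j = rdiv(C + D, 2)   # (13-5)
    return [
        [a,   b_i, c_j, D],  # a b c d
        [B,   C,   D,   D],  # e f g h: e = B (13-2), f = C (13-4)
        [b_i, c_j, D,   D],  # i j k l
        [C,   D,   D,   D],  # m n o p: m = C (13-4)
    ]
```

Every position not named in (13-1) to (13-5) takes the value D, as stated in (13-6).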

ラベル２のイントラ予測モードは、Vertical Predictionと称される。ラベル２のイントラ予測モードは、４画素Ａ乃至Ｄが画枠内に存在する場合にだけ用いられる。この場合、例えば、画素ａ，ｅ，ｉ，ｍの予測値として画素Ａが用いられ、画素ｂ，ｆ，ｊ，ｎの予測値として画素Ｂが用いられる。 The intra prediction mode of label 2 is called Vertical Prediction. The intra prediction mode of label 2 is used only when 4 pixels A to D are present in the image frame. In this case, for example, the pixel A is used as the predicted value of the pixels a, e, i, and m, and the pixel B is used as the predicted value of the pixels b, f, j, and n.

ラベル３のイントラ予測モードは、Diagonal Predictionと称される。ラベル３のイントラ予測モードは、９画素Ａ乃至Ｉが画枠内に存在する場合にだけ用いられる。この場合、画素ａ乃至ｐのそれぞれが、次式（１４−１）乃至（１４−７）に従って予測される。
画素ｍ
＝（Ｈ＋２Ｇ＋Ｆ）//４
・・・（１４−１）
画素ｉ，ｎ
＝（Ｇ＋２Ｆ＋Ｅ）//４
・・・（１４−２）
画素ｅ，ｊ，ｏ
＝（Ｆ＋２Ｅ＋Ｉ）//４
・・・（１４−３）
画素ａ，ｆ，ｋ，ｐ＝（Ｅ＋２Ｉ＋Ａ）//４
・・・（１４−４）
画素ｂ，ｇ，ｌ
＝（Ｉ＋２Ａ＋Ｂ）//４
・・・（１４−５）
画素ｃ，ｈ
＝（Ａ＋２Ｂ＋Ｃ）//４
・・・（１４−６）
画素ｄ
＝（Ｂ＋２Ｃ＋Ｄ）//４
・・・（１４−７） The intra prediction mode of label 3 is called Diagonal Prediction. The intra prediction mode of label 3 is used only when the nine pixels A to I exist within the image frame. In this case, each of the pixels a to p is predicted according to the following equations (14-1) to (14-7).
Pixel m = (H + 2G + F) // 4
... (14-1)
Pixels i, n = (G + 2F + E) // 4
... (14-2)
Pixels e, j, o = (F + 2E + I) // 4
... (14-3)
Pixels a, f, k, p = (E + 2I + A) // 4
... (14-4)
Pixels b, g, l = (I + 2A + B) // 4
... (14-5)
Pixels c, h = (A + 2B + C) // 4
... (14-6)
Pixel d = (B + 2C + D) // 4
... (14-7)
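The seven diagonal equations share one three-tap pattern applied along the boundary sequence H, G, F, E, I, A, B, C, D, so they can be sketched with a single helper. The function names, the row-major pixel layout, and the reading of "//" as rounding halves up are assumptions made for illustration.

```python
def rdiv(num: int, den: int) -> int:
    """Division with rounding (assumed: round half up) for the "//" operator."""
    return (num + den // 2) // den


def predict_diagonal(top, left, corner):
    """4x4 prediction block for the label 3 mode.

    top = (A, B, C, D), left = (E, F, G, H), corner = I.
    """
    A, B, C, D = top
    E, F, G, H = left
    edge = [H, G, F, E, corner, A, B, C, D]

    def tap(diag: int) -> int:
        # diag = column - row; diag = -3 is pixel m, diag = 3 is pixel d.
        c = diag + 4
        return rdiv(edge[c - 1] + 2 * edge[c] + edge[c + 1], 4)

    return [[tap(col - row) for col in range(4)] for row in range(4)]
```

All pixels on the same diagonal receive the same value, which matches the pixel groupings in equations (14-1) to (14-7).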

ラベル４のイントラ予測モードは、Horizontal Predictionと称される。ラベル４のイントラ予測モードは、４画素Ｅ乃至Ｈが画枠内に存在する場合にだけ用いられる。この場合、例えば、画素ａ，ｂ，ｃ，ｄの予測値として画素Ｅが用いられ、画素ｅ，ｆ，ｇ，ｈの予測値として画素Ｆが用いられる。 The intra prediction mode of label 4 is called Horizontal Prediction. The intra prediction mode of label 4 is used only when 4 pixels E to H are present in the image frame. In this case, for example, the pixel E is used as the predicted value of the pixels a, b, c, and d, and the pixel F is used as the predicted value of the pixels e, f, g, and h.

ラベル５のイントラ予測モードは、Horizontal/Diagonal Predictionと称される。ラベル５のイントラ予測モードは、４画素Ｅ乃至Ｈが画枠内に存在する場合にだけ用いられる。この場合、画素ａ乃至ｐのそれぞれが、次式（１５−１）乃至（１５−６）に従って予測される。
画素ａ
＝（Ｅ＋Ｆ）//２
・・・（１５−１）
画素ｂ
＝Ｆ
・・・（１５−２）
画素ｃ，ｅ＝（Ｆ＋Ｇ）//２
・・・（１５−３）
画素ｆ，ｄ＝Ｇ
・・・（１５−４）
画素ｉ，ｇ＝（Ｇ＋Ｈ）//２
・・・（１５−５）
画素ｈ，ｊ，ｋ，ｌ，ｍ，ｎ，ｏ，ｐ
＝Ｈ
・・・（１５−６） The intra prediction mode labeled 5 is called Horizontal / Diagonal Prediction. The intra prediction mode of label 5 is used only when 4 pixels E to H are present in the image frame. In this case, each of the pixels a to p is predicted according to the following equations (15-1) to (15-6).
Pixel a = (E + F) // 2
... (15-1)
Pixel b = F
... (15-2)
Pixels c, e = (F + G) // 2
... (15-3)
Pixels f, d = G
... (15-4)
Pixels i, g = (G + H) // 2
... (15-5)
Pixels h, j, k, l, m, n, o, p = H
... (15-6)

ラベル０乃至５のイントラ予測モードに対しては、それぞれ２つのコンテキストモデルが定義されている。すなわち、１つは、それぞれのモードに対する第１のｂｉｎであり、もう１つは、それぞれのモードに対する第２のｂｉｎである。これらに加え、Intra１６×１６モードの２ビットに対して１つずつコンテキストモデルが定義されている。したがって、イントラ予測モードに対しては、合計１４のコンテキストモデルが定義されている。 Two context models are defined for each of the intra prediction modes of labels 0 to 5: one for the first bin of each mode and one for the second bin of each mode. In addition, one context model is defined for each of the two bits of the Intra 16 × 16 mode. Therefore, a total of 14 context models are defined for the intra prediction mode.

次に、(RUN,LEVEL)に関するコンテキストモデルについて説明する。 Next, a context model relating to (RUN, LEVEL) will be described.

Ｈ．２６Ｌにおいては、２次元離散コサイン変換係数を１次元に並べ替えるスキャン方式として、図１１Ａ，Ｂに示す２種類の方法が定義されている。図１１Ａに示すシングルスキャン方式は、イントラマクロブロックに対する輝度信号であって、かつ、量子化パラメータＱＰが２４よりも小さい場合以外に用いられる方式である。図１１Ｂに示すダブルスキャン方式は、シングルスキャン方式が用いられない場合に用いられる。 In H.26L, two methods, shown in FIGS. 11A and 11B, are defined as scan methods for rearranging two-dimensional discrete cosine transform coefficients into one dimension. The single scan method shown in FIG. 11A is used in all cases except for the luminance signal of an intra macroblock whose quantization parameter QP is smaller than 24. The double scan method shown in FIG. 11B is used when the single scan method is not used.

インターマクロブロックおよび量子化パラメータＱＰが２４以上であるイントラマクロブロックでは、平均して４×４マクロブロックに対する非零係数は１つであり、１ビットのEOB（End Of Block）信号で十分であるが、量子化パラメータＱＰが２４よりも小さいイントラマクロブロックの輝度信号に関しては、２つ以上の非零係数が存在するため、１ビットのEOB信号では不十分である。このため、図１１Ｂに示すダブルスキャン方式が用いられる。 In inter macroblocks and in intra macroblocks whose quantization parameter QP is 24 or more, there is on average one non-zero coefficient per 4 × 4 macroblock, so a 1-bit EOB (End Of Block) signal is sufficient. For the luminance signal of an intra macroblock whose quantization parameter QP is smaller than 24, however, two or more non-zero coefficients exist, so a 1-bit EOB signal is not sufficient. For this reason, the double scan method shown in FIG. 11B is used.

(RUN,LEVEL)に対するコンテキストモデルは、図１２に示すように、上述したスキャン方式の区別、ＤＣ／ＡＣブロックタイプの区別、輝度信号／色差信号の区別、イントラマクロブロック／インターマクロブロックの区別に応じて９種類が定義されている。 As shown in FIG. 12, the context model for (RUN, LEVEL) includes the above-described scan method distinction, DC / AC block type distinction, luminance signal / chrominance signal distinction, and intra macroblock / intermacroblock distinction. Nine types are defined accordingly.

LEVEL情報は符号と絶対値に分離される。図１２に示した対応するCtx_run_levelに応じて、４つのコンテキストモデルが定義される。すなわち、第１のコンテキストモデルは符号に対してのものであり、第２のコンテキストモデルは第１のｂｉｎに対してのものであり、第３のコンテキストモデルは第２のｂｉｎに対してのものであり、第４のコンテキストモデルはそれ以降のｂｉｎに対して定義されたものである。 LEVEL information is separated into a sign and an absolute value. Four context models are defined according to the corresponding Ctx_run_level shown in FIG. 12. That is, the first context model is for the sign, the second context model is for the first bin, the third context model is for the second bin, and the fourth context model is defined for the subsequent bins.

LEVELが０ではない場合（EOBでない場合）には、以下に述べるRUNが符号化される。RUNに対してであるが、図１２に示された、それぞれのCtx_run_levelに対して、第1のｂｉｎと第２以降のｂｉｎについて、それぞれ２つずつのコンテキストモデルが定義されている。 When LEVEL is not 0 (not EOB), RUN described below is encoded. For RUN, two context models are defined for each of the first bin and the second and subsequent bins for each Ctx_run_level shown in FIG.

Ｈ．２６Ｌの画像圧縮情報において、マクロブロックレベルで設定され得る、量子化に関するパラメータDquantに対するコンテキストモデルについて説明する。 The context model for Dquant, a quantization-related parameter that can be set at the macroblock level in H.26L image compression information, will be described.

パラメータDquantは、マクロブロックに対するコードブロックパターンが、非零の直交変換係数を含む場合、またはマクロブロックが１６×１６Intra Codedである場合に設定される。パラメータDquantは、−１６乃至１６の値を取り得る。マクロブロックに対する量子化パラメータQUANT_newは、画像圧縮情報中のパラメータDquantを用いた次式（１６）によって算出される。
QUANT_new＝modulo₃₂（QUANT_old＋Dquant＋３２）
・・・（１６）ただし、式（１６）において、QUANT_oldは、直前の符号化または復号に用いられた量子化パラメータである。 The parameter Dquant is set when the code block pattern for the macroblock includes a non-zero orthogonal transform coefficient, or when the macroblock is 16 × 16 Intra Coded. The parameter Dquant can take a value from -16 to 16. The quantization parameter QUANT_new for the macroblock is calculated by the following equation (16) using the parameter Dquant in the image compression information.
QUANT_new = modulo₃₂(QUANT_old + Dquant + 32)
... (16)
In Equation (16), QUANT_old is the quantization parameter used for the immediately preceding encoding or decoding.
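Equation (16) can be sketched as a small update function. The function name is illustrative. The range check reflects the stated Dquant range of -16 to 16, and the added 32 keeps the operand of the modulo non-negative.

```python
def update_quant(quant_old: int, dquant: int) -> int:
    """QUANT_new per equation (16): modulo 32 of (QUANT_old + Dquant + 32)."""
    if not -16 <= dquant <= 16:
        raise ValueError("Dquant is expected to lie in the range -16 to 16")
    return (quant_old + dquant + 32) % 32
```

For example, a previous parameter of 30 with Dquant = 5 wraps around to 3, and a previous parameter of 2 with Dquant = -5 wraps around to 29.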

図６に示すように配置されたマクロブロックＣのパラメータDquantに対する第１のコンテキストモデルctx_dquant(C)は、次式（１７）のように定義される。
ctx_dquant(C)＝（Ａ！＝０）
・・・（１７）ただし、式（１７）において、Ａは、マクロブロックＡのパラメータDquantの値を示している。第１のｂｉｎに対しては第２のコンテキストモデルが、第２以降のｂｉｎに対しては第３のコンテキストモデルが定義されている。 The first context model ctx_dquant(C) for the parameter Dquant of macroblock C arranged as shown in FIG. 6 is defined by the following equation (17).
ctx_dquant(C) = (A != 0)
... (17)
In Equation (17), A denotes the value of the parameter Dquant of macroblock A. A second context model is defined for the first bin, and a third context model is defined for the second and subsequent bins.

以上説明した様々なコンテキストモデルに対し、入力となるシンボルが２値化されていない場合には、そのシンボルを入力前に２値化する必要がある。MB_type以外のシンタクス要素は、図１３に示す対応関係によって２値化される。 For the various context models described above, when the input symbol is not binarized, it is necessary to binarize the symbol before input. Syntax elements other than MB_type are binarized according to the correspondence shown in FIG.

Ｐピクチャに対して１０種類定義されているMB_typeは、図１４Ａに示す対応関係によって２値化される。また、Ｂピクチャに対して１７種類定義されているMB_typeは、図１４Ｂに示す対応関係によって２値化される。 The 10 types of MB_type defined for P pictures are binarized according to the correspondence shown in FIG. 14A. The 17 types of MB_type defined for B pictures are binarized according to the correspondence shown in FIG. 14B.

以上説明した様々なコンテキストモデルに対応するレジスタは、事前に計算された値によって予め初期化されており、各シンボルを符号化する際、一連のコンテキストモデルに対するｂｉｎの発生頻度が逐次更新され、次のシンボルの符号化を行う際の判定に用いられる。 The registers corresponding to the various context models described above are initialized in advance with values calculated in advance, and when encoding each symbol, the occurrence frequency of bins for a series of context models is sequentially updated. This is used for the determination when the symbols are encoded.

しかしながら、与えられたコンテキストモデルに対する発生頻度が予め定められた値を超えた場合には、頻度カウンタは縮小処理が行われる。このように周期的にスケーリング処理を行うことで、動的なシンボルの発生に対応することを容易なものとしている。 However, when the occurrence frequency for a given context model exceeds a predetermined value, the frequency counter is reduced. By periodically performing scaling processing in this way, it is easy to cope with the dynamic generation of symbols.
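The counter update and periodic scaling described above can be illustrated with a small sketch. The class name, the initial counts, the limit of 1024, and halving as the reduction step are all assumptions. The source states only that a counter is reduced when the occurrence frequency exceeds a predetermined value.

```python
class BinFrequency:
    """Occurrence counts of bin values 0 and 1 for one context model (sketch)."""

    def __init__(self, count0: int = 1, count1: int = 1, limit: int = 1024):
        self.counts = [count0, count1]  # assumed initial values
        self.limit = limit              # assumed predetermined threshold

    def update(self, bin_value: int) -> None:
        """Record one coded bin and rescale when the total exceeds the limit."""
        self.counts[bin_value] += 1
        if sum(self.counts) > self.limit:
            # Assumed reduction: halve both counts, keeping each at least 1.
            self.counts = [max(1, c // 2) for c in self.counts]

    def probability(self, bin_value: int) -> float:
        """Estimated probability of bin_value from the current counts."""
        return self.counts[bin_value] / sum(self.counts)
```

Because rescaling shrinks the weight of older observations, later bins influence the estimate more quickly, which is one way to read the remark about following dynamic symbol statistics.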

Ｈ．２６Ｌにおいて、２値化されたシンボルの算術符号化方式については、現在のところ、非特許文献２に開示されている方法が適用されている。 In H.26L, the method disclosed in Non-Patent Document 2 is currently applied as the arithmetic coding method for binarized symbols.

ところで、MPEG２においては、入力となる画像信号が飛び越し走査フォーマットであった場合、マクロブロックレベルでフィールド／フレーム適応型符号化処理が可能とされている。 By the way, in MPEG2, when an input image signal is an interlaced scanning format, field / frame adaptive encoding processing can be performed at a macroblock level.

現在、Ｈ．２６Ｌにはそのような仕様は定義されていないが、非特許文献３には、Ｈ．２６Ｌの仕様を、マクロブロックレベルでフィールド／フレーム適応型符号化処理を可能とするように拡張することが提案されている。 Currently, no such specification is defined in H.26L, but Non-Patent Document 3 proposes extending the H.26L specification to enable field/frame adaptive encoding at the macroblock level.

非特許文献３に提案されている、マクロブロックレベルでフィールド／フレーム適応型符号化処理について説明する。 A field / frame adaptive encoding process at the macroblock level proposed in Non-Patent Document 3 will be described.

現在のＨ．２６Ｌにおいては、マクロブロックにおける動き予測・補償の単位として、図１５に示すような７種類のモード（mode１乃至７）が定義されている。 In the current H.26L, seven types of modes (modes 1 to 7), as shown in FIG. 15, are defined as units of motion prediction/compensation in a macroblock.

非特許文献３においては、画像圧縮情報のマクロブロックに対応するシンタクスとして、図１６に示すように、RunとMB_typeの間にFrame/Field Flagを持つことが提案されている。Frame/Field Flagの値が０である場合、当該マクロブロックはフレームベースの符号化が施されることを示し、Frame/Field Flagの値が１である場合、フィールドベースの符号化が施されることを示している。 Non-Patent Document 3 proposes having a Frame / Field Flag between Run and MB_type as syntax corresponding to a macroblock of image compression information, as shown in FIG. When the value of Frame / Field Flag is 0, this indicates that the macroblock is subjected to frame-based encoding. When the value of Frame / Field Flag is 1, field-based encoding is performed. It is shown that.

Frame/Field Flagの値が１である場合（すなわち、フィールドベースの符号化が施される場合）、マクロブロック内の画素は、図１７に示すように行単位で画素の並べ替えが行われる。 When the value of Frame / Field Flag is 1 (that is, when field-based encoding is performed), the pixels in the macroblock are rearranged in units of rows as shown in FIG.
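A minimal sketch of the row-wise rearrangement follows. FIG. 17 is not reproduced in the text, so the ordering used here (even-numbered rows, taken as the first field, moved to the upper half and odd-numbered rows to the lower half) is an assumption, as is the function name.

```python
def to_field_order(mb_rows: list) -> list:
    """Reorder the rows of a macroblock so that each field is contiguous.

    mb_rows: list of pixel rows in frame (interleaved) order.
    Assumed layout: first-field rows first, then second-field rows.
    """
    return mb_rows[0::2] + mb_rows[1::2]


# Example: a 16x16 macroblock whose pixels carry their original row index.
macroblock = [[row] * 16 for row in range(16)]
field_ordered = to_field_order(macroblock)
```

After reordering, the upper eight rows all come from one field parity and the lower eight rows from the other.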

Frame/Field Flagの値が１である場合、マクロブロックにおける動き予測・補償の単位として、図１５のmode３乃至７に相当する、図１８に示す５種類のモード（mode１ａ乃至５ａ）が定義されている。 When the value of the Frame/Field Flag is 1, five types of modes (modes 1a to 5a) shown in FIG. 18, corresponding to modes 3 to 7 in FIG. 15, are defined as units of motion prediction/compensation in the macroblock.

例えば、図１８のmode２ａにおいて、マクロブロックを４分割した８×８ブロック０乃至３のうち、ブロック０，１は同一のフィールドパリティに属し、また、ブロック２，３は同一のフィールドパリティに属する。また例えば、図１８のmode３ａにおいて、マクロブロックを８分割した４×８ブロック０乃至７のうち、ブロック０乃至３は同一のフィールドパリティに属し、また、ブロック４乃至７は同一のフィールドパリティに属する。 For example, in mode 2a of FIG. 18, among the 8 × 8 blocks 0 to 3 obtained by dividing the macroblock into four, blocks 0 and 1 belong to the same field parity, and blocks 2 and 3 belong to the same field parity. Likewise, in mode 3a of FIG. 18, among the 4 × 8 blocks 0 to 7 obtained by dividing the macroblock into eight, blocks 0 to 3 belong to the same field parity, and blocks 4 to 7 belong to the same field parity.

Frame/Field Flagの値が１である場合のイントラ予測モードについて説明する。例えば、図９に示した４×４ブロックに位置する画素ａ乃至ｐは、Frame/Field Flagの値が１である場合においても、隣接する４×４ブロックに位置する画素Ａ乃至Ｉを用いてイントラ予測が行われるが、画素ａ乃至ｐ、および画素Ａ乃至Ｉが全て同一フィールドパリティに属していることが特徴である。 The intra prediction mode when the value of Frame / Field Flag is 1 will be described. For example, the pixels a to p located in the 4 × 4 block shown in FIG. 9 use the pixels A to I located in the adjacent 4 × 4 block even when the value of the Frame / Field Flag is 1. Intra prediction is performed, but the pixels a to p and the pixels A to I all belong to the same field parity.

画素Ａ乃至Ｉが、画素ａ乃至ｐと同一のマクロブロックに属している場合について、図１９を参照して説明する。マクロブロックを１６分割した４×４ブロック７に存在する画素ａ乃至ｐは、隣接するブロック２，３，６の端に存在する画素Ａ乃至Ｉを用いてイントラ予測が行われる。 A case where the pixels A to I belong to the same macroblock as the pixels a to p will be described with reference to FIG. The pixels a to p existing in the 4 × 4 block 7 obtained by dividing the macroblock into 16 are subjected to intra prediction using the pixels A to I existing at the ends of the adjacent blocks 2, 3, and 6.

画素Ａ乃至Ｉが、画素ａ乃至ｐとは異なるマクロブロックに属する場合について、図２０を参照して説明する。 A case where the pixels A to I belong to a different macroblock from the pixels a to p will be described with reference to FIG.

図２０Ａは、処理対象としているマクロブロックの左側のマクロブロックと、上側のマクロブロックに対するFrame/Field Flagの値がそれぞれ１である場合を示している。この場合、処理対象としているマクロブロックを１６分割した４×４ブロックＣに存在する画素のイントラ予測は、左側のマクロブロックを１６分割した４×４ブロックＡに存在する画素と、上側のマクロブロックを１６分割した４×４ブロックＢに存在する画素を用いて行われる。４×４ブロックＣ'に存在する画素のイントラ予測は、４×４ブロックＡ'に存在する画素と、４×４ブロックＢ'に存在する画素を用いて行われる。 FIG. 20A shows a case where the value of the Frame/Field Flag is 1 for both the macroblock to the left of the macroblock being processed and the macroblock above it. In this case, intra prediction of the pixels in the 4 × 4 block C, obtained by dividing the macroblock being processed into 16, is performed using the pixels in the 4 × 4 block A, obtained by dividing the left macroblock into 16, and the pixels in the 4 × 4 block B, obtained by dividing the upper macroblock into 16. Intra prediction of the pixels in the 4 × 4 block C' is performed using the pixels in the 4 × 4 block A' and the pixels in the 4 × 4 block B'.

図２０Ｂは、処理対象としているマクロブロックに対するFrame/Field Flagの値が１であり、その左側および上側のマクロブロックに対するFrame/Field Flagの値がそれぞれ０である場合を示している。この場合、処理対象としているマクロブロックを１６分割した４×４ブロックＣに存在する画素のイントラ予測は、左側のマクロブロックを１６分割した４×４ブロックＡに存在する画素と、上側のマクロブロックを１６分割した４×４ブロックＢに存在する画素を用いて行われる。４×４ブロックＣ'に存在する画素のイントラ予測は、４×４ブロックＡ'に存在する画素と、４×４ブロックＢに存在する画素を用いて行われる。 FIG. 20B shows a case where the value of Frame / Field Flag for the macroblock to be processed is 1, and the values of Frame / Field Flag for the left and upper macroblocks are 0, respectively. In this case, the intra prediction of the pixels existing in the 4 × 4 block C obtained by dividing the macroblock to be processed into 16 blocks is performed on the pixels existing in the 4 × 4 block A obtained by dividing the left macroblock into 16 blocks and the upper macroblock. Is performed using pixels existing in a 4 × 4 block B obtained by dividing 16 into 16 blocks. Intra prediction of pixels existing in the 4 × 4 block C ′ is performed using pixels existing in the 4 × 4 block A ′ and pixels existing in the 4 × 4 block B.

次に、色差信号のイントラ予測について、図２１を参照して説明する。Frame/Field Flagの値が１である場合、色差信号のイントラ予測モードは１種類だけが定義されている。 Next, intra prediction of color difference signals will be described with reference to FIG. When the value of Frame / Field Flag is 1, only one type of intra prediction mode for color difference signals is defined.

図２１において、Ａ乃至Ｄは、それぞれ色差信号の４×４ブロックを示す。ブロックＡ，Ｂは、第１フィールドに属し、ブロックＣ，Ｄは、第２フィールドに属する。ｓ₀乃至ｓ₂は、ブロックＡ乃至Ｄに隣接するブロックのうち、第１フィールドパリティに属するブロックに存在する色差信号の合計値である。ｓ₃乃至ｓ₅は、ブロックＡ乃至Ｄに隣接するブロックのうち、第２フィールドパリティに属するブロックに存在する色差信号の合計値である。 In FIG. 21, A to D each denote a 4 × 4 block of the color difference signal. Blocks A and B belong to the first field, and blocks C and D belong to the second field. s₀ to s₂ are the total values of the color difference signals in those blocks adjacent to blocks A to D that belong to the first field parity. s₃ to s₅ are the total values of the color difference signals in those blocks adjacent to blocks A to D that belong to the second field parity.

ブロックＡ乃至Ｄにそれぞれ対応する予測値Ａ乃至Ｄは、ｓ₀乃至ｓ₅が全て画枠内に存在する場合、次式（１８）に従って予測される。
Ａ＝（ｓ₀＋ｓ₂＋４）／８
Ｂ＝（ｓ₁＋２）／４
Ｃ＝（ｓ₃＋ｓ₅＋４）／８
Ｄ＝（ｓ₄＋２）／４
・・・（１８） The predicted values A to D corresponding to blocks A to D, respectively, are obtained according to the following equation (18) when all of s₀ to s₅ exist within the image frame.
A = (s₀ + s₂ + 4) / 8
B = (s₁ + 2) / 4
C = (s₃ + s₅ + 4) / 8
D = (s₄ + 2) / 4
... (18)

ただし、ｓ₀乃至ｓ₅のうち、ｓ₀，ｓ₁，ｓ₃，ｓ₄だけが画枠内に存在する場合、ブロックＡ乃至Ｄにそれぞれ対応する予測値Ａ乃至Ｄは、次式（１９）に従って予測される。
Ａ＝（ｓ₀＋２）／４
Ｂ＝（ｓ₁＋２）／４
Ｃ＝（ｓ₃＋２）／４
Ｄ＝（ｓ₄＋２）／４
・・・（１９） However, when only s₀, s₁, s₃, and s₄ among s₀ to s₅ exist within the image frame, the predicted values A to D corresponding to blocks A to D, respectively, are obtained according to the following equation (19).
A = (s₀ + 2) / 4
B = (s₁ + 2) / 4
C = (s₃ + 2) / 4
D = (s₄ + 2) / 4
... (19)

さらに、ｓ₀乃至ｓ₅のうち、ｓ₂，ｓ₅だけが画枠内に存在する場合、ブロックＡ乃至Ｄにそれぞれ対応する予測値は、次式（２０）に従って予測される。
Ａ＝（ｓ₂＋２）／４
Ｂ＝（ｓ₂＋２）／４
Ｃ＝（ｓ₅＋２）／４
Ｄ＝（ｓ₅＋２）／４
・・・（２０） Furthermore, when only s₂ and s₅ among s₀ to s₅ exist within the image frame, the predicted values corresponding to blocks A to D, respectively, are obtained according to the following equation (20).
A = (s₂ + 2) / 4
B = (s₂ + 2) / 4
C = (s₅ + 2) / 4
D = (s₅ + 2) / 4
... (20)
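The three availability cases of equations (18) to (20) can be sketched as one function. The function name and the list-based interface are illustrative, and the "/" in the equations is interpreted as integer division, with the added constants providing the rounding. The source does not state this explicitly.

```python
def predict_chroma_field(s: list) -> tuple:
    """Predicted values (A, B, C, D) from the sums s[0] to s[5].

    Use None for a sum whose source blocks lie outside the image frame.
    """
    available = {i for i, v in enumerate(s) if v is not None}
    if available == {0, 1, 2, 3, 4, 5}:   # equation (18)
        return ((s[0] + s[2] + 4) // 8, (s[1] + 2) // 4,
                (s[3] + s[5] + 4) // 8, (s[4] + 2) // 4)
    if available == {0, 1, 3, 4}:         # equation (19)
        return tuple((s[i] + 2) // 4 for i in (0, 1, 3, 4))
    if available == {2, 5}:               # equation (20)
        first = (s[2] + 2) // 4
        second = (s[5] + 2) // 4
        return (first, first, second, second)
    raise ValueError("availability pattern not covered by equations (18) to (20)")
```

Blocks A and B always draw on first-field sums and blocks C and D on second-field sums, so the two field parities are never mixed in a prediction.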

図２２は、上述したようにイントラ予測された後の色差信号の残差成分を符号化する方法を示している。すなわち、それぞれの４×４ブロックに対して直交変換処理を施した後、第１フィールドおよび第２フィールドの直流成分を用いて図示するような２×２ブロックが生成され、再び直交変換処理が施される。 FIG. 22 shows a method of encoding the residual component of the color difference signal after intra prediction as described above. That is, after orthogonal transform processing is performed on each 4 × 4 block, a 2 × 2 block as illustrated is generated using the DC components of the first field and second field, and orthogonal transform processing is performed again. Is done.

次に、Frame/Field Flagの値が１である場合の動き予測・補償処理について説明する。Frame/Field Flagの値が１である場合、動き予測補償モードとしては、インター１６×１６モード、インター８×１６モード、インター８×８モード、インター４×８モード、インター４×４モードの６種類のモードが存在する。 Next, motion prediction/compensation processing when the value of the Frame/Field Flag is 1 will be described. When the value of the Frame/Field Flag is 1, there are six types of motion prediction/compensation modes, including the inter 16 × 16 mode, inter 8 × 16 mode, inter 8 × 8 mode, inter 4 × 8 mode, and inter 4 × 4 mode.

例えば、インター１６×１６モードは、インター８×１６モードにおける第１フィールドに対する動きベクトル情報、第２フィールドに対する動きベクトル情報、および参照フレームが同等であるモードである。 For example, the inter 16 × 16 mode is a mode in which the motion vector information for the first field, the motion vector information for the second field, and the reference frame in the inter 8 × 16 mode are equivalent.

これら６種類の動き予測補償モードに対して、それぞれCode_Number０乃至５が割り当てられている。 Code_Number 0 to 5 are assigned to these six types of motion prediction compensation modes, respectively.

現在のＨ．２６Ｌにおいては、図２３に示すような、複数の参照フレームを設けることができるマルチプルフレーム予測が規定されている。現在のフレームベースのＨ．２６Ｌの規格において、参照フレームに関する情報は、マクロブロックレベルで定義されており、直前に符号化されたフレームに対し、Code_Number０が割り当てられており、その１乃至５回前に符号化されたフレームに対し、それぞれCode_Number１乃至５が割り当てられている。 The current H.26L specifies multiple frame prediction, in which a plurality of reference frames can be provided as shown in FIG. 23. In the current frame-based H.26L standard, information on the reference frame is defined at the macroblock level: Code_Number 0 is assigned to the frame encoded immediately before, and Code_Number 1 to 5 are assigned to the frames encoded one to five times before that frame, respectively.

これに対して、フィールドベース符号化を行う場合、直前に符号化されたフレームの第１フィールドに対してCode_Number０が割り当てられ、当該フレームの第２フィールドに対してCode_Number１が割り当てられる。その１回前に符号化されたフレームの第１フィールドに対してCode_Number２が割り当てられ、当該フレームの第２フィールドに対してCode_Number３が割り当てられる。さらに１回前に符号化されたフレームの第１フィールドに対してCode_Number４が割り当てられ、第２フィールドに対してCode_Number５が割り当てられる。 In contrast, when field-based encoding is performed, Code_Number 0 is assigned to the first field of the frame encoded immediately before, and Code_Number 1 is assigned to the second field of that frame. Code_Number 2 is assigned to the first field of the frame encoded one time before that, and Code_Number 3 to its second field. Code_Number 4 is assigned to the first field of the frame encoded one further time before, and Code_Number 5 to its second field.
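The field-based assignment listed above follows a regular pattern, sketched here. The function names and the (frames_back, field) representation are illustrative: frames_back = 0 stands for the frame encoded immediately before, and field is 0 for the first field and 1 for the second. The text enumerates only Code_Number 0 to 5.

```python
def field_code_number(frames_back: int, field: int) -> int:
    """Code_Number of a reference field in field-based encoding."""
    if field not in (0, 1):
        raise ValueError("field must be 0 (first) or 1 (second)")
    return 2 * frames_back + field


def field_from_code_number(code_number: int) -> tuple:
    """Inverse mapping: Code_Number -> (frames_back, field)."""
    return divmod(code_number, 2)
```

For instance, Code_Number 3 maps back to the second field of the frame encoded one time before the immediately preceding frame.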

また、フィールドベース符号化が行われるマクロブロックに対しては、第１フィールドに対する参照フィールドと、第２フィールドに対する参照フィールドが別個に規定される。 In addition, for a macroblock subjected to field-based coding, a reference field for the first field and a reference field for the second field are separately defined.

次に、Frame/Field Flagの値が１である場合の動きベクトル情報予測方式について説明するが、その前に、現在のＨ．２６Ｌにおいて規定されているメディアン予測について、図２４を参照して説明する。図２４に示す１６×１６マクロブロックＥに対応する１６×１６、８×８、または４×４動きベクトル情報は、隣接するマクロブロックＡ乃至Ｃの動きベクトル情報のメディアンを用いて予測される。 Next, the motion vector information prediction method used when the value of the Frame/Field Flag is 1 will be described. Before that, median prediction as specified in the current H.26L will be described with reference to FIG. 24. The 16 × 16, 8 × 8, or 4 × 4 motion vector information corresponding to the 16 × 16 macroblock E shown in FIG. 24 is predicted using the median of the motion vector information of the adjacent macroblocks A to C.

ただし、マクロブロックＡ乃至Ｃのうち、画枠内に存在しないものについては、対応する動きベクトル情報の値は０であるとしてメディアンを算出する。例えば、マクロブロックＤ，Ｂ，Ｃが画枠内に存在しない場合、予測値としてマクロブロックＡに対応する動きベクトル情報を用いる。また、マクロブロックＣが画枠内に存在しない場合、その代わりにマクロブロックＤの動きベクトル情報を用いてメディアンを算出する。 However, for macroblocks A to C that do not exist within the image frame, the median is calculated assuming that the value of the corresponding motion vector information is zero. For example, when the macroblocks D, B, and C do not exist in the image frame, motion vector information corresponding to the macroblock A is used as a predicted value. When the macro block C does not exist in the image frame, the median is calculated using the motion vector information of the macro block D instead.
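The median rule and its boundary cases can be sketched as follows. The function names are illustrative, motion vectors are represented as (x, y) tuples with None for a macroblock outside the image frame, and the median is taken per component. Returning (0, 0) when macroblock A is also missing in the all-absent case is an assumption, since the source does not cover that case.

```python
from typing import Optional, Tuple

MV = Tuple[int, int]


def median3(x: int, y: int, z: int) -> int:
    """Middle value of three integers."""
    return sorted((x, y, z))[1]


def predict_mv(a: Optional[MV], b: Optional[MV],
               c: Optional[MV], d: Optional[MV]) -> MV:
    """Median prediction for macroblock E from neighbors A, B, C (D as fallback)."""
    if b is None and c is None and d is None:
        # B, C, and D are all outside the frame: use A directly.
        return a if a is not None else (0, 0)  # (0, 0) fallback is an assumption
    if c is None:
        c = d  # C is outside the frame: substitute D
    # Any neighbor still missing contributes a zero vector.
    cands = [mv if mv is not None else (0, 0) for mv in (a, b, c)]
    return (median3(cands[0][0], cands[1][0], cands[2][0]),
            median3(cands[0][1], cands[1][1], cands[2][1]))
```

The order of the checks matters: the "use A directly" case is tested before any zero substitution, so a lone neighbor A is not pulled toward zero by the median.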

なお、マクロブロックＡ乃至Ｄの参照フレームは必ずしも同一でなくてもよい。 Note that the reference frames of the macroblocks A to D are not necessarily the same.

次に、マクロブロックのブロックサイズが、８×１６、１６×８、８×４、または４×８である場合について、図２５を参照して説明する。なお、注目するマクロブロックＥとこれに隣接するマクロブロックＡ乃至Ｄは、図２４に示すように配置されているとする。 Next, the case where the block size of the macroblock is 8 × 16, 16 × 8, 8 × 4, or 4 × 8 will be described with reference to FIG. Note that the macro block E of interest and the macro blocks A to D adjacent thereto are arranged as shown in FIG.

図２５Ａは、マクロブロックＥ１，Ｅ２のブロックサイズが８×１６である場合を示している。左側のマクロブロックＥ１に関しては、左に隣接するマクロブロックＡがマクロブロックＥ１と同じフレームを参照している場合、マクロブロックＡの動きベクトル情報が予測値として用いられる。左に隣接するマクロブロックＡがマクロブロックＥ１と異なるフレームを参照している場合、上述したメディアン予測が適用される。 FIG. 25A shows a case where the block sizes of the macroblocks E1 and E2 are 8 × 16. Regarding the left macroblock E1, when the macroblock A adjacent to the left refers to the same frame as the macroblock E1, the motion vector information of the macroblock A is used as a prediction value. When the macroblock A adjacent to the left refers to a frame different from the macroblock E1, the median prediction described above is applied.

右側のマクロブロックＥ２に関しては、右上に隣接するマクロブロックＣがマクロブロックＥ２と同じフレームを参照している場合、マクロブロックＣの動きベクトル情報が予測値として用いられる。右上に隣接するマクロブロックＣがマクロブロックＥ２と異なるフレームを参照している場合、上述したメディアン予測が適用される。 Regarding the right macroblock E2, when the macroblock C adjacent to the upper right refers to the same frame as the macroblock E2, the motion vector information of the macroblock C is used as a predicted value. When the macroblock C adjacent to the upper right refers to a frame different from the macroblock E2, the median prediction described above is applied.

図２５Ｂは、マクロブロックＥ１，Ｅ２のブロックサイズが１６×８である場合を示している。上側のマクロブロックＥ１に関しては、上に隣接するマクロブロックＢがマクロブロックＥ１と同じフレームを参照している場合、マクロブロックＢの動きベクトル情報が予測値として用いられる。上に隣接するマクロブロックＢがマクロブロックＥ１と異なるフレームを参照している場合、上述したメディアン予測が適用される。 FIG. 25B shows a case where the block sizes of the macroblocks E1 and E2 are 16 × 8. As for the upper macroblock E1, when the macroblock B adjacent on the upper side refers to the same frame as the macroblock E1, the motion vector information of the macroblock B is used as a predicted value. When the macroblock B adjacent on the top refers to a frame different from the macroblock E1, the median prediction described above is applied.

下側のマクロブロックＥ２に関しては、左に隣接するマクロブロックＡがマクロブロックＥ２と同じフレームを参照している場合、マクロブロックＡの動きベクトル情報が予測値として用いられる。左に隣接するマクロブロックＡがマクロブロックＥ２と異なるフレームを参照している場合、上述したメディアン予測が適用される。 Regarding the lower macroblock E2, when the macroblock A adjacent to the left refers to the same frame as the macroblock E2, the motion vector information of the macroblock A is used as a predicted value. When the macroblock A adjacent to the left refers to a frame different from the macroblock E2, the median prediction described above is applied.

図２５Ｃは、マクロブロックＥ１乃至Ｅ８のブロックサイズが８×４である場合を示している。左側のマクロブロックＥ１乃至Ｅ４に対しては、上述したメディアン予測が適用され、右側のマクロブロックＥ５乃至Ｅ８に対しては、左側のマクロブロックＥ１乃至Ｅ４の動きベクトル情報が予測値として用いられる。 FIG. 25C shows a case where the block sizes of the macroblocks E1 to E8 are 8 × 4. The median prediction described above is applied to the left macroblocks E1 to E4, and the motion vector information of the left macroblocks E1 to E4 is used as a prediction value for the right macroblocks E5 to E8.

図２５Ｄは、マクロブロックＥ１乃至Ｅ８のブロックサイズが４×８である場合を示している。上側のマクロブロックＥ１乃至Ｅ４に対しては、上述したメディアン予測が適用され、下側のマクロブロックＥ５乃至Ｅ８に対しては、上側のマクロブロックＥ１乃至Ｅ４の動きベクトル情報が予測値として用いられる。 FIG. 25D shows a case where the block sizes of the macroblocks E1 to E8 are 4 × 8. The median prediction described above is applied to the upper macroblocks E1 to E4, and the motion vector information of the upper macroblocks E1 to E4 is used as a prediction value for the lower macroblocks E5 to E8. .

Frame/Field Flagの値が１である場合においても、動きベクトル情報の水平方向成分の予測に関しては、上述の方式に準ずる。しかしながら、垂直方向成分に関しては、フィールドベースのブロックとフレームベースのブロックが混在するため、以下のような処理を行う。なお、注目するマクロブロックＥとこれに隣接するマクロブロックＡ乃至Ｄは、図２４に示すように配置されているとする。 Even when the value of the Frame / Field Flag is 1, the prediction of the horizontal direction component of the motion vector information conforms to the above method. However, regarding the vertical component, since field-based blocks and frame-based blocks coexist, the following processing is performed. Note that the macro block E of interest and the macro blocks A to D adjacent thereto are arranged as shown in FIG.

マクロブロックＥをフレームベース符号化する場合であって、隣接するマクロブロックＡ乃至Ｄのいずれかがフィールドベース符号化されている場合、第１フィールドに対する動きベクトル情報の垂直方向成分と、第２フィールドに対する動きベクトル情報の垂直方向成分の平均値の２倍を算出し、これをフレームベースの動きベクトル情報に相当するものとして予測処理を行う。 When the macroblock E is frame-based encoded and any of the adjacent macroblocks A to D is field-based encoded, the vertical direction component of the motion vector information for the first field and the second field Twice the average value of the vertical component of the motion vector information for the motion vector information is calculated, and the prediction processing is performed assuming that this is equivalent to the frame-based motion vector information.

When the macroblock E is field-based encoded and any of the adjacent macroblocks A to D is frame-based encoded, the quotient obtained by dividing the vertical component of the motion vector information by 2 is treated as equivalent to a field-based motion vector, and the prediction processing is performed with it.
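The two conversions of the vertical component described above can be sketched as follows. This is only an illustrative sketch: the function names are hypothetical, the average is assumed to be exact (so that twice the average equals the sum of the two field components), and truncation toward zero for the quotient is an assumption, since the text does not state a rounding rule.

```python
def neighbor_vertical_to_frame(mv_y_first_field: int, mv_y_second_field: int) -> int:
    """Field-coded neighbour, frame-coded target macroblock E.

    Twice the average of the two field components; with an exact
    (unrounded) average this is simply their sum (assumption).
    """
    return mv_y_first_field + mv_y_second_field


def neighbor_vertical_to_field(mv_y_frame: int) -> int:
    """Frame-coded neighbour, field-coded target macroblock E.

    Quotient of the vertical component divided by 2. Truncation toward
    zero is an assumption; the rounding rule is not given in the text.
    """
    return int(mv_y_frame / 2)
```

The horizontal component is left untouched in both directions, in line with the statement that its prediction follows the ordinary method.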

Non-Patent Document 1: "Video Compression Using Context-Based Adaptive Arithmetic Coding", Marpe et al., ICIO1
Non-Patent Document 2: "Arithmetic Coding for Data Compression", Witten et al., Comm. of the ACM, 30 (6), 1987, pp. 520-541
Non-Patent Document 3: "Interlace Coding Tools for H.26L Video Coding", L. Wang et al., VCEG-O37, Dec. 2001

In Non-Patent Document 3, the syntax elements necessary for macroblock-level field/frame encoding are added, and the semantics of syntax elements such as motion vector information are also changed. However, no new context model is introduced and no existing context model is changed to accommodate this, so the information proposed in Non-Patent Document 3 alone does not make macroblock-level field/frame encoding using the CABAC method possible.

The CABAC method requires more computation for the encoding process than the UVLC method, but is known to achieve higher encoding efficiency. It is therefore desirable that macroblock-level field/frame encoding using the CABAC method can be realized even when the input image information is in an interlaced scanning format.

The present invention has been made in view of such circumstances, and its object is to enable macroblock-level field/frame encoding using the CABAC method even when the input image information is in an interlaced scanning format.

One aspect of the present invention is a decoding apparatus that decodes encoded data in which image information is encoded, the decoding apparatus including: context model means for converting, using encoding mode information that indicates at the macroblock level whether the image information is encoded in frame mode or in field mode, the motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so as to match the encoding mode of the target macroblock, and for calculating a context model corresponding to the motion vector prediction error of the target macroblock; and context-adaptive arithmetic decoding means for performing context-adaptive arithmetic decoding on the encoded data using the context model calculated by the context model means.

One aspect of the present invention is also a decoding method for a decoding apparatus that decodes encoded data in which image information is encoded, in which context model means converts, using encoding mode information that indicates at the macroblock level whether the image information is encoded in frame mode or in field mode, the motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so as to match the encoding mode of the target macroblock, and calculates a context model corresponding to the motion vector prediction error of the target macroblock, and context-adaptive arithmetic decoding means performs context-adaptive arithmetic decoding on the encoded data using the calculated context model.

One aspect of the present invention is, further, a computer-readable recording medium on which is recorded a program for causing a computer that decodes encoded data in which image information is encoded to function as: context model means for converting, using encoding mode information that indicates at the macroblock level whether the image information is encoded in frame mode or in field mode, the motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so as to match the encoding mode of the target macroblock, and for calculating a context model corresponding to the motion vector prediction error of the target macroblock; and context-adaptive arithmetic decoding means for performing context-adaptive arithmetic decoding on the encoded data using the context model calculated by the context model means.

One aspect of the present invention is also a program for causing a computer that decodes encoded data in which image information is encoded to function as: context model means for converting, using encoding mode information that indicates at the macroblock level whether the image information is encoded in frame mode or in field mode, the motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so as to match the encoding mode of the target macroblock, and for calculating a context model corresponding to the motion vector prediction error of the target macroblock; and context-adaptive arithmetic decoding means for performing context-adaptive arithmetic decoding on the encoded data using the context model calculated by the context model means.

In one aspect of the present invention, encoding mode information that indicates at the macroblock level whether image information is encoded in frame mode or in field mode is used to convert the motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so as to match the encoding mode of the target macroblock, a context model corresponding to the motion vector prediction error of the target macroblock is calculated, and context-adaptive arithmetic decoding is performed on the encoded data using the calculated context model.

As described above, according to the present invention, compressed image information in which image information in an interlaced scanning format has been field/frame encoded at the macroblock level using the CABAC method can be decoded to restore the image information in the interlaced scanning format.

FIG. 1 is a block diagram showing the configuration of a conventional image information encoding apparatus that realizes image compression by orthogonal transform processing and motion compensation processing.
FIG. 2 is a block diagram showing the configuration of an image information decoding apparatus corresponding to the image information encoding apparatus of FIG. 1.
FIG. 3 is a diagram showing an example of the correspondence between symbol occurrence probabilities and the assigned sub-intervals in arithmetic coding.
FIG. 4 is a diagram showing an example of arithmetic coding processing.
FIG. 5 is a block diagram showing the general configuration of a CABAC encoder.
FIG. 6 is a diagram for explaining the context model of MB_type.
FIG. 7 is a diagram for explaining the context model of the motion vector information MVD.
FIG. 8 is a diagram for explaining the process of encoding the motion vector information MVD based on a context model.
FIG. 9 is a diagram for explaining the intra prediction modes defined in H.26L.
FIG. 10 is a diagram for explaining the directions of the intra prediction modes with labels 1 to 5.
FIG. 11 is a diagram for explaining the single scan method and the double scan method defined in H.26L.
FIG. 12 is a diagram showing the context model corresponding to (RUN, LEVEL) defined in H.26L.
FIG. 13 is a diagram for explaining the process of binarizing syntax elements other than MB_type in H.26L.
FIG. 14 is a diagram for explaining the process of binarizing the MB_type of P pictures and B pictures in H.26L.
FIG. 15 is a diagram showing the seven modes defined in H.26L as units of motion prediction and compensation in a macroblock.
FIG. 16 is a diagram showing the syntax of image compression information extended to allow macroblock-level field/frame adaptive encoding.
FIG. 17 is a diagram for explaining the rearrangement of the pixels of a macroblock when the macroblock is encoded on a field basis.
FIG. 18 is a diagram showing the five modes defined as units of motion prediction and compensation when a macroblock is encoded on a field basis.
FIG. 19 is a diagram for explaining the operating principle of intra prediction within a macroblock when the macroblock is encoded on a field basis.
FIG. 20 is a diagram for explaining the operating principle of intra prediction across macroblocks when a macroblock is encoded on a field basis.
FIG. 21 is a diagram for explaining the operating principle of intra prediction for color difference signals when a macroblock is encoded on a field basis.
FIG. 22 is a diagram for explaining the operating principle of encoding the residual components of color difference signals when a macroblock is encoded on a field basis.
FIG. 23 is a diagram for explaining the multiple frame prediction specified in H.26L.
FIG. 24 is a diagram for explaining the prediction method for motion vector information when a macroblock is encoded on a field basis.
FIG. 25 is a diagram for explaining the process of generating prediction values of motion vector information in each prediction mode defined in H.26L.
FIG. 26 is a block diagram showing a configuration example of an image information encoding apparatus according to an embodiment of the present invention.
FIG. 27 is a block diagram showing a configuration example of the arithmetic encoding unit 58 of FIG. 26.
FIG. 28 is a diagram showing tables for binarizing the MB_type of macroblocks belonging to P pictures and B pictures when a macroblock is encoded on a field basis.
FIG. 29 is a block diagram showing a configuration example of an image information decoding apparatus according to an embodiment of the present invention, corresponding to the image information encoding apparatus of FIG. 26.

Hereinafter, an image information encoding apparatus to which the present invention is applied will be described with reference to FIG. 26. This image information encoding apparatus can perform encoding using the CABAC method even when the input image information is in an interlaced scanning format.

In the image information encoding apparatus, the A/D conversion unit 51 converts an input image signal, which is an analog signal, into a digital signal and outputs it to the screen rearrangement buffer 52. The screen rearrangement buffer 52 rearranges the input image information from the A/D conversion unit 51 according to the GOP structure of the image compression information output by the image information encoding apparatus, and outputs it to the adder 54.

The field/frame determination unit 53 determines, for each macroblock of the image to be processed, whether field-based encoding or frame-based encoding gives the higher encoding efficiency, generates the corresponding Frame/Field Flag, and outputs it to the field/frame conversion unit 55 and the arithmetic encoding unit 58.

When the macroblock to be processed is inter-coded, the adder 54 generates a difference image between the input image supplied via the field/frame determination unit 53 and the reference image from the motion prediction/compensation unit 64, and outputs it to the field/frame conversion unit 55 and the orthogonal transform unit 56. When the macroblock to be processed is intra-coded, the adder 54 outputs the input image supplied via the field/frame determination unit 53 as it is to the field/frame conversion unit 55 and the orthogonal transform unit 56.

When the macroblock to be processed is encoded on a field basis, the field/frame conversion unit 55 converts the input image from the adder 54 into a field structure and outputs it to the orthogonal transform unit 56. The orthogonal transform unit 56 applies an orthogonal transform (such as the discrete cosine transform or the Karhunen-Loève transform) to the input image information and supplies the resulting transform coefficients to the quantization unit 57. The quantization unit 57 quantizes the transform coefficients supplied from the orthogonal transform unit 56 under the control of the rate control unit 65.
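As a rough illustration of what a conversion between frame structure and field structure can look like, the sketch below regroups the rows of a macroblock. The row layout is an assumption made for illustration (even-numbered rows taken as the first field, odd-numbered rows as the second field), and the function names are hypothetical; the text itself only states that the conversion is performed.

```python
from typing import List, Sequence


def frame_to_field(rows: Sequence) -> List:
    """Regroup interleaved rows so the first-field rows come first.

    Assumes even-indexed rows belong to the first field and odd-indexed
    rows to the second field (an illustrative assumption).
    """
    return list(rows[0::2]) + list(rows[1::2])


def field_to_frame(rows: Sequence) -> List:
    """Inverse of frame_to_field: re-interleave the two fields."""
    if len(rows) % 2 != 0:
        raise ValueError("expected an even number of rows")
    half = len(rows) // 2
    first, second = rows[:half], rows[half:]
    frame = []
    for top_row, bottom_row in zip(first, second):
        frame.extend([top_row, bottom_row])
    return frame
```

Applying `field_to_frame` to the output of `frame_to_field` returns the original row order, which is the relationship one would expect between the conversion on the encoder side and the reverse conversion on the decoder side.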

The arithmetic encoding unit 58 arithmetically encodes, based on the CABAC method, each syntax element input from the quantization unit 57 and the motion prediction/compensation unit 64 as well as the Frame/Field Flag from the field/frame determination unit 53, and supplies the result to the accumulation buffer 59 for storage. The accumulation buffer 59 outputs the stored image compression information to the subsequent stage.

The inverse quantization unit 60 inversely quantizes the quantized orthogonal transform coefficients and outputs them to the inverse orthogonal transform unit 61. The inverse orthogonal transform unit 61 applies an inverse orthogonal transform to the inversely quantized transform coefficients to generate decoded image information, which is supplied to the frame memory 62 and stored. When the macroblock to be processed is encoded on a field basis, the field/frame conversion unit 63 converts the decoded image information stored in the frame memory 62 into a field structure and outputs it to the motion prediction/compensation unit 64.

The motion prediction/compensation unit 64 generates optimal prediction mode information and motion vector information through motion prediction processing and outputs them to the arithmetic encoding unit 58, and also generates a predicted image and outputs it to the adder 54. The rate control unit 65 performs feedback control of the operation of the quantization unit 57 based on the amount of data stored in the accumulation buffer 59. The control unit 66 controls each unit of the image information encoding apparatus according to a control program recorded on the recording medium 67.

Next, the operating principle of the arithmetic encoding unit 58 will be described with reference to FIG. 27, which shows a configuration example of the arithmetic encoding unit 58. In the arithmetic encoding unit 58, among the syntax elements of the input image compression information, the frame/field flag shown in FIG. 16 is encoded first, by the frame/field flag context model 91.

When the macroblock to be processed is frame-based encoded, the frame-based context model 92 defined in the current H.26L standard is applied. A syntax element whose value is not yet binarized is binarized by the binarization unit 93 before being arithmetically coded.

On the other hand, when the macroblock to be processed is field-based encoded, the field-based context model 94 is applied to the following five syntax elements: first, MB_type for I pictures; second, MB_type for P/B pictures; third, motion vector information; fourth, the reference field parameter; and fifth, the intra prediction mode. A syntax element whose value is not yet binarized is binarized by the binarization unit 95 before being arithmetically coded.

In the following, macroblocks A, B, and C are assumed to be arranged as shown in FIG. 6. First, the context model for the frame/field flag is described. The context model ctx_fifr_flag(C) for the frame/field flag of macroblock C is defined by the following equation (21).
ctx_fifr_flag(C) = a + 2b ... (21)
In equation (21), a and b are the values of the frame/field flag of macroblocks A and B, respectively.
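Equation (21) can be sketched in a few lines. The function name mirrors the symbol in the text; the argument validation is an added assumption for illustration, based on the flag taking only the values 0 and 1.

```python
def ctx_fifr_flag(a: int, b: int) -> int:
    """Context index for the frame/field flag of macroblock C, eq. (21).

    a, b: frame/field flag values (0 or 1) of the adjacent
    macroblocks A and B.
    """
    if a not in (0, 1) or b not in (0, 1):
        raise ValueError("frame/field flag values are expected to be 0 or 1")
    return a + 2 * b
```

Because b is weighted by 2, the four combinations of neighbouring flags map to four distinct context indices 0 to 3.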

Next, the context model for MB_type in I pictures is described. When the frame/field flag is 1, the context model ctx_mb_type_intra_field(C) corresponding to the MB_type of a macroblock C included in an I picture is defined by the following equation (22), in the same manner as equation (3).
ctx_mb_type_intra_field(C) = A + B ... (22)
A and B in equation (22) are the same as in equation (3). The adjacent macroblocks A and B may be either field-based encoded or frame-based encoded.

Next, the context model for MB_type in P/B pictures is described. When macroblock C is included in a P picture, the context model ctx_mb_type_inter_field(C) corresponding to the MB_type of macroblock C is defined by the following equation (23). When macroblock C is included in a B picture, it is defined by the following equation (24).
ctx_mb_type_inter_field(C) = ((A==skip)?0:1) + 2((B==skip)?0:1) ... (23)
ctx_mb_type_inter_field(C) = ((A==Direct)?0:1) + 2((B==Direct)?0:1) ... (24)

Here, the operators ((A==skip)?0:1) and ((B==skip)?0:1) in equation (23) are the same as in equation (4), and the operators ((A==Direct)?0:1) and ((B==Direct)?0:1) in equation (24) are the same as in equation (5). The adjacent macroblocks A and B may be either field-based encoded or frame-based encoded.
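Equations (23) and (24) differ only in which MB_type is singled out (skip for P pictures, Direct for B pictures), so both can be sketched with one function. Representing MB_type and the picture type as strings is an assumption made purely for illustration.

```python
def ctx_mb_type_inter_field(mb_type_a: str, mb_type_b: str, picture_type: str) -> int:
    """Context index for MB_type of macroblock C, eqs. (23) and (24).

    mb_type_a, mb_type_b: MB_type of the adjacent macroblocks A and B
    (string labels are a hypothetical representation).
    picture_type: "P" selects eq. (23), "B" selects eq. (24).
    """
    if picture_type == "P":
        special = "skip"
    elif picture_type == "B":
        special = "direct"
    else:
        raise ValueError("picture_type must be 'P' or 'B'")
    term_a = 0 if mb_type_a == special else 1
    term_b = 0 if mb_type_b == special else 1
    return term_a + 2 * term_b
```

The coding mode (field or frame) of A and B does not appear as an argument, reflecting the statement that the neighbours may be encoded either way.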

The MB_type of a P picture that has not yet been binarized is binarized using the table shown in FIG. 28A. The MB_type of a B picture that has not yet been binarized is binarized using the table shown in FIG. 28B.

In the adaptive binary arithmetic coding unit 96, the probability estimation unit 97 estimates the probability of each binarized symbol, and the coding engine 98 performs adaptive arithmetic coding based on that estimate. The related models are updated after the adaptive arithmetic coding is performed, so each model can code according to the statistics of the actual image compression information.

For a frame-based encoded macroblock belonging to a P picture, 10 types of MB_type are defined. For a field-based encoded macroblock belonging to a P picture, on the other hand, the 16 × 16 mode and the 8 × 16 mode among these 10 types are not defined. That is, 8 types of MB_type are defined for P pictures for field-based encoded macroblocks.

For a frame-based encoded macroblock, 18 types of MB_type are defined for B pictures. For a field-based encoded macroblock belonging to a B picture, on the other hand, the forward 16 × 16 mode, the backward 16 × 16 mode, the forward 8 × 16 mode, and the backward 8 × 16 mode among these 18 types are not defined. That is, 14 types of MB_type are defined for B pictures for field-based encoded macroblocks.

Next, the context model for motion vector information is described. When the value of the frame/field flag is 1, the first to third context models ctx_mvd_field(C, k) corresponding to the motion vector information of macroblock C are defined by the following equations (25-1) to (25-3).
ctx_mvd_field(C, k) = 0, if e_k(C) < 3 ... (25-1)
ctx_mvd_field(C, k) = 1, if 32 < e_k(C) ... (25-2)
ctx_mvd_field(C, k) = 2, if 3 ≤ e_k(C) ≤ 32 ... (25-3)
The evaluation function e_k in equations (25-1) to (25-3) is defined by the following equation (26). Macroblocks A and B are in the same parity field.
e_k(C) = |mvd_k(A)| + |mvd_k(B)| ... (26)

Here, when macroblock A is frame-based encoded, for the vertical component of the motion vector information mvd_1(A), the value mvd_{1_field}(A) calculated by the following equation (27) is applied to equation (26). The same applies when macroblock B is frame-based encoded.
mvd_{1_field}(A) = mvd_{1_frame}(A) / 2 ... (27)

Conversely, when macroblock C is frame-based encoded and the adjacent block A is field-based encoded, the horizontal and vertical components of mvd_k(A) applied to equation (26) are the values mvd_{k_frame}(A) calculated by the following equations (28-1) and (28-2), respectively.
mvd_{0_frame}(A) = (mvd_{0_top}(A) + mvd_{0_bottom}(A)) / 2 ... (28-1)
mvd_{1_frame}(A) = mvd_{1_top}(A) + mvd_{1_bottom}(A) ... (28-2)
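The relationship among equations (25-1) to (28-2) can be sketched as follows: the neighbour's values are first converted to the coding mode of macroblock C, then the evaluation function is computed and thresholded. The function names, the (horizontal, vertical) tuple representation, and the use of exact (unrounded) division are assumptions for illustration; the text does not state a rounding rule.

```python
from typing import Tuple

Mvd = Tuple[float, float]  # (horizontal k=0, vertical k=1); hypothetical representation


def mvd_frame_to_field(mvd_frame: Mvd) -> Mvd:
    """Eq. (27): halve the vertical component of a frame-coded neighbour."""
    horizontal, vertical = mvd_frame
    return (horizontal, vertical / 2)


def mvd_field_to_frame(mvd_top: Mvd, mvd_bottom: Mvd) -> Mvd:
    """Eqs. (28-1), (28-2): combine the two fields of a field-coded neighbour."""
    horizontal = (mvd_top[0] + mvd_bottom[0]) / 2  # (28-1): average
    vertical = mvd_top[1] + mvd_bottom[1]          # (28-2): sum
    return (horizontal, vertical)


def evaluation(mvd_a_k: float, mvd_b_k: float) -> float:
    """Eq. (26): e_k(C) = |mvd_k(A)| + |mvd_k(B)|."""
    return abs(mvd_a_k) + abs(mvd_b_k)


def ctx_mvd_field(e_k: float) -> int:
    """Eqs. (25-1) to (25-3): map e_k(C) to a context index."""
    if e_k < 3:
        return 0
    if e_k > 32:
        return 1
    return 2
```

For example, with a frame-coded neighbour A whose vertical component is 6 and a field-coded neighbour B whose vertical component is 1, a field-coded macroblock C would use `evaluation(mvd_frame_to_field((0, 6))[1], 1)`, that is 3 + 1 = 4, which falls in the middle range and selects context 2.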

Next, the context model for the reference field parameter is described. When the value of the frame/field flag is 1, the first context model ctx_ref_field_top(C) corresponding to the first field is defined by the following equation (29-1). The first context model ctx_ref_field_bot(C) corresponding to the second field is defined by the following equation (29-2).
ctx_ref_field_top(C) = a_t + 2b_t ... (29-1)
ctx_ref_field_bot(C) = a_b + 2b_b ... (29-2)

In equations (29-1) and (29-2), the parameter a_t relates to the first field of the adjacent macroblock A, the parameter a_b relates to the second field of the adjacent macroblock A, the parameter b_t relates to the first field of the adjacent macroblock B, and the parameter b_b relates to the second field of the adjacent macroblock B. They are defined by the following equations (30-1) and (30-2).
a_t, a_b, b_t, b_b = 0, if the reference field is the most recently encoded one ... (30-1)
a_t, a_b, b_t, b_b = 1, otherwise ... (30-2)
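Equations (29-1) to (30-2) can be sketched as follows. The same function serves both the first-field and second-field contexts, since only the inputs differ; the function names and the boolean input are hypothetical choices made for illustration.

```python
def ref_field_param(refers_to_most_recent_field: bool) -> int:
    """Eqs. (30-1), (30-2): 0 if the reference field is the most
    recently encoded one, 1 otherwise."""
    return 0 if refers_to_most_recent_field else 1


def ctx_ref_field(a: int, b: int) -> int:
    """Eqs. (29-1), (29-2): context index for the first bin.

    Pass (a_t, b_t) for ctx_ref_field_top(C) and
    (a_b, b_b) for ctx_ref_field_bot(C).
    """
    return a + 2 * b
```

For instance, if the first field of A refers to the most recently encoded field but the first field of B does not, `ctx_ref_field(ref_field_param(True), ref_field_param(False))` evaluates to 2.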

The context models for the second and subsequent bins are each defined in the same manner as the context model ctx_ref_frame(C) shown in equation (8). However, the Code_number to be encoded is one assigned to a field rather than to a frame.

Next, the context model for the intra prediction mode is described. When the value of the frame/field flag is 1, the context model ctx_intra_pred_field(C) for the intra prediction mode of macroblock C is defined in the same manner as the context model ctx_intra_pred(C) for a frame-mode macroblock. The adjacent macroblocks A and B may be either field-based encoded or frame-based encoded.

As described above, introducing new context models and modifying existing context models makes field/frame encoding using the CABAC method possible.

Next, FIG. 29 shows a configuration example of an image information decoding apparatus corresponding to the image information encoding apparatus of FIG. 26.

In the image information decoding apparatus, the accumulation buffer 101 stores the input image compression information and outputs it to the arithmetic decoding unit 102 as appropriate. The arithmetic decoding unit 102 performs arithmetic decoding on the image compression information encoded based on the CABAC method, outputs the decoded frame/field flag to the field/frame conversion units 105 and 110, outputs the quantized orthogonal transform coefficients to the inverse quantization unit 103, and outputs the prediction mode information and motion vector information to the motion prediction/compensation unit 111.

The inverse quantization unit 103 inversely quantizes the quantized orthogonal transform coefficients decoded by the arithmetic decoding unit 102. The inverse orthogonal transform unit 104 applies an inverse orthogonal transform to the inversely quantized coefficients. When the macroblock being processed is field-based encoded, the field/frame conversion unit 105 converts the output image or difference image obtained from the inverse orthogonal transform into a frame structure.

When the macroblock being processed is an inter macroblock, the adder 106 combines the difference image from the inverse orthogonal transform unit 104 with the reference image from the motion prediction/compensation unit 111 to generate an output image. The screen rearrangement buffer 107 reorders the output images according to the GOP structure of the input compressed image information and outputs them to the D/A conversion unit 108. The D/A conversion unit 108 converts the output image, which is a digital signal, into an analog signal and outputs it to the subsequent stage.

The frame memory 109 stores the image information generated by the adder 106, from which reference images are derived. When the macroblock being processed is field-based encoded, the field/frame conversion unit 110 converts the image information stored in the frame memory 109 into a field structure. The motion prediction/compensation unit 111 generates a reference image from the image information stored in the frame memory 109, based on the per-macroblock prediction mode information and motion vector information contained in the compressed image information, and outputs it to the adder 106.

The image information decoding apparatus configured as described above can decode the compressed image information output by the image information encoding apparatus of FIG. 26 and recover the original image information.
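The role of the field/frame conversion units 105 and 110 and of the adder 106 can be sketched as follows. This is a minimal illustration rather than the apparatus itself. It assumes that a field-structured block stores all top-field lines followed by all bottom-field lines, that a frame-structured block interleaves them, that the block has an even number of lines, that the reference block is already in frame structure, and that samples are clipped to 8 bits. All function names are hypothetical.

```python
def field_to_frame(rows):
    """Interleave top-field and bottom-field lines (cf. conversion unit 105)."""
    half = len(rows) // 2
    top, bottom = rows[:half], rows[half:]
    frame = []
    for t, b in zip(top, bottom):
        frame.append(t)
        frame.append(b)
    return frame


def frame_to_field(rows):
    """Split frame lines into top field then bottom field (cf. unit 110)."""
    return rows[0::2] + rows[1::2]


def reconstruct(diff, ref, is_field, is_inter):
    """Rebuild one block from the inverse-transform output.

    diff: lines produced by the inverse orthogonal transform
    ref:  frame-structured reference lines (ignored for intra blocks)
    """
    if is_field:
        diff = field_to_frame(diff)
    if not is_inter:
        return diff  # intra: the transform output is already the image
    # Inter: add the difference image to the reference image (cf. adder 106),
    # clipping each sample to the assumed 8-bit range.
    return [[max(0, min(255, d + r)) for d, r in zip(d_row, r_row)]
            for d_row, r_row in zip(diff, ref)]
```

The two conversions are inverses of each other, so applying `frame_to_field` to the result of `field_to_frame` returns the original line order.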

The series of processes described above can be executed by hardware, but can also be executed by software. When the series of processes is executed by software, the program constituting the software is installed, for example from the recording medium 67 of FIG. 26, onto a computer built into dedicated hardware, or onto a machine such as a general-purpose personal computer that can execute various functions once various programs are installed.

The recording medium 67 is constituted not only by package media on which the program is recorded and which are distributed separately from the computer to provide the program to users, such as a magnetic disk (including a flexible disk), an optical disc (including a CD-ROM (Compact Disc-Read Only Memory) and a DVD (Digital Versatile Disc)), a magneto-optical disc (including an MD (Mini Disc)), or a semiconductor memory, but also by a ROM, a hard disk, or the like on which the program is recorded and which is provided to users pre-installed in the computer.

In this specification, the steps describing the program recorded on the recording medium include not only processing performed in time series in the described order, but also processing that is executed in parallel or individually without necessarily being processed in time series.

53 field/frame determination unit, 55 field/frame conversion unit, 58 arithmetic coding unit, 63 field/frame conversion unit, 66 control unit, 67 recording medium, 102 arithmetic decoding unit, 105 field/frame conversion unit, 110 field/frame conversion unit

Claims

A decoding apparatus that decodes encoded data in which image information is encoded, comprising:
context model means for converting, using encoding mode information that indicates at the macroblock level whether the image information is encoded in frame mode or field mode, a motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so that it matches the encoding mode of the target macroblock, and for calculating a context model corresponding to the motion vector prediction error of the target macroblock; and
context adaptive arithmetic decoding means for performing context adaptive arithmetic decoding on the encoded data using the context model calculated by the context model means.

A decoding method for a decoding apparatus that decodes encoded data in which image information is encoded, wherein:
context model means converts, using encoding mode information that indicates at the macroblock level whether the image information is encoded in frame mode or field mode, a motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so that it matches the encoding mode of the target macroblock, and calculates a context model corresponding to the motion vector prediction error of the target macroblock; and
context adaptive arithmetic decoding means performs context adaptive arithmetic decoding on the encoded data using the calculated context model.

A computer-readable recording medium storing a program for causing a computer that decodes encoded data in which image information is encoded to function as:
context model means for converting, using encoding mode information that indicates at the macroblock level whether the image information is encoded in frame mode or field mode, a motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so that it matches the encoding mode of the target macroblock, and for calculating a context model corresponding to the motion vector prediction error of the target macroblock; and
context adaptive arithmetic decoding means for performing context adaptive arithmetic decoding on the encoded data using the context model calculated by the context model means.

A program for causing a computer that decodes encoded data in which image information is encoded to function as:
context model means for converting, using encoding mode information that indicates at the macroblock level whether the image information is encoded in frame mode or field mode, a motion vector prediction error of an adjacent macroblock adjacent to a target macroblock to be decoded so that it matches the encoding mode of the target macroblock, and for calculating a context model corresponding to the motion vector prediction error of the target macroblock; and
context adaptive arithmetic decoding means for performing context adaptive arithmetic decoding on the encoded data using the context model calculated by the context model means.