JP2000236547A

JP2000236547A - Image information converter and image information conversion method

Info

Publication number: JP2000236547A
Application number: JP18360299A
Authority: JP
Inventors: Kazufumi Sato; 数史佐藤; Takeshi Kubozono; 猛窪園; Shintaro Okada; 紳太郎岡田; Iku Ryuu; イクリュウ; Hisafumi Yanagihara; 尚史柳原
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1998-12-15
Filing date: 1999-06-29
Publication date: 2000-08-29

Abstract

PROBLEM TO BE SOLVED: To decrease a bit rate of a bit stream resulting from compression coding in compliance with the moving picture experts group MPEG-2 system while minimizing degradation in image quality of a decoded image. SOLUTION: An inverse quantizer 26 applies inverse quantization to a bit stream in compliance with the MPG-2 and outputs a discrete cosine coefficient in the unit of discrete cosine transform DCT blocks. A band limit device 27 replaces a high frequency component of the discrete cosine coefficient in a horizontal direction with zero. A quantizer 28 quantizes the discrete cosine coefficient with a quantization width different from an original quantization width.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、所定の画素ブロッ
ク単位で直交変換するとともに直交変換して得られた上
記直交変換ブロック内の直交変換係数を量子化すること
により圧縮符号化をした画像圧縮情報のビットレートを
変換する画像情報変換装置及び画像情報変換方法に関す
るものである。例えば、本発明は、衛星放送やケーブル
テレビジョン等のネットワークに伝送する場合や、光デ
ィスクや磁気ディスク等の記録メディアに記録する場合
などに、ＭＰＥＧ−２等により圧縮符号化した画像圧縮
情報のビットレートを削減するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an image compression system which performs an orthogonal transform on a predetermined pixel block basis and quantizes an orthogonal transform coefficient in the orthogonal transform block obtained by the orthogonal transform. The present invention relates to an image information conversion device and an image information conversion method for converting a bit rate of information. For example, the present invention is applied to a case where bits of image compression information compressed and encoded by MPEG-2 or the like are transmitted to a network such as a satellite broadcast or a cable television, or to a recording medium such as an optical disk or a magnetic disk. It is to reduce the rate.

【０００２】[0002]

【従来の技術】近年、画像情報をデジタルデータとして
取り扱い、そのデジタルデータを画像情報特有の冗長性
を利用した直交変換と動き補償により圧縮を行い、衛星
放送やケーブルテレビジョン等のネットワークメディア
へ伝送や、光ディスクや磁気ディスク等のストレージメ
ディアへの記録を行う装置が普及している。このような
装置では、一般に、画像圧縮方式に、離散コサイン変換
を用いたＭＰＥＧ−２（Moving Picture Experts Group
phase - 2）が用いられている。2. Description of the Related Art In recent years, image information is handled as digital data, and the digital data is compressed by orthogonal transformation and motion compensation using redundancy inherent in the image information, and transmitted to network media such as satellite broadcasting and cable television. In addition, devices that perform recording on storage media such as optical disks and magnetic disks have become widespread. In such an apparatus, MPEG-2 (Moving Picture Experts Group) using discrete cosine transform as an image compression method is generally used.
phase-2) is used.

【０００３】また、近年、このＭＰＥＧ−２等の画像圧
縮方式を用いたデジタルテレビジョン放送の規格化が進
められている。デジタルテレビジョン放送の規格には、
標準解像度画像（例えば垂直方向の有効ライン数が５７
６本）に対応した規格、高解像度画像（例えば垂直方向
の有効ライン数が１１５２本）に対応した規格等があ
る。In recent years, standardization of digital television broadcasting using an image compression method such as MPEG-2 has been promoted. Digital television broadcasting standards include:
Standard resolution image (for example, if the number of effective lines in the vertical direction is 57
6), a standard corresponding to a high-resolution image (for example, the number of effective lines in the vertical direction is 1152), and the like.

【０００４】ところで、この高解像度画像の画像情報は
膨大であり、ＭＰＥＧ−２等の符号化方式を用いて圧縮
しても、十分な画質を得るためには多くの符号量（ビッ
トレート）が必要となる。例えば、画枠が１９２０画素
×１０８０画素の３０Ｈｚの飛び越し走査画像の場合に
は、１８〜２２Ｍｂｐｓ程度或いはそれ以上の符号量
（ビットレート）を必要とする。Incidentally, the image information of this high-resolution image is enormous, and even if it is compressed using an encoding method such as MPEG-2, a large amount of code (bit rate) is required to obtain a sufficient image quality. Required. For example, in the case of a 30 Hz interlaced scan image having an image frame of 1920 × 1080 pixels, a code amount (bit rate) of about 18 to 22 Mbps or more is required.

【０００５】そのため、例えば衛星放送やケーブルテレ
ビジョン等のネットワークメディアへこのような高解像
度画像を伝送する場合には、伝送経路のバンド幅に合わ
せて更に符号量（ビットレート）の削減をしなければな
らない。同様に、光ディスクや磁気ディスク等のストレ
ージメディアへこのような高解像度画像を記録する場合
にも、メディアの記録容量に合わせて、更に符号量（ビ
ットレート）の削減をしなければならない。また、この
ような符号量（ビットレート）の削減の必要性は、高解
像度画像のみならず、標準解像度画像（例えば画枠が７
２０画素×４８０画素の３０Ｈｚの飛び越し走査画像
等）でも生じることが考えられる。Therefore, when transmitting such a high-resolution image to network media such as satellite broadcasting and cable television, the code amount (bit rate) must be further reduced in accordance with the bandwidth of the transmission path. Must. Similarly, when recording such a high-resolution image on a storage medium such as an optical disk or a magnetic disk, the code amount (bit rate) must be further reduced in accordance with the recording capacity of the medium. Further, the necessity of reducing the code amount (bit rate) is not limited to a high-resolution image, but also to a standard-resolution image (for example, when the image frame is 7 frames).
It is also conceivable that this may occur in a 30 Hz interlaced scan image of 20 pixels × 480 pixels).

【０００６】従来より、図４１に示すような、画像復号
化装置１１０と画像符号化装置１２０とが直列接続され
て構成された画像情報変換装置が、画像圧縮情報（ビッ
トストリーム）の持つ符号量（ビットレート）を削減す
る装置として知られている。Conventionally, as shown in FIG. 41, an image information conversion device configured by connecting an image decoding device 110 and an image encoding device 120 in series has been used to provide a code amount of image compression information (bit stream). (Bit rate).

【０００７】この従来の画像情報変換装置では、高ビッ
トレートの画像圧縮情報（ビットストリーム）が画像復
号化装置１１０に入力する。この画像復号化装置１１０
は、高ビットレートの画像圧縮情報を一旦完全に復号
し、ベースバンドのビデオデータを出力する。そして、
画像符号化装置１２０は、画像復号化装置２１の出力と
して得られるベースバンドのビデオデータを再符号化
し、低ビットレートの画像圧縮情報（ビットストリー
ム）を出力する。このとき、画像符号化装置１２０は、
入力された画像圧縮情報の符号量（高ビットレート）よ
り低い目標符号量（ターゲットビットレート）が予め与
えられ、この目標符号量に応じた量子化処理をする。In this conventional image information conversion apparatus, high bit rate image compression information (bit stream) is input to the image decoding apparatus 110. This image decoding device 110
Once completely decodes the high bit rate image compression information and outputs baseband video data. And
The image encoding device 120 re-encodes baseband video data obtained as an output of the image decoding device 21, and outputs low bit rate image compression information (bit stream). At this time, the image encoding device 120
A target code amount (target bit rate) lower than the code amount (high bit rate) of the input image compression information is given in advance, and quantization processing according to the target code amount is performed.

【０００８】以上のような処理を行うことにより、従来
の画像情報変換装置では、画像圧縮情報の符号量を削減
することができる。[0008] By performing the above processing, the conventional image information conversion apparatus can reduce the code amount of the image compression information.

【０００９】[0009]

【発明が解決しようとする課題】しかしながら、このよ
うな従来の画像情報変換装置では、復号化及び符号化に
伴う全てのハードウェア構成が必要となるため、コスト
と消費電力が大きくなり、民生用の機器や携帯型の機器
に組み込むことは困難であった。また、このような従来
の画像情報変換装置では、復号化及び符号化に伴う処理
を全て行わなければならないため、演算処理量が非常に
膨大となり、汎用の集積回路を用いてソフトウエアとし
て実現する場合には、回路の持つ演算処理能力では実時
間内に処理が終わらないといった問題が生じる可能性が
あった。However, such a conventional image information conversion apparatus requires all hardware components for decoding and encoding, so that the cost and power consumption increase, and It has been difficult to incorporate it into other devices or portable devices. Further, in such a conventional image information conversion device, since all processes associated with decoding and encoding must be performed, the amount of arithmetic processing becomes extremely large, and is realized as software using a general-purpose integrated circuit. In such a case, there is a possibility that a problem may occur that the processing does not end in real time with the arithmetic processing capability of the circuit.

【００１０】また、従来の画像情報変換装置の画像符号
化装置１２０では、入力された高ビットレートの画像圧
縮情報に対して、より低い目標符号量（ターゲットビッ
トレート）が与えられるため、量子化幅が粗くなり、こ
れによって生じる量子化雑音が復号画像においてブロッ
クノイズとして現れ、良好な画質が得られなかった。Further, in the image encoding device 120 of the conventional image information conversion device, a lower target code amount (target bit rate) is given to the input high-bit-rate image compression information. The width became coarse, and the quantization noise generated by this appeared as block noise in the decoded image, and good image quality was not obtained.

【００１１】本発明の目的は、上記問題点を解決するも
ので、入力された多くの符号量（高いビットレート）の
画像圧縮情報（ビットストリーム）を、復号画像の画質
劣化を最小限に抑えながら符号量（ビットレート）を削
減して、少ない符号量（低いビットレート）の画像圧縮
情報を出力する画像情報変換装置及び画像情報変換方法
を提供するものである。SUMMARY OF THE INVENTION An object of the present invention is to solve the above-mentioned problems and reduce the deterioration of the image quality of a decoded image by minimizing the deterioration of the decoded image by inputting a large amount of code (high bit rate). It is an object of the present invention to provide an image information conversion apparatus and an image information conversion method for outputting image compression information with a small code amount (low bit rate) while reducing the code amount (bit rate).

【００１２】[0012]

【課題を解決するための手段】本発明にかかる画像情報
変換装置は、所定の画素ブロック（直交変換ブロック）
単位で直交変換するとともに直交変換して得られた上記
直交変換ブロック内の直交変換係数を量子化することに
より、画像信号を圧縮符号化をした第１のビットレート
の第１の画像圧縮情報を、上記第１のビットレートより
低い第２のビットレートの第２の画像圧縮情報に変換す
る画像情報変換装置であって、入力された第１の画像圧
縮情報の直交変換係数の量子化幅に応じて、この直交変
換係数を逆量子化する逆量子化手段と、出力する第２の
画像圧縮情報が上記第２のビットレートとなるような量
子化幅に応じて、上記逆量子化手段により逆量子化され
た直交変換係数を再量子化する量子化手段とを備えるこ
とを特徴とする。An image information conversion apparatus according to the present invention comprises a predetermined pixel block (orthogonal conversion block).
By performing orthogonal transform in units and quantizing orthogonal transform coefficients in the orthogonal transform block obtained by performing orthogonal transform, first image compression information of a first bit rate obtained by compression-coding an image signal is obtained. , An image information conversion device for converting the first image compression information into a second image compression information having a second bit rate lower than the first bit rate. In response, the inverse quantization means for inversely quantizing the orthogonal transform coefficient, and the inverse quantization means according to the quantization width such that the output second image compression information becomes the second bit rate. And a quantizing means for requantizing the inversely quantized orthogonal transform coefficient.

【００１３】この画像情報変換装置では、直交変換係数
を逆量子化した後量子化幅を変えて再量子化する。In this image information converter, the orthogonal transform coefficients are inversely quantized and then requantized by changing the quantization width.

【００１４】また、本発明にかかる画像情報変換装置
は、上記逆量子化手段により逆量子化された直交変換係
数の水平方向の高周波成分の値を制限する帯域制限手段
を備え、上記量子化手段は、出力する第２の画像圧縮情
報が上記第２のビットレートとなるような量子化幅に応
じて、上記帯域制限手段により水平方向の高周波成分が
制限された直交変換係数を再量子化することを特徴とす
る。The image information converting apparatus according to the present invention further comprises a band limiting unit for limiting a value of a high frequency component in a horizontal direction of the orthogonal transform coefficient inversely quantized by the inverse quantizing unit. Requantizes the orthogonal transform coefficient in which the high-frequency component in the horizontal direction is limited by the band limiting means according to the quantization width at which the output second image compression information becomes the second bit rate. It is characterized by the following.

【００１５】この画像情報変換装置では、直交変換係数
を逆量子化した後、直交変換係数の水平方向の高周波成
の値を制限し、量子化幅を変えて再量子化する。In this image information conversion apparatus, after the orthogonal transform coefficient is dequantized, the value of the high-frequency component of the orthogonal transform coefficient in the horizontal direction is limited, and the quantization width is changed to perform requantization.

【００１６】また、本発明にかかる画像情報変換装置
は、所定の画素ブロック（直交変換ブロック）単位で直
交変換するとともに直交変換して得られた上記直交変換
ブロック内の直交変換係数を量子化することにより、画
像信号を圧縮符号化をした第１のビットレートの第１の
画像圧縮情報を、上記第１のビットレートより低い第２
のビットレートの第２の画像圧縮情報に変換する画像情
報変換装置であって、入力された第１の画像圧縮情報の
直交変換係数の量子化幅に応じて、この直交変換係数を
逆量子化する第１の逆量子化手段と、上記第１の逆量子
化手段により逆量子化された直交変換係数と動き補償誤
差補正係数とを加算する加算手段と、出力する第２の画
像圧縮情報が上記第２のビットレートとなるような量子
化幅に応じて、上記加算手段により動き補償誤差係数が
加算された直交変換係数を再量子化する量子化手段と上
記量子化手段により再量子化された直交変換係数を逆量
子化する第２の逆量子化手段と、上記第２の逆量子化手
段により逆量子化された直交変換係数から上記加算手段
により動き補償補正係数が加算された直交変換係数を減
算する減算手段と、上記減算手段により減算した上記直
交変換係数を直交変換して動きベクトルに基づき動き補
償し、動き補償で得た値を逆直交変換をして上記動き補
償誤差補正係数を生成する動き補償誤差補正手段とを備
えることを特徴とする。Further, the image information conversion apparatus according to the present invention performs orthogonal transformation on a predetermined pixel block (orthogonal transformation block) basis and quantizes orthogonal transformation coefficients in the orthogonal transformation block obtained by orthogonal transformation. Accordingly, the first image compression information of the first bit rate obtained by compressing and encoding the image signal is converted to the second image compression information lower than the first bit rate.
An image information conversion device for converting the orthogonal transform coefficient into a second image compressed information having a bit rate of, wherein the orthogonal transform coefficient is dequantized in accordance with the quantization width of the orthogonal transform coefficient of the input first image compressed information. A first inverse quantizing means for performing the operation, an adding means for adding the orthogonal transform coefficient inversely quantized by the first inverse quantizing means and the motion compensation error correction coefficient, and a second image compression information to be output. In accordance with the quantization width to achieve the second bit rate, the quantizing means for requantizing the orthogonal transform coefficient to which the motion compensation error coefficient has been added by the adding means and the requantization by the quantizing means. A second inverse quantizer for inversely quantizing the orthogonal transform coefficient, and an orthogonal transform obtained by adding a motion compensation correction coefficient by the adder from the orthogonal transform coefficient inversely quantized by the second inverse quantizer. Subtraction means for subtracting coefficients Motion compensation error correction means for orthogonally transforming the orthogonal transform coefficient subtracted by the subtraction means, performing motion compensation based on a motion vector, and performing inverse orthogonal transform on the value obtained by the motion compensation to generate the motion compensation error correction coefficient. And characterized in that:

【００１７】この画像情報変換装置では、直交変換係数
を逆量子化した後量子化幅を変えて再量子化するととも
に、入力された直交変換係数と再量子化した直交変換係
数との差分を動き補償して動き補償による誤差を補正す
る。In this image information converter, the orthogonal transform coefficient is inversely quantized, requantized by changing the quantization width, and the difference between the input orthogonal transform coefficient and the requantized orthogonal transform coefficient is calculated by moving the difference. Compensation corrects the error due to motion compensation.

【００１８】また、本発明にかかる画像情報変換装置
は、上記第１の逆量子化手段により逆量子化された直交
変換係数の水平方向の高周波成分の値を制限する帯域制
限手段を備え、上記量子化手段は、出力する第２の画像
圧縮情報が上記第２のビットレートとなるような量子化
幅に応じて、上記帯域制限手段により水平方向の高周波
成分が制限された直交変換係数を再量子化することを特
徴とする。The image information conversion apparatus according to the present invention further comprises a band limiting unit for limiting a value of a high frequency component in a horizontal direction of the orthogonal transform coefficient inversely quantized by the first inverse quantization unit. The quantizing means regenerates the orthogonal transform coefficient of which the high-frequency component in the horizontal direction is limited by the band limiting means according to the quantization width at which the output second image compression information has the second bit rate. It is characterized by being quantized.

【００１９】この画像情報変換装置では、直交変換係数
を逆量子化した後直交変換係数の水平方向の高周波成の
値を制限し量子化幅を変えて再量子化するとともに、入
力された直交変換係数と再量子化した直交変換係数との
差分を動き補償して動き補償による誤差を補正する。In this image information conversion apparatus, after the orthogonal transform coefficients are dequantized, the values of the high-frequency components of the orthogonal transform coefficients in the horizontal direction are limited, the quantization width is changed, and requantization is performed. The difference between the coefficient and the requantized orthogonal transform coefficient is motion compensated to correct the error due to the motion compensation.

【００２０】また、本発明にかかる画像情報変換装置
は、上記直交変換係数は、水平方向８係数と垂直方向８
係数の８×８係数からなり、上記動き補償誤差補正手段
が、水平方向８係数及び垂直方向８係数からなる量子化
誤差係数に対して、４×８の逆直交変換を行い、空間領
域の量子化誤差値を生成する４×８逆直交変換部と、上
記４×８逆直交変換部により生成された空間領域の量子
化誤差値に対して、水平方向に１／４画素精度で動き補
償をし、垂直方向に１／２画素精度で動き補償をして、
空間領域の量子化誤差補正値を生成する動き補償部と、
上記動き補償部により生成された空間領域の量子化誤差
補正値に対して、４×８の直交変換を行い、周波数領域
の上記動き補償誤差補正係数を生成する４×８直交変換
部とを有することを特徴とする。Further, in the image information conversion apparatus according to the present invention, the orthogonal transform coefficient is determined by calculating a coefficient in the horizontal direction and a coefficient in the vertical direction.
The motion compensation error correction means performs 4 × 8 inverse orthogonal transform on the quantization error coefficient including 8 horizontal coefficients and 8 vertical coefficients, and performs quantization in the spatial domain. 4 × 8 inverse orthogonal transform unit for generating a quantization error value, and performing motion compensation with a １／ pixel precision in the horizontal direction on the spatial domain quantization error value generated by the 4 × 8 inverse orthogonal transform unit. And perform motion compensation in the vertical direction with half-pixel accuracy,
A motion compensation unit that generates a quantization error correction value in the spatial domain,
A 4 × 8 orthogonal transformation unit that performs 4 × 8 orthogonal transformation on the spatial domain quantization error correction value generated by the motion compensation unit and generates the motion compensation error correction coefficient in the frequency domain. It is characterized by the following.

【００２１】この画像情報変換装置では、直交変換係数
を逆量子化した後直交変換係数の水平方向の高周波成の
値を制限し量子化幅を変えて再量子化するとともに、入
力された直交変換係数と再量子化した直交変換係数との
差分の水平方向の低域４係数及び垂直方向の８係数と
を、動き補償して動き補償による誤差を補正する。In this image information conversion apparatus, the orthogonal transform coefficient is inversely quantized, the value of the high-frequency component of the orthogonal transform coefficient in the horizontal direction is limited, the quantization width is changed, and requantization is performed. The four low-frequency coefficients in the horizontal direction and the eight coefficients in the vertical direction of the difference between the coefficient and the requantized orthogonal transform coefficient are motion-compensated to correct an error due to motion compensation.

【００２２】本発明にかかる画像情報変換方法は、所定
の画素ブロック（直交変換ブロック）単位で直交変換す
るとともに直交変換して得られた上記直交変換ブロック
内の直交変換係数を量子化することにより、画像信号を
圧縮符号化をした第１のビットレートの第１の画像圧縮
情報を、上記第１のビットレートより低い第２のビット
レートの第２の画像圧縮情報に変換する画像情報変換方
法であって、上記第１のビットレートの第１の画像圧縮
情報を入力し、入力された第１の画像圧縮情報の直交変
換係数の量子化幅に応じて、この直交変換係数を逆量子
化し、出力する第２の画像圧縮情報が上記第２のビット
レートとなるような量子化幅に応じて、逆量子化された
上記直交変換係数を再量子化し、再量子化して生成した
第２の画像圧縮情報を出力することを特徴とする。An image information conversion method according to the present invention performs orthogonal transformation in units of a predetermined pixel block (orthogonal transformation block) and quantizes orthogonal transformation coefficients in the orthogonal transformation block obtained by orthogonal transformation. Image information conversion method for converting first image compression information of a first bit rate obtained by compression-coding an image signal into second image compression information of a second bit rate lower than the first bit rate And inputs the first image compression information of the first bit rate, and dequantizes the orthogonal transform coefficient according to the quantization width of the orthogonal transform coefficient of the input first image compression information. A second quantization coefficient generated by requantizing and requantizing the inversely-quantized orthogonal transform coefficient according to a quantization width at which the output second image compression information becomes the second bit rate. Image compression information And outputs a.

【００２３】この画像情報変換方法では、直交変換係数
を逆量子化した後量子化幅を変えて再量子化する。In this image information conversion method, the orthogonal transform coefficients are inversely quantized and then requantized by changing the quantization width.

【００２４】また、本発明にかかる画像情報変換方法
は、逆量子化された上記直交変換係数の水平方向の高周
波成分の値を制限し、出力する第２の画像圧縮情報が上
記第２のビットレートとなるような量子化幅に応じて、
水平方向の高周波成分が制限された上記直交変換係数を
再量子化することを特徴とする。Further, in the image information conversion method according to the present invention, the value of the high frequency component in the horizontal direction of the inversely quantized orthogonal transform coefficient is limited, and the second image compression information to be output is the second bit compression information. Depending on the quantization width that becomes the rate,
The orthogonal transform coefficient in which the high frequency component in the horizontal direction is restricted is requantized.

【００２５】この画像情報変換方法では、直交変換係数
を逆量子化した後、直交変換係数の水平方向の高周波成
の値を制限し、量子化幅を変えて再量子化する。In this image information conversion method, after the orthogonal transform coefficient is dequantized, the value of the high frequency component of the orthogonal transform coefficient in the horizontal direction is limited, and the quantization width is changed to perform requantization.

【００２６】また、本発明にかかる画像情報変換方法
は、所定の画素ブロック（直交変換ブロック）単位で直
交変換するとともに直交変換して得られた上記直交変換
ブロック内の直交変換係数を量子化することにより、画
像信号を圧縮符号化をした第１のビットレートの第１の
画像圧縮情報を、上記第１のビットレートより低い第２
のビットレートの第２の画像圧縮情報に変換する画像情
報変換方法であって、上記第１のビットレートの第１の
画像圧縮情報を入力し、入力された第１の画像圧縮情報
の直交変換係数の量子化幅に応じて、この直交変換係数
を逆量子化し、逆量子化された上記直交変換係数と動き
補償誤差補正係数とを加算し、出力する第２の画像圧縮
情報が上記第２のビットレートとなるような量子化幅に
応じて、動き補償誤差係数が加算された直交変換係数を
再量子化し、再量子化して生成した第２の画像圧縮情報
を出力し、再量子化された上記直交変換係数を逆量子化
し、逆量子化された直交変換係数から動き補償補正係数
が加算された直交変換係数を減算し、減算した直交変換
係数を直交変換して動きベクトルに基づき動き補償し、
動き補償をした値を逆直交変換をして上記動き補償誤差
補正係数を生成することを特徴とする。In the image information conversion method according to the present invention, the orthogonal transform is performed in units of a predetermined pixel block (orthogonal transform block) and the orthogonal transform coefficients in the orthogonal transform block obtained by the orthogonal transform are quantized. Accordingly, the first image compression information of the first bit rate obtained by compressing and encoding the image signal is converted to the second image compression information lower than the first bit rate.
An image information conversion method for converting the first image compression information of the first bit rate into an image information conversion method of converting the first image compression information of the first bit rate into orthogonal image data. The orthogonal transform coefficient is inversely quantized in accordance with the quantization width of the coefficient, the inversely quantized orthogonal transform coefficient is added to the motion compensation error correction coefficient, and the second image compression information to be output is the second image compression information. In accordance with a quantization width such that the bit rate becomes, the orthogonal transform coefficient to which the motion compensation error coefficient has been added is requantized, and the second image compression information generated by requantization is output. The orthogonal transform coefficient is inversely quantized, the orthogonal transform coefficient to which the motion compensation correction coefficient has been added is subtracted from the inversely quantized orthogonal transform coefficient, and the subtracted orthogonal transform coefficient is orthogonally transformed to perform motion compensation based on the motion vector. And
The motion compensated value is subjected to inverse orthogonal transformation to generate the motion compensation error correction coefficient.

【００２７】この画像情報変換方法では、直交変換係数
を逆量子化した後量子化幅を変えて再量子化するととも
に、入力された直交変換係数と再量子化した直交変換係
数との差分を動き補償して動き補償による誤差を補正す
る。In this image information conversion method, the orthogonal transform coefficient is inversely quantized and then requantized by changing the quantization width, and the difference between the input orthogonal transform coefficient and the requantized orthogonal transform coefficient is changed. Compensation corrects the error due to motion compensation.

【００２８】また、本発明にかかる画像情報変換方法
は、逆量子化された直交変換係数の水平方向の高周波成
分の値を制限し、出力する第２の画像圧縮情報が上記第
２のビットレートとなるような量子化幅に応じて、水平
方向の高周波成分が制限された直交変換係数を再量子化
することを特徴とする。In the image information conversion method according to the present invention, the value of the high frequency component in the horizontal direction of the inversely quantized orthogonal transform coefficient is limited, and the second image compression information to be output is the second bit rate. The orthogonal transform coefficient whose horizontal high frequency component is restricted is requantized in accordance with a quantization width such that

【００２９】この画像情報変換方法では、直交変換係数
を逆量子化した後直交変換係数の水平方向の高周波成の
値を制限し量子化幅を変えて再量子化するとともに、入
力された直交変換係数と再量子化した直交変換係数との
差分を動き補償して動き補償による誤差を補正する。In this image information conversion method, the orthogonal transform coefficient is inversely quantized, the value of the high frequency component of the orthogonal transform coefficient in the horizontal direction is restricted, the quantization width is changed, and requantization is performed. The difference between the coefficient and the requantized orthogonal transform coefficient is motion compensated to correct the error due to the motion compensation.

【００３０】また、本発明にかかる画像情報変換方法
は、上記直交変換係数が、水平方向８係数と垂直方向８
係数の８×８係数からなり、水平方向８係数及び垂直方
向８係数からなる量子化誤差係数に対して、４×８の逆
直交変換を行い、空間領域の量子化誤差値を生成し、上
記空間領域の量子化誤差値に対して、水平方向に１／４
画素精度で動き補償をし、垂直方向に１／２画素精度で
動き補償をして、空間領域の量子化誤差補正値を生成
し、上記空間領域の量子化誤差補正値に対して、４×８
の直交変換を行い、周波数領域の上記動き補償誤差補正
係数を生成することを特徴とする。Further, in the image information conversion method according to the present invention, the orthogonal transform coefficient may be defined as a horizontal eight coefficient and a vertical eight coefficient.
A 4 × 8 inverse orthogonal transform is performed on a quantization error coefficient composed of 8 × 8 coefficients and composed of 8 horizontal coefficients and 8 vertical coefficients to generate a quantization error value in the spatial domain. 1/4 in the horizontal direction with respect to the quantization error value in the spatial domain
Motion compensation is performed with pixel precision, motion compensation is performed in the vertical direction with 1/2 pixel precision, and a quantization error correction value in the spatial domain is generated. 8
Is performed to generate the motion compensation error correction coefficient in the frequency domain.

【００３１】この画像情報変換方法では、直交変換係数
を逆量子化した後直交変換係数の水平方向の高周波成の
値を制限し量子化幅を変えて再量子化するとともに、入
力された直交変換係数と再量子化した直交変換係数との
差分の水平方向の低域４係数及び垂直方向の８係数と
を、動き補償して動き補償による誤差を補正する。In this image information conversion method, after the orthogonal transform coefficients are dequantized, the values of the high-frequency components of the orthogonal transform coefficients in the horizontal direction are limited, the quantization width is changed, and requantization is performed. The four low-frequency coefficients in the horizontal direction and the eight coefficients in the vertical direction of the difference between the coefficient and the requantized orthogonal transform coefficient are motion-compensated to correct an error due to motion compensation.

【００３２】[0032]

【発明の実施の形態】以下、本発明の実施の形態とし
て、ＭＰＥＧ−２方式で符号化された画像圧縮情報（ビ
ットストリーム）の符号量（ビットレート）を削減する
画像情報変換装置について説明する。ＭＰＥＧ−２（Ｉ
ＳＯ／ＩＥＣ１３８１８−２）は、飛び越し走査画像
及び順次走査画像、並びに、標準解像度画像及び高解像
度画像の双方に対応した画像情報の圧縮方式である。BEST MODE FOR CARRYING OUT THE INVENTION As an embodiment of the present invention, an image information conversion apparatus for reducing the code amount (bit rate) of image compression information (bit stream) encoded by the MPEG-2 system will be described. . MPEG-2 (I
SO / IEC 13818-2) is a compression method of image information corresponding to interlaced scan images and progressive scan images, and both standard resolution images and high resolution images.

【００３３】まず、本発明の実施の形態の画像情報変換
装置を説明する前に、ＭＰＥＧ−２方式の画像圧縮情報
（ビットストリーム）を符号化するＭＰＥＧ−２画像情
報符号化装置、並びに、ＭＰＥＧ−２方式の画像圧縮情
報（ビットストリーム）の構成について説明する。Before describing an image information conversion apparatus according to an embodiment of the present invention, an MPEG-2 image information encoding apparatus for encoding image compression information (bit stream) of the MPEG-2 system, and an MPEG. The configuration of the image compression information (bit stream) of the -2 system will be described.

【００３４】（画像情報符号化装置）図１に、ＭＰＥＧ
−２画像情報符号化装置のブロック図を示す。(Image Information Encoding Apparatus) FIG.
2 shows a block diagram of an image information encoding device.

【００３５】ＭＰＥＧ−２画像情報符号化装置は、この
図１に示すように、画面並べ替えバッファ１と、加算器
２と、離散コサイン変換装置３と、量子化装置４と、可
変長符号化装置５と、符号バッファ６と、符号量制御装
置７と、逆量子化装置８と、逆離散コサイン変換装置９
と、加算器１０と、ビデオメモリ１１と、動き予測装置
１２とから構成される。As shown in FIG. 1, the MPEG-2 image information encoding apparatus includes a screen rearrangement buffer 1, an adder 2, a discrete cosine transform unit 3, a quantization unit 4, a variable length encoding Device 5, code buffer 6, code amount control device 7, inverse quantization device 8, inverse discrete cosine transform device 9
, An adder 10, a video memory 11, and a motion prediction device 12.

【００３６】画像並べ替えバッファ１には、ベースバン
ドのビデオデータが入力される。この画像並べ替えバッ
ファ１は、表示時間順に時系列に並んでいるフレーム順
序を、ＭＰＥＧ−２の符号化順序に応じたフレーム順序
に並び替える。The image rearranging buffer 1 receives baseband video data. The image rearrangement buffer 1 rearranges the order of frames arranged in chronological order in display time order into a frame order corresponding to the encoding order of MPEG-2.

【００３７】図２に、画面並べ替えバッファ１によるフ
レーム順序の並べ替え処理の一例を示す。FIG. 2 shows an example of a frame order rearrangement process performed by the screen rearrangement buffer 1.

【００３８】入力されたビデオデータの各フレームが、
図２（Ａ）に示すように、表示時間順に時系列に並んで
いたとする。このとき、ＭＰＥＧ−２画像情報符号化装
置では、まず、表示順序で３番目のフレームをフレーム
内符号化によって符号化する。次に、この３番目のフレ
ームを参照フレームとして、６番目のフレームを予測符
号化によって符号化し、更に、この３番目のフレームと
６番目のフレームを参照フレームとして、４番目、５番
目のフレームを予測符号化によって符号化するといった
符号化処理がなされる。そこで、この画像並べ替えバッ
ファ１では、図２（Ｂ）に示すように、フレームの並べ
替えを行う。ＭＰＥＧ−２では、フレーム内符号化が行
われた画像、及び、この画像を元にしてフレーム間符号
化が行われた画像群からなるＧＯＰ（Group Of Picture
s）が定義されている。このＭＰＥＧ−２画像符号化装
置では、フレーム内符号化によって符号化された画像
（以下これをＩピクチャと呼ぶ。）が１枚と、１つの参
照フレーム（時間軸上で処理されるフレームより過去に
あるもの）を予測フレームとしてフレーム間符号化によ
って符号化された画像（以下、これをＰピクチャと呼
ぶ。）が４枚と、２つの参照フレーム（時間軸上で、処
理されるフレームより過去にあるもの及び未来にあるも
の）を予測フレームとしてフレーム間符号化によって符
号化された画像（以下、これをＢピクチャと呼ぶ）が１
０枚の合計１５枚の画像から構成されるＧＯＰを符号化
するものとする。Each frame of the input video data is
As shown in FIG. 2A, it is assumed that the images are arranged in chronological order in display time. At this time, the MPEG-2 image information encoding device first encodes the third frame in the display order by intra-frame encoding. Next, the third frame is used as a reference frame, the sixth frame is encoded by predictive coding, and the third frame and the sixth frame are used as reference frames, and the fourth and fifth frames are used as reference frames. Encoding processing such as encoding by predictive encoding is performed. Therefore, the image rearrangement buffer 1 rearranges the frames as shown in FIG. In MPEG-2, a GOP (Group Of Picture) composed of an image subjected to intra-frame encoding and an image group subjected to inter-frame encoding based on the image.
s) is defined. In this MPEG-2 image encoding apparatus, one image encoded by intra-frame encoding (hereinafter referred to as an I picture) and one reference frame (a frame that is processed earlier on the time axis) Are referred to as prediction frames, and four images (hereinafter referred to as P pictures) coded by inter-frame coding and two reference frames (on the time axis, past the frame to be processed) , And a picture in the future (hereinafter referred to as a B picture) encoded by inter-frame
It is assumed that a GOP composed of a total of 15 images of 0 is encoded.

【００３９】このような画像並べ替えバッファ１により
並べ替えられた各フレームは、加算器２に送られる。Each frame rearranged by the image rearrangement buffer 1 is sent to the adder 2.

【００４０】加算器２は、Ｐピクチャ又はＢピクチャが
送られてきた場合には、そのフレームと、後述する動き
補償予測装置１２により生成されたこのフレームの予測
フレームとを減算して、離散コサイン変換装置３に送
る。また、加算器２は、Ｉピクチャが送られてきた場合
には、そのフレームから何らデータを減算せずに、その
フレームを離散コサイン変換装置３に送る。When a P-picture or a B-picture is sent, the adder 2 subtracts the frame and a predicted frame of this frame generated by a motion-compensated prediction unit 12 to be described later to obtain a discrete cosine. Send to conversion device 3. In addition, when an I picture is sent, the adder 2 sends the frame to the discrete cosine transform device 3 without subtracting any data from the frame.

【００４１】離散コサイン変換装置３は、各フレームを
８画素×８画素のブロック（ＤＣＴブロック）に分割
し、各ＤＣＴブロックに対し、水平方向、垂直方向それ
ぞれに８次の離散コサイン変換を施す。離散コサイン変
換を施すことにより、２次元の画像信号が直交変換さ
れ、空間周波数係数（離散コサイン変換係数）となる。
その際、入力されたビデオデータが例えば図３に示すよ
うなＹ：Ｕ：Ｖ＝４：２：０信号である場合、輝度信号
の１６画素×１６画素及び色差信号の８画素×８画素を
一つのブロック（以下、これをマクロブロックとい
う。）として処理を行う。この場合離散コサイン変換装
置３は、輝度信号に関しては、図４に示すようなフレー
ムを単位として８画素×８画素の離散コサイン変換を施
すモード（以下フレーム離散コサイン変換モードと呼
ぶ。）と、図５に示すような各マクロブロックのデータ
をフィールド毎に分離して８画素×８画素の離散コサイ
ン変換を施すモード（以下フィールド離散コサイン変換
モードと呼ぶ。）との２つのモードのうち、符号化効率
の良い方の離散コサイン変換を選択して行う。The discrete cosine transform unit 3 divides each frame into blocks of 8 pixels × 8 pixels (DCT blocks), and performs an 8th-order discrete cosine transform on each DCT block in the horizontal and vertical directions. By performing the discrete cosine transform, a two-dimensional image signal is subjected to an orthogonal transform to be a spatial frequency coefficient (discrete cosine transform coefficient).
At this time, if the input video data is, for example, a Y: U: V = 4: 2: 0 signal as shown in FIG. 3, 16 pixels × 16 pixels of a luminance signal and 8 pixels × 8 pixels of a color difference signal are used. The processing is performed as one block (hereinafter, referred to as a macro block). In this case, the discrete cosine transform device 3 performs a discrete cosine transform of 8 × 8 pixels on a frame basis as shown in FIG. 4 (hereinafter, referred to as a frame discrete cosine transform mode) for the luminance signal. 5 is a mode in which the data of each macro block is separated for each field as shown in FIG. 5 and discrete cosine transform of 8 pixels × 8 pixels is performed (hereinafter referred to as a field discrete cosine transform mode). The more efficient discrete cosine transform is selected and performed.

【００４２】離散コサイン変換装置３は、離散コサイン
変換を施して得られたデータ（離散コサイン変換係数）
を量子化装置４に送る。The discrete cosine transform device 3 obtains data (discrete cosine transform coefficients) obtained by performing discrete cosine transform.
To the quantizer 4.

【００４３】量子化装置４は、水平方向、垂直方向の８
次の離散コサイン変換係数に対して、所定の量子化行列
を用いて量子化を行う。この量子化行列の量子化スケー
ルは、後段の符号バッファ６がオーバフロー及びアンダ
ーフロー等しないように、符号量制御装置７により予め
定められた目標符号量（ターゲットビットレート）に基
づきフィードバック制御される。量子化装置４は、量子
化した離散コサイン係数を可変長符号化装置５及び逆量
子化装置８に送る。The quantizing device 4 has 8 horizontal and vertical directions.
The next discrete cosine transform coefficient is quantized using a predetermined quantization matrix. The quantization scale of the quantization matrix is feedback-controlled by the code amount control device 7 based on a predetermined target code amount (target bit rate) so that the subsequent code buffer 6 does not overflow or underflow. The quantization device 4 sends the quantized discrete cosine coefficients to the variable length coding device 5 and the inverse quantization device 8.

【００４４】可変長復号化装置５は、量子化した離散コ
サイン変換係数を、平均符号長が短くなるように可変長
符号化する。その際、可変長復号化装置５は、離散コサ
イン変換係数の直流成分に関しては、１ブロック前の直
流成分係数を予測値としてその差分を符号化し、その他
の成分に関しては、予め設定された走査方式（ジグザグ
スキャン若しくはオルタネートスキャン）に基づいて１
次元の配列データに並べ替えた後、連続する０係数の数
（ラン）及び非０係数（レベル）のペアを事象とした符
号化を行う。The variable length decoding device 5 performs variable length coding on the quantized discrete cosine transform coefficients so that the average code length becomes short. At this time, the variable-length decoding device 5 encodes the difference with the DC component coefficient of the previous block as a prediction value for the DC component of the discrete cosine transform coefficient, and encodes the difference for the other components using a preset scanning method. 1 based on (zigzag scan or alternate scan)
After rearrangement into the dimensional array data, encoding is performed using a pair of the number of consecutive 0 coefficients (run) and the non-zero coefficient (level) as an event.

【００４５】図６（Ａ）にジグザグスキャンによる水平
方向及び垂直方向の８次の離散コサイン変換係数に対す
るスキャン順序を示し、図６（Ｂ）にオルタネートスキ
ャンによる水平方向及び垂直方向の８次の離散コサイン
変換係数に対するスキャン順序を示す。量子化装置４
は、ＤＣＴブロック内のスキャンを行っている際に、そ
れ以降の係数の値が全て０となった場合、ＥＯＢ（End
Of Block）と呼ばれる符号を出力し、そのブロックに対
する可変長符号化を終了する。FIG. 6A shows the scanning order for the 8th-order discrete cosine transform coefficients in the horizontal and vertical directions by zigzag scanning, and FIG. 6B shows the 8th-order discrete signals in the horizontal and vertical directions by the alternate scanning. 3 shows a scan order for cosine transform coefficients. Quantizer 4
EOB (End) when scanning in the DCT block and all coefficient values thereafter become 0
A code called “Of Block” is output, and the variable-length coding for the block ends.

【００４６】そして、可変長符号化装置５により可変長
符号化された離散コサイン変換係数は符号バッファ６に
送られ、この符号バッファ６に一時格納されたのち、Ｍ
ＰＥＧ−２に規定されたビットストリーム構造とされ
て、圧縮画像情報として出力される。The discrete cosine transform coefficients subjected to variable length coding by the variable length coding device 5 are sent to the code buffer 6 and temporarily stored in the code buffer 6.
A bit stream structure defined in PEG-2 is output as compressed image information.

【００４７】一方、量子化装置４により量子化されたデ
ータは、逆量子化装置８にも送られる。この逆量子化装
置８は、送られたデータを逆量子化して、逆離散コサイ
ン変換装置９に送る。On the other hand, the data quantized by the quantization device 4 is also sent to the inverse quantization device 8. The inverse quantization device 8 inversely quantizes the transmitted data and sends the data to the inverse discrete cosine transform device 9.

【００４８】逆離散コサイン変換装置９は、逆量子化し
た離散コサイン変換係数に対して、逆離散コサイン変換
を施す。逆離散コサイン変換を施して得られた画素値
は、加算器１０を介して、ビデオメモリ１１に格納され
る。The inverse discrete cosine transform unit 9 performs an inverse discrete cosine transform on the inversely quantized discrete cosine transform coefficients. The pixel value obtained by performing the inverse discrete cosine transform is stored in the video memory 11 via the adder 10.

【００４９】加算器１０は、逆離散コサイン変換装置８
からＰピクチャが送られてきた場合には、そのフレーム
と、後述する動き補償予測装置１２により生成されたこ
のフレームの参照フレームとを加算して、ビデオメモリ
１１に格納する。また、加算器１０は、Ｉピクチャが送
られてきた場合には、そのフレームに何らデータ加算せ
ずに、ビデオメモリ１１に格納する。The adder 10 includes an inverse discrete cosine transform device 8
, A P-picture is sent from the video memory 11 and the frame is added to a reference frame of this frame generated by the motion compensation prediction device 12 described later and stored in the video memory 11. When an I picture is sent, the adder 10 stores the I picture in the video memory 11 without adding any data to the frame.

【００５０】動き補償予測装置１２は、Ｐピクチャ及び
Ｂピクチャに対するマクロブロックを単位とした１／２
画素精度の動きベクトル情報を、ブロックマッチング法
などの方式により検出する。動きベクトル情報は、符号
バッファ６から出力される画像圧縮情報（ビットストリ
ーム）に格納されて外部に送出される。また、これとと
もに、動き補償予測装置１２は、検出したベクトル情報
に基づきビデオメモリ１１に格納された画素データに対
して動き補償を行ってＰピクチャ及びＢピクチャの予測
フレームを生成し、その予測フレームを加算器２に供給
する。The motion-compensated prediction device 12 uses a half of a macroblock for a P picture and a B picture as a unit.
Pixel-accurate motion vector information is detected by a method such as a block matching method. The motion vector information is stored in the image compression information (bit stream) output from the code buffer 6 and sent out. At the same time, the motion compensation prediction device 12 performs motion compensation on the pixel data stored in the video memory 11 based on the detected vector information to generate prediction frames for P-pictures and B-pictures. Is supplied to the adder 2.

【００５１】Ｉピクチャに関してはフレーム内予測符号
化を行うので動きベクトル情報は検出されないが、Ｐピ
クチャ及びＢピクチャに関しても、動きベクトルに基づ
くフレーム間予測による符号化効率が高くない場合に
は、マクロブロック単位で、フレーム内予測を行うこと
も可能である。以下、フレーム内予測に基づいて符号化
されるマクロブロックをイントラマクロブロック、フレ
ーム間予測に基づいて符号化されるマクロブロックをイ
ンターマクロブロックと呼ぶ。Ｉピクチャに含まれるマ
クロブロックは全てイントラマクロブロックである。Since the I-picture is subjected to intra-frame prediction coding, no motion vector information is detected. However, for P-pictures and B-pictures, if the coding efficiency by inter-frame prediction based on motion vectors is not high, the macro It is also possible to perform intra-frame prediction in block units. Hereinafter, a macroblock encoded based on intra-frame prediction is called an intra macroblock, and a macroblock encoded based on inter-frame prediction is called an inter-macroblock. All macroblocks included in the I picture are intra macroblocks.

【００５２】以上のように、この図１に示したＭＰＥＧ
−２画像情報符号化装置では、入力されたベースバンド
のビデオデータを符号化して、画像圧縮情報（ビットス
トリーム）を出力する。As described above, the MPEG shown in FIG.
The -2 image information encoding device encodes the input baseband video data and outputs image compression information (bit stream).

【００５３】つぎに、上述した量子化装置４における量
子化処理について更に詳しく述べる。Next, the quantization processing in the quantization device 4 will be described in more detail.

【００５４】まず、イントラマクロブロックの直流係数
に関しては、以下の式（１）に基づいて量子化される。First, the DC coefficient of an intra macroblock is quantized based on the following equation (1).

【００５５】[0055]

【数１】 (Equation 1)

【００５６】この式（１）に示した“Ｆ′′［０］
［０］”は直流係数値の量子化代表値であり、“ＱＦ
［０］［０］”は直流係数値の量子化代表値レベル番号
である。逆量子化係数（intra dc mult）は、図７に示
すように、各フレーム毎に設定可能な変数であり、直流
係数のビット精度を表すイントラＤＣ精度（intra dc p
recision）によって設定される。"F" [0] shown in the equation (1)
[0] "is a quantization representative value of the DC coefficient value, and" QF
[0] [0] "is a quantization representative value level number of the DC coefficient value. The inverse quantization coefficient (intra dc mult) is a variable that can be set for each frame as shown in FIG. Intra DC precision (intra dc p) representing the bit precision of the DC coefficient
recision).

【００５７】イントラマクロブロックの直流係数以外の
係数に関しては、以下の式（２）に基づいて量子化され
る。The coefficients other than the DC coefficient of the intra macroblock are quantized based on the following equation (2).

【００５８】[0058]

【数２】 (Equation 2)

【００５９】この式（２）に示した“Ｆ′′［ｕ］
［ｖ］”は第（ｕ，ｖ）係数値の量子化代表値であり、
“ＱＦ［ｕ］［ｖ］”は第（ｕ，ｖ）係数値の量子化代
表値レベル番号である。ｋは以下の式（３）によって与
えられる。"F" [u] shown in the equation (2)
[V] "is a quantization representative value of the (u, v) th coefficient value,
“QF [u] [v]” is a quantization representative value level number of the (u, v) th coefficient value. k is given by the following equation (3).

【００６０】[0060]

【数３】 (Equation 3)

【００６１】また、“Ｗ［ｗ］［ｖ］［ｕ］”は量子化
行列であり、“ｑｕａｎｔｉｓｅｒ＿ｓｃａｌｅ”は量
子化スケールである。"W [w] [v] [u]" is a quantization matrix, and "quantizer_scale" is a quantization scale.

【００６２】上記量子化行列は、ＤＣＴブロック内の離
散コサイン変換係数値間での相対的な量子化精度を設定
するために設けられた行列である。この量子化行列は、
視覚的に劣化が目に付きにくい高域成分の離散コサイン
変換係数を粗く量子化し、視覚的に劣化が目に付き易い
低域成分の離散コサイン変換係数を細かく量子化するこ
とができる。そのため、この量子化行列を用いることに
より画質劣化を最小限に抑えながら符号量を削減するこ
とが可能となる。The above-mentioned quantization matrix is a matrix provided for setting the relative quantization accuracy between the discrete cosine transform coefficient values in the DCT block. This quantization matrix is
It is possible to roughly quantize the discrete cosine transform coefficient of a high-frequency component whose deterioration is hardly noticeable visually, and finely quantize the discrete cosine transform coefficient of a low-frequency component whose visual deterioration is easily noticeable. Therefore, by using this quantization matrix, it is possible to reduce the code amount while minimizing image quality deterioration.

【００６３】この量子化行列は、各フレーム毎に設定す
ることが可能である。設定されない場合には、イントラ
マクロブロックについては図８（Ａ）に示すようなデフ
ォルト値に設定された量子化行列が用いられ、インター
マクロブロックについては図８（Ｂ）に示すようなデフ
ォルト値に設定された量子化行列が用いられる。This quantization matrix can be set for each frame. If not set, a quantization matrix set to a default value as shown in FIG. 8A is used for an intra macroblock, and a default value as shown in FIG. 8B for an inter macroblock. The set quantization matrix is used.

【００６４】また、上記ｑｕａｎｔｉｓｅｒ＿ｓｃａｌ
ｅ（量子化スケール）は、ＭＰＥＧ２画像圧縮情報（ビ
ットストリーム）の構文中で、フレーム毎に定義される
ｑ＿ｓｃａｌｅ＿ｔｙｐｅ（量子化スケールタイプ）
と、マクロブロック毎に定義されるｑ＿ｓｃａｌｅ＿ｃ
ｏｄｅ（量子化スケールコード）とによって、図９に示
すように定義される。各マクロブロックに対するｑ＿ｓ
ｃａｌｅ＿ｃｏｄｅの割り当ては、符号量制御装置７に
よって行われる。Also, the above quantizer_scal
e (quantization scale) is q_scale_type (quantization scale type) defined for each frame in the syntax of MPEG2 image compression information (bit stream).
And q_scale_c defined for each macroblock
mode (quantization scale code) as shown in FIG. Q_s for each macroblock
The assignment of the call_code is performed by the code amount control device 7.

【００６５】つぎに、符号量制御装置７における処理に
ついて更に詳しく述べる。Next, the processing in the code amount control device 7 will be described in more detail.

【００６６】符号量制御装置７における処理について
は、ＭＰＥＧ２（ＩＳＯ／ＩＥＣ１３８１８−２）で
は特に定められておらず、符号化装置の開発者が、出力
符号バッファ６においてオーバーフロー及びアンダーフ
ローを起こさないという拘束条件の下に、同じ符号量で
より良好な画質が得られるように自由に設定する。ここ
では、ＭＰＥＧ２ＴｅｓｔＭｏｄｅｌ５（ＩＳＯ
／ＩＥＣＪＴＣ１／ＳＣ２９／ＷＧ１１Ｎ０４０
０）で述べられている一般的な処理について、図１０に
示すフローチャートを用いて述べる。The processing in the code amount control device 7 is not particularly defined in MPEG2 (ISO / IEC 13818-2), and the developer of the coding device does not cause overflow and underflow in the output code buffer 6. Under such a constraint condition, it is set freely so that better image quality can be obtained with the same code amount. Here, MPEG2 Test Model 5 (ISO
/ IEC JTC1 / SC29 / WG11 N040
The general processing described in (0) will be described with reference to the flowchart shown in FIG.

【００６７】符号量制御装置７は、図１０に示すよう
に、目標符号量（ターゲットビットレート）及びＧＯＰ
構成要素を設定し、この目標符号量及びＧＯＰ構成要素
を入力変数として、まずステップＳ１として各ピクチャ
ヘのビット配分を行い、続いてステップＳ２として仮想
バッファを用いたレート制御を行い、最後に、ステップ
Ｓ３として視覚特性を考慮した適応量子化を行う。符号
量制御装置７は、これらの処理を、出力符号バッファ６
の占有量（バッファフルネス）を監視しながら行う。As shown in FIG. 10, the code amount control device 7 controls the target code amount (target bit rate) and the GOP
The components are set, and the target code amount and the GOP components are used as input variables. First, bit allocation to each picture is performed in step S1, and rate control using a virtual buffer is performed in step S2. As S3, adaptive quantization is performed in consideration of visual characteristics. The code amount control device 7 performs these processes on the output code buffer 6
While monitoring the occupancy (buffer fullness).

【００６８】まず、上記ステップＳ１では、割り当て対
象ピクチャを含めＧＯＰ内でまだ符号化されていないピ
クチャに対して割り当てられるビット量（以下、これを
Ｒとする）を配分して、ＧＯＰ内の各ピクチャに対して
符号量を割り当てる。この符号量の割り当てをＧＯＰ内
のピクチャ順に繰り返し行う。その際、以下に述べる第
１と第２の２つの仮定に基づいて各ピクチャヘの符号量
の割り当てを行う。First, in step S1, a bit amount (hereinafter, referred to as R) to be allocated to a picture which has not been coded in the GOP including the picture to be allocated is allocated, and each bit in the GOP is allocated. A code amount is assigned to a picture. This code amount allocation is repeated in the order of the pictures in the GOP. At this time, a code amount is assigned to each picture based on the first and second assumptions described below.

【００６９】第１に、各ピクチャを符号化する際に用い
る平均量子化スケールコードと発生符号量の積（画面の
複雑さ）は画面が変化しない限りピクチャタイプ毎に一
定値となる、と仮定する。そして、各ピクチャを符号化
した後、各ピクチャタイプ毎に、画面の複雑さを示す変
数Ｘｉ，Ｘｐ，Ｘｂ（grobal complexity measure）を
以下の式（４）に基づき更新する。First, it is assumed that the product of the average quantization scale code and the generated code amount (complexity of the screen) used when encoding each picture is a fixed value for each picture type unless the screen changes. I do. After encoding each picture, the variables Xi, Xp, and Xb (global complexity measure) indicating the complexity of the screen are updated for each picture type based on the following equation (4).

【００７０】[0070]

【数４】 (Equation 4)

【００７１】この式（４）に示した“Ｓｉ”，“Ｓ
ｐ”，“Ｓｂ”はピクチャ符号化時の発生符号ビット量
であり、“Ｑｉ”，“Ｑｐ”，“Ｑｂ”はピクチャ符号
化時の平均量子化スケールコードである。The "Si" and "S" shown in the equation (4)
“p” and “Sb” are generated code bit amounts at the time of picture coding, and “Qi”, “Qp” and “Qb” are average quantization scale codes at the time of picture coding.

【００７２】画面の複雑さを示す変数Ｘｉ，Ｘｐ，Ｘｂ
の初期値は、目標符号量（ターゲットビットレート）ｂ
ｉｔ＿ｒａｔｅ［ｂｉｔｓ／ｓｅｃ］を用いて、以下の
式（５）、式（６）、式（７）で示される値とする。Variables Xi, Xp, Xb indicating screen complexity
Is the target code amount (target bit rate) b
Using it_rate [bits / sec], the values are represented by the following equations (5), (6), and (7).

【００７３】[0073]

【数５】 (Equation 5)

【００７４】[0074]

【数６】 (Equation 6)

【００７５】[0075]

【数７】 (Equation 7)

【００７６】第２に、Ｉピクチャの量子化スケールコー
ドを基準としたＰ，Ｂピクチャの量子化スケールコード
の比率Ｋｐ，Ｋｂが以下の式（８）に定めた値となる場
合に常に全体の画質が最適化される、と仮定する。Second, when the ratios Kp and Kb of the quantized scale codes of the P and B pictures with respect to the quantized scale code of the I picture become the values defined by the following equation (8), the whole is always obtained. Assume that the image quality is optimized.

【００７７】[0077]

【数８】 (Equation 8)

【００７８】以上第１及び第２の２つの仮定から、ＧＯ
Ｐの各ピクチャに対する割り当てビット量（Ｔｉ，Ｔ
ｐ，Ｔｂ）は、以下の式（９）、式（１０）、式（１
１）に示す値とする。From the above first and second assumptions, GO
The amount of bits allocated to each picture of P (Ti, T
p, Tb) are calculated by the following equations (9), (10), and (1).
1).

【００７９】[0079]

【数９】 (Equation 9)

【００８０】[0080]

【数１０】 (Equation 10)

【００８１】[0081]

【数１１】 [Equation 11]

【００８２】この式（９）〜（１１）に示した“Ｎ
ｐ”，“Ｎｂ”はＧＯＰ内でまだ符号化されていない
Ｐ，Ｂピクチャの枚数である。"N" shown in equations (9) to (11)
“p” and “Nb” are the numbers of P and B pictures that have not been encoded in the GOP.

【００８３】このようにして求めた割当符号量を基にし
て、ＧＯＰ内の未符号化ピクチャに対して割り当てるビ
ット量Ｒを、各ピクチャを符号化する毎に、以下の式
（１２）により更新する。Based on the allocated code amount thus obtained, the bit amount R allocated to the uncoded picture in the GOP is updated by the following equation (12) every time each picture is coded. I do.

【００８４】[0084]

【数１２】 (Equation 12)

【００８５】また、ＧＯＰの最初のピクチャを符号化す
る際には、以下の（１３）によりＲを更新する。When encoding the first picture of the GOP, R is updated by the following (13).

【００８６】[0086]

【数１３】 (Equation 13)

【００８７】この式（１３）で示した“Ｎ”はＧＯＰ内
のピクチャ数である。また、シーケンスの最初のＲを０
とする。"N" shown in equation (13) is the number of pictures in the GOP. Also, set the first R in the sequence to 0.
And

【００８８】続いて、ステップＳ２では、実際の発生符
号量とステップ１で求められた各ピクチャに対する割当
ビット量（Ｔｉ，Ｔｐ，Ｔｂ）と一致させるため、各ピ
クチャタイプに独立に設定した３種類の仮想バッファの
容量に基づき、量子化スケールコードをマクロブロック
単位のフィードバック制御により求める。Subsequently, in step S2, in order to match the actual generated code amount with the allocated bit amount (Ti, Tp, Tb) for each picture obtained in step 1, three types independently set for each picture type are set. , The quantization scale code is obtained by feedback control in units of macroblocks based on the capacity of the virtual buffer.

【００８９】まず、ｊ番目のマクロブロック符号化に先
だって、仮想バッファの占有量を以下の式（１４）、式
（１５）、式（１６）により求める。First, prior to the j-th macroblock coding, the occupancy of the virtual buffer is obtained by the following equations (14), (15), and (16).

【００９０】[0090]

【数１４】 [Equation 14]

【００９１】[0091]

【数１５】 (Equation 15)

【００９２】[0092]

【数１６】 (Equation 16)

【００９３】式（１４）〜（１６）で示した“ｄ₀ ⁱ”，
“ｄ₀ ^p”，“ｄ₀ ^b”はＩ，Ｐ，Ｂの各ピクチャの仮想バ
ッファの初期占有量であり、“Ｂｊ”はピクチャの先頭
からｊ番目のマクロブロックまでの発生ビット量であ
り、“ＭＢ＿ｃｎｔ”は１ピクチャ内のマクロブロック
数である。ピクチャ符号化終了時の各仮想バッファ占有
量（ｄ_{MB_cnt} ⁱ，ｄ_{MB_cnt} ^p，ｄ_{MB_cnt} ^b）は、それぞれ
同一のピクチャタイプで、次のピクチャに対する仮想バ
ッファ占有量の初期値（ｄ₀ ⁱ，ｄ₀ ^p，ｄ₀ ^b）として用い
られる。“D ₀ ⁱ ” shown in the equations (14) to (16),
“D ₀ ^p ” and “d ₀ ^b ” are the initial occupancy of the virtual buffer of each of the I, P, and B pictures, and “Bj” is the amount of generated bits from the head of the picture to the j-th macroblock. , “MB_cnt” is the number of macroblocks in one picture. The virtual buffer occupancy at the end of picture encoding (d _{MB_cnt} ⁱ , d _{MB_cnt} ^p , d _{MB_cnt} ^b ) is the same as the picture type, and the initial value (d ₀ ⁱ , d) of the virtual buffer occupancy for the next picture ₀ ^p , d ₀ ^b ).

【００９４】次にｊ番目のマクロブロックに対する量子
化スケールコードを、以下の式（１７）により計算す
る。Next, the quantization scale code for the j-th macroblock is calculated by the following equation (17).

【００９５】[0095]

【数１７】 [Equation 17]

【００９６】この式（１７）で示した“ｒ”はリアクシ
ョンパラメーターと呼ばれるフィードバックループの応
答を制御する変数であり、以下の式（１８）により与え
られる。"R" shown in the equation (17) is a variable called a reaction parameter for controlling the response of the feedback loop, and is given by the following equation (18).

【００９７】[0097]

【数１８】 (Equation 18)

【００９８】なお、符号化開始時における仮想バッファ
の初期値は以下の式（１９）で与えられる。The initial value of the virtual buffer at the start of encoding is given by the following equation (19).

【００９９】[0099]

【数１９】 [Equation 19]

【０１００】最後に、ステップ３では、視覚的に劣化の
目立ちやすい平坦部でより細かく量子化し、劣化の比較
的目立ちにくい絵柄の複雑な部分で粗く量子化するよう
に、ステップ２で求められた量子化スケールコードを、
各マクロブロック毎のアクティビティと呼ばれる変数に
よって変化させる。Finally, in step 3, the quantization was made finer in a flat portion where the deterioration is visually conspicuous, and coarsely quantized in a complicated portion of the pattern where the deterioration is relatively inconspicuous. Quantization scale code,
It is changed by a variable called activity for each macroblock.

【０１０１】アクティビティａｃｔ_jは、原画の輝度信
号画素値を用い、フレーム離散コサイン変換モードにお
ける４個のブロックと、フィールド離散コサイン変換モ
ードにおける４個のブロックとの、合計８ブロックの画
素値を用いて、以下の式（２０）、式（２１）、式（２
２）で与えられる。The activity act _j uses the pixel values of the luminance signal of the original image, and uses the pixel values of a total of eight blocks of four blocks in the frame discrete cosine transform mode and four blocks in the field discrete cosine transform mode. Therefore, the following expressions (20), (21), and (2)
Given in 2).

【０１０２】[0102]

【数２０】 (Equation 20)

【０１０３】[0103]

【数２１】 (Equation 21)

【０１０４】[0104]

【数２２】 (Equation 22)

【０１０５】式（２０）、式（２１）、式（２２）で示
したＰｋは原画の輝度信号ブロック内画素値である。式
（２０）において最小値を採るのは、マクロブロック内
の一部だけでも平坦部分のある場合には量子化を細かく
するためである。Pk shown in Expressions (20), (21), and (22) is a pixel value in a luminance signal block of an original image. The reason for taking the minimum value in the equation (20) is to make the quantization finer when there is a flat portion even in only a part of the macroblock.

【０１０６】更に、以下の式（２３）により、その値が
０．５〜２の範囲を取る正規化アクティビティＮａｃｔ
_jを求める。Further, according to the following equation (23), the normalized activity Nact whose value ranges from 0.5 to 2.
_{Find j} .

【０１０７】[0107]

【数２３】 (Equation 23)

【０１０８】この式（２３）で示したａｖｇ＿ａｃｔは
直前に符号化したピクチャでのａｃｔ_jの平均値であ
る。Avg_act shown in the equation (23) is the average value of act _j in the picture coded immediately before.

【０１０９】視覚特性を考慮した量子化スケールコード
ｍｑｕａｎｔ_jはステップ２で得られた量子化スケール
コードＱｊに基づき以下の式（２４）で与えられる。The quantization scale code mquant _j considering the visual characteristics is given by the following equation (24) based on the quantization scale code Qj obtained in step 2.

【０１１０】[0110]

【数２４】 (Equation 24)

【０１１１】符号化制御装置７では、以上のようなステ
ップＳＳ１〜ステップＳ３の処理を行い、出力符号バッ
ファ６においてオーバーフロー及びアンダーフローを起
こさず、同じ符号量でより良好な画質が得られるように
自由に設定する。The encoding control device 7 performs the processing of steps SS1 to S3 as described above so that overflow and underflow do not occur in the output code buffer 6 and a better image quality can be obtained with the same code amount. Set freely.

【０１１２】つぎに、出力符号バッファ６における拘束
条件について述べる。Next, the constraint conditions in the output code buffer 6 will be described.

【０１１３】ＭＰＥＧ２（ＩＳＯ／ＩＥＣ１３８１８
−２）では、ＶＢＶ（Video BufferVerifier）バッファ
と呼ばれる仮想バッファが復号化装置に対して定義され
ている。符号量制御装置７では、この画像符号化装置の
出力である画像圧縮情報（ビットストリーム）が、この
拘束条件を満たすように符号の割当が行われる。ＶＢＶ
バッファには、符号化装置とＶＢＶバッファとは完全に
同期して動作するという第１の理想条件と、各ピクチャ
の復号は瞬時に行われ各ピクチャのデータはＶＢＶバッ
ファより瞬時に抜き取られるという第２の理想条件との
２つの理想条件の下で動作することが仮定されている。
なお、ＶＢＶバッファを定義する目的は、あくまでこの
モデルによって画像圧縮情報（ビットストリーム）に対
する規定を加えることであり、現実の復号化装置の実現
方式を規定するものではない。MPEG2 (ISO / IEC 13818)
In -2), a virtual buffer called a VBV (Video Buffer Verifier) buffer is defined for the decoding device. The code amount control device 7 allocates codes so that the image compression information (bit stream) output from the image encoding device satisfies the constraint condition. VBV
The buffer has a first ideal condition that the encoding apparatus and the VBV buffer operate completely synchronously, and a second ideal condition that decoding of each picture is performed instantaneously and data of each picture is instantaneously extracted from the VBV buffer. It is assumed to operate under two ideal conditions and two ideal conditions.
Note that the purpose of defining the VBV buffer is to add a definition for the image compression information (bit stream) based on this model, but does not specify an actual implementation method of the decoding device.

【０１１４】ＶＢＶバッファは、単位時間当たりの符号
量が一定の場合（固定レート：ＣＢＲ［Constant Bit R
ate］）及び単位時間当たりの符号量が一定ではない場
合（可変レート：ＶＢＲ［Variable Bit Rate］）の双
方に対応しており、固定レートは可変レートの特別な場
合として定められている。A VBV buffer has a constant code amount per unit time (fixed rate: CBR [Constant Bit R]).
ate]) and the case where the code amount per unit time is not constant (variable rate: VBR [Variable Bit Rate]), and the fixed rate is defined as a special case of the variable rate.

【０１１５】また、様々な応用に対応するため、ＶＢＶ
バッファに対するデータの入力は以下の第１の方法と第
２の方法の２種類の方法で規定することが可能となって
いる。第１の方法では、入力レートがピクチャ単位で変
化するよう規定される。第２の方法では、入力レートが
ＶＢＶバッファの状態に応じてピークレート若しくは０
のいずれかの値を取るように規定される。いずれの方法
も、そのピークレートＲｍａｘ〔ｂｉｔ／ｓｅｃ］が、
ＭＰＥＧ２による画像圧縮情報の構文（シンタクス）に
おいて、シーケンスヘッダと呼ばれる部分のパラメータ
ビットレートで指定される。２種類の方法の区別は、各
フレーム毎に定義されるｖｂｖ＿ｄｅｌａｙと呼ばれる
変数によってされる。Further, in order to cope with various applications, VBV
Data input to the buffer can be defined by the following two methods, a first method and a second method. In the first method, the input rate is defined to change on a picture-by-picture basis. In the second method, the input rate is set to the peak rate or 0 according to the state of the VBV buffer.
Is defined to take one of the following values. In either method, the peak rate Rmax [bit / sec] is
In the syntax (syntax) of image compression information according to MPEG2, it is specified by a parameter bit rate of a part called a sequence header. The distinction between the two methods is made by a variable called vbv_delay defined for each frame.

【０１１６】各ピクチャのｖｂｖ＿ｄｅｌａｙが全て最
大値“０ｘＦＦＦＦ”でない場合、ｖｂｖ＿ｄｅｌａｙ
は、各ピクチャのスタートコードがＶＢＶバッファに入
力されてから復号されるまでの時間を示す。この場合、
ＶＢＶバッファへのデータの入力レートは、ピクチャデ
ータ単位で可変レートとなる。ｎ番目のピクチャデータ
がＶＢＶバッファに入力されるレートＲ（ｎ）［ｂｉｔ
／ｓｅｃ］は、以下の式（２５）で定められる。If the vbv_delay of each picture is not the maximum value “0xFFFF”, vbv_delay
Indicates the time from when the start code of each picture is input to the VBV buffer until it is decoded. in this case,
The input rate of data to the VBV buffer is variable on a picture data basis. Rate R (n) [bit at which the n-th picture data is input to the VBV buffer
/ Sec] is determined by the following equation (25).

【０１１７】[0117]

【数２５】 (Equation 25)

【０１１８】この式（２５）に示したｄｎ^*は、ｎ番目
のピクチャに対するスタートコードから、（ｎ＋１）番
目のピクチャに対するスタートコードまでの符号量であ
る。τ（ｎ）は、ｎ番目のピクチャに対して設定された
変数ｖｂｖ＿ｄｅｌａｙの値であり、ｔ（ｎ）はｎ番目
のピクチャに対する値がＶＢＶバッファより抜き取られ
る時刻である。固定レート（ＣＢＲ）データは、Ｒ
（ｎ）がｎによらず一定の値である特別な場合としてこ
の規定に含まれることになる。Dn ^* shown in the equation (25) is the code amount from the start code for the nth picture to the start code for the (n + 1) th picture. τ (n) is the value of the variable vbv_delay set for the nth picture, and t (n) is the time at which the value for the nth picture is extracted from the VBV buffer. The fixed rate (CBR) data is R
A special case where (n) is a constant value irrespective of n is included in this specification.

【０１１９】図１１に、ＶＢＶバッファのビット占有量
の変化を示す。ＶＢＶバッファは、初期状態において、
そのビット占有量が０となっている。ＶＢＶバッファ
は、最初のピクチャのスタートコードの入力時点から、
変数ｖｂｖ＿ｄｅｌａｙが示す時間の経過の後に復号が
開始される。引き続き定めれた復号時間問隔で、各ピク
チャの復号とＶＢＶバッファの検証を継続する。FIG. 11 shows changes in the bit occupancy of the VBV buffer. The VBV buffer is initially
The bit occupancy is 0. The VBV buffer starts from the time when the start code of the first picture is input.
Decoding starts after the lapse of the time indicated by the variable vbv_delay. The decoding of each picture and the verification of the VBV buffer are continued at the determined decoding time interval.

【０１２０】各ピクチャのｖｂｖ＿ｄｅｌａｙが全て最
大値“０ｘＦＦＦＦ”の場合、ＶＢＶバッファに対する
データの入力レートは、このＶＢＶバッファのビット占
有量で決定される。この入力レートは、ＶＢＶバッファ
に空きがあるときにはＲｍａｘとなり、空きのない場合
は０となる。When the vbv_delay of each picture is the maximum value “0xFFFF”, the data input rate to the VBV buffer is determined by the bit occupancy of the VBV buffer. This input rate becomes Rmax when there is a vacancy in the VBV buffer, and becomes 0 when there is no vacancy.

【０１２１】この場合、ＶＢＶバッファは、初期状態に
おいて、ビット占有量が０となっている。バッファに空
きがなくなった時点で復号を開始する。引き続き、定め
られた復号時間間隔で各ピクチャの復号とＶＢＶバッフ
ァの検証を継続する。In this case, the bit occupancy of the VBV buffer is 0 in the initial state. Decoding starts when the buffer becomes full. Subsequently, decoding of each picture and verification of the VBV buffer are continued at a predetermined decoding time interval.

【０１２２】画像圧縮情報（ビットストリーム）は、以
上の規定に従ってＶＢＶバッファに対するデータの入力
を行う。その際に、画像圧縮情報（ビットストリーム）
は、ピクチャデータをＶＢＶバッファから瞬時に抜き取
る直前にＶＢＶバッファがオーバーフローしないという
条件（条件（１））、ピクチャデータをＶＢＶバッファ
から瞬時に抜き取った直後におけるＶＢＶバッファがア
ンダーフローをしないという条件（条件（２））、とい
う２つの条件を満たすよう検証がされて符号化される。As for the image compression information (bit stream), data is input to the VBV buffer in accordance with the above rules. At that time, image compression information (bit stream)
The condition is that the VBV buffer does not overflow immediately before picture data is instantaneously extracted from the VBV buffer (condition (1)), and that the VBV buffer does not underflow immediately after picture data is instantaneously extracted from the VBV buffer (condition (1)). (2)), verification is performed so as to satisfy the two conditions, and encoding is performed.

【０１２３】なお、上記条件（２）に関しては、変数ｌ
ｏｗ−ｄｅｌａｙで指定される低遅延モードの場合とそ
うでない場合で検証内容が異なる。低遅延モードとは、
Ｂピクチャの使用を禁ずるモードである。この低遅延モ
ードは、符号化及び復号化の際のピクチャの並べ替えに
よって生じる遅延を防ぐとともに、符号化ピクチャでの
スキップ（駒落し）を許してＶＢＶバッファでの遅延を
少なくする。Incidentally, regarding the above condition (2), the variable l
The verification content differs between the case of the low delay mode specified by ow-delay and the case of not being the low delay mode. What is low-latency mode?
This mode prohibits the use of B pictures. This low delay mode prevents delay caused by rearrangement of pictures at the time of encoding and decoding, and allows skipping (dropping of frames) in encoded pictures to reduce delay in the VBV buffer.

【０１２４】低遅延モードでない場合には、ＶＢＶバッ
ファからピクチャデータを瞬時に抜き取った直後にＶＢ
Ｖバッファがアンダーフローを起こしてはならないよう
に検証がされる。If the mode is not the low-delay mode, the VBV
It is verified that the V-buffer must not underflow.

【０１２５】低遅延モードの場合には、仮にピクチャデ
ータを抜き取ると、ＶＢＶバッファがアンダーフローを
起こすことを許す。この場合には、その時刻でのピクチ
ャデータを抜き取らず、２フィールド間隔毎に全てのピ
クチャデータが到達したかどうかの検証を繰り返し、初
めて到達した時点でピクチャデータを抜き取る。In the low delay mode, if picture data is extracted, the VBV buffer is allowed to underflow. In this case, without extracting the picture data at that time, verification of whether or not all the picture data has arrived is repeated every two-field interval, and the picture data is extracted when the picture data arrives for the first time.

【０１２６】従って、低遅延モードの場合も含め、１ピ
クチャのデータ量はＶＢＶバッファサイズより小さくな
ければならない。Therefore, the data amount of one picture must be smaller than the VBV buffer size including the case of the low delay mode.

【０１２７】（ＭＰＥＧ−２方式の画像圧縮情報の構
成）つぎに、図１に示すＭＰＥＧ−２画像情報符号化装
置により符号化されたＭＰＥＧ２画像圧縮情報（ビット
ストリーム）のデータ構成について説明する。(Configuration of MPEG-2 Image Compression Information) Next, the data configuration of MPEG2 image compression information (bit stream) encoded by the MPEG-2 image information encoding apparatus shown in FIG. 1 will be described.

【０１２８】図１２に、ＭＰＥＧ２画像圧縮情報（ビッ
トストリーム）のデータ構成を模式的に示す。FIG. 12 schematically shows the data structure of MPEG2 image compression information (bit stream).

【０１２９】ＭＰＥＧ２画像圧縮情報（ビットストリー
ム）は、この図１２に示すように階層構造となってお
り、上位層から順に、シーケンス（Sequence）層、ＧＯ
Ｐ（Group of Pictures）層、ピクチャ（Picture）層、
スライス（Slice）層、マクロブロック（Macro Block）
層、ブロック（Block）層から構成される。The MPEG2 image compression information (bit stream) has a hierarchical structure as shown in FIG. 12, and a sequence (Sequence) layer, GO
P (Group of Pictures) layer, Picture (Picture) layer,
Slice layer, macro block (Macro Block)
Layer and block layer.

【０１３０】シーケンス層からスライス層までの各階層
では、“０ｘ０００００１”に続く８ビット（１バイ
ト）のユニークなスタートコード（合計３２ビット（４
バイト））が予め定義されている。ＭＰＥＧ２画像圧縮
情報（ビットストリーム）では、このスタートコードに
より各層が区別され、また、このスタートコードがエラ
ー回復ポイントになっている。各階層におけるスタート
コードのコード値を図１３に示す。In each layer from the sequence layer to the slice layer, a unique start code of 8 bits (1 byte) following “0x000001” (total 32 bits (4 bytes))
Bytes)) are predefined. In the MPEG2 image compression information (bit stream), each layer is distinguished by this start code, and this start code is an error recovery point. FIG. 13 shows the code values of the start code in each layer.

【０１３１】ＭＰＥＧ２画像圧縮情報（ビットストリー
ム）の各階層のブロック構成を図１４に示し、以下、こ
のＭＰＥＧ２画像圧縮情報データ構成についてさらに詳
細に説明する。FIG. 14 shows a block configuration of each layer of the MPEG2 image compression information (bit stream). Hereinafter, this MPEG2 image compression information data configuration will be described in more detail.

【０１３２】シーケンス層は、この図１４に示すよう
に、画像圧縮情報（ビットストリーム）の一番上位に位
置する階層で、シーケンスヘッダで始まり、シーケンス
エンドコード（sequence_end_code）で終了する。この
シーケンス層には、基本的に１つ以上のＧＯＰ層が含ま
れる。このシーケンス層では、シーケンス拡張部、及
び、拡張部＆ユーザデータ（１）が含まれており、この
シーケンス拡張のスタートコード（extension_start_co
de）の有無でＭＰＥＧ１とＭＰＥＧ２の画像圧縮情報
（ビットストリーム）の分類が行われる。As shown in FIG. 14, the sequence layer is the highest layer of the image compression information (bit stream), and starts with a sequence header and ends with a sequence end code (sequence_end_code). This sequence layer basically includes one or more GOP layers. This sequence layer includes a sequence extension part and extension part & user data (1), and a start code (extension_start_co.
Based on the presence or absence of de), MPEG1 and MPEG2 image compression information (bit stream) is classified.

【０１３３】シーケンスヘッダには、図１５に示すよう
に、シーケンスヘッダコード（sequence_header_cod
e）、水平方向画素数情報（horizontal_size_value）、
垂直方向ライン数情報（vertical_size_value）、画素
アスペクト比情報（aspect_ratio_information）、フレ
ームレートコード（frame_rate_code）、ビットレート
情報（bit_rate_value）、ＶＢＶバッファサイズ情報
（vbv_buffer_size_value）、イントラマクロブロック
用量子化行列（intra_quantiser_matrix[64]）、インタ
ーマクロブロック用量子化行列（non_intra_quantiser_
matrix[64]）等のシーケンス単位で設定される情報が含
まれる。As shown in FIG. 15, a sequence header code (sequence_header_cod
e), horizontal pixel count information (horizontal_size_value),
Vertical line number information (vertical_size_value), pixel aspect ratio information (aspect_ratio_information), frame rate code (frame_rate_code), bit rate information (bit_rate_value), VBV buffer size information (vbv_buffer_size_value), quantization matrix for intra macroblock (intra_quantiser_matrix [64]) ]), Inter-macroblock quantization matrix (non_intra_quantiser_
matrix [64]) and other information set in sequence units.

【０１３４】シーケンス拡張部には、シーケンス拡張部
のスタートコード（extension_start_code）の後に、図
１６に示すように、ＭＰＥＧ２で用いられているプロフ
ァイル，レベル表示情報（profile_and_level_indicati
on）、シーケンス全体の順次走査画像フラグ（progress
ive_sequence）、色差フォーマット（chroma_forma
t）、低遅延モード（low_delay）、及び、ＭＰＥＧ１と
共有しているシーケンスヘッダでの画素数や符号量（ビ
ットレート）等の上限をＭＰＥＧ２用に拡張するための
付加データが含まれている。After the start code (extension_start_code) of the sequence extension unit, as shown in FIG. 16, the profile extension and level display information (profile_and_level_indicati) used in MPEG2 are added to the sequence extension unit.
on), progressive scan image flag (progress
ive_sequence), chroma format (chroma_forma)
t), low-delay mode (low_delay), and additional data for extending the upper limit of the number of pixels and code amount (bit rate) in the sequence header shared with MPEG1 for MPEG2.

【０１３５】拡張部＆ユーザデータ（１）には、図１７
に示すように、拡張データ（１）（extension_data
(1)）、シーケンス表示拡張部（）（sequence_display_
extension()）、シーケンススケーラブル拡張部（）（s
equence_scalable_extension）、シーケンススケーラブ
ル拡張ＩＤ（extension_start_code_identifier）、ス
ケーラビリティモード情報（scalable_mode）、スケー
ラブル階層のレイヤＩＤ（layer_id）が含まれている。
シーケンス表示拡張部（）には、原信号のＲＧＢ変換特
性や表示画像サイズ等の情報が含まれている。また、シ
ーケンススケーラブル拡張部（）には、スケーラビリテ
ィモード（空間、ＳＮＲ、テンポラル、データパーティ
ション）、スケーラビリティのレイヤの設定がされる。
また、拡張部＆ユーザデータ（１）には、空間スケーラ
ビリティとテンポラルスケーラビリティとでそれぞれ以
下の変数が設定される。空間スケーラビリティの場合に
は、予測用下位レイヤの水平サイズ（lower_layer_pred
iction_horizontal_size）、予測用下位レイヤの垂直サ
イズ（lower_layer_prediction_vertical_size）、垂直
方向アップサンプル用除数（vertical_subsampling_fac
tor_n）が含まれ、テンポラルスケーラビリティの場合
には、第１ベースレイヤ画像前の付加レイヤ画像数（pi
cture_mux_order）、ベースレイヤ間の付加レイヤの画
像数（picture_mux_factor）、ユーザデータ（）（user
_data()）、ユーザデータ（user_data）が含まれる。ユ
ーザデータ（）（user_data()）には、シーケンスレベ
ルでのユーザデータが自由に設定できる。FIG. 17 shows the extension and user data (1).
As shown in, the extension data (1) (extension_data
(1)), sequence display extension () (sequence_display_
extension ()), sequence scalable extension () (s
equence_scalable_extension), a sequence scalable extension ID (extension_start_code_identifier), scalability mode information (scalable_mode), and a layer ID (layer_id) of a scalable hierarchy.
The sequence display extension section () contains information such as the RGB conversion characteristics of the original signal and the display image size. In the sequence scalable extension unit (), a scalability mode (space, SNR, temporal, data partition) and a scalability layer are set.
Further, the following variables are set in the extension part & user data (1) for spatial scalability and temporal scalability. In the case of spatial scalability, the horizontal size of the lower layer for prediction (lower_layer_pred
iction_horizontal_size), prediction lower-layer vertical size (lower_layer_prediction_vertical_size), vertical up-sample divisor (vertical_subsampling_fac)
tor_n), and in the case of temporal scalability, the number of additional layer images before the first base layer image (pi
cture_mux_order), the number of pictures in the additional layer between the base layers (picture_mux_factor), user data () (user
_data ()) and user data (user_data). User data at the sequence level can be freely set in the user data () (user_data ()).

【０１３６】続いて、ＧＯＰ層は、図１４に示すよう
に、シーケンス層の下の階層に位置し、ＧＯＰヘッダに
始まり、拡張部＆ユーザデータ（２）と、１つ以上のピ
クチャ層から構成される。ＧＯＰ層の最初のピクチャ
は、参照画面を用いずに独立して符号化（イントラ符号
化）されるＩピクチャとなる。このため、このＧＯＰ
は、このＩピクチャを用いることによりランダムアクセ
スを行うエントリーポイントとして用いられる。例えば
１つのＧＯＰを１０〜１５枚のピクチャで構成した場
合、毎秒３０フレームのビデオデータであれば０．３〜
０．５秒毎にランダムアクセスポイントが存在すること
になる。また、ＧＯＰの長さは任意であるため、シーン
の切り替わりポイントでＧＯＰを開始又は終了して符号
化効率を高めることも可能である。なお、Ｉピクチャは
非常に大きな符号量となり占有バッファ量も大きくなる
ため、低遅延特性が要求される通信などのアプリケーシ
ョンでも用いることができるように、ＭＰＥＧ２では、
ＧＯＰ層を省略することが可能となっている。この場
合、例えば、フレーム内符号化を行うスライス（以下、
イントラスライスと呼ぶ）を用い、各スライスのイント
ラ符号化データを数画面に分けて巡回させることによ
り、Ｉピクチャの代用となるとともに、どの画面のバッ
ファ占有量も平均的に低くすることが可能である。Subsequently, as shown in FIG. 14, the GOP layer is located in the lower layer of the sequence layer, starts with the GOP header, and includes an extension & user data (2) and one or more picture layers. Is done. The first picture in the GOP layer is an I picture that is independently coded (intra-coded) without using a reference screen. Therefore, this GOP
Are used as entry points for random access by using this I picture. For example, when one GOP is composed of 10 to 15 pictures, if the video data is 30 frames per second, the GOP is 0.3 to 15 pictures.
There will be a random access point every 0.5 seconds. Further, since the length of the GOP is arbitrary, it is possible to start or end the GOP at the scene switching point to increase the coding efficiency. In addition, since the I picture has a very large code amount and a large occupied buffer amount, MPEG2 uses an I-picture so that it can be used in applications such as communication requiring low delay characteristics.
The GOP layer can be omitted. In this case, for example, a slice for performing intra-frame encoding (hereinafter, referred to as a slice)
Intra-encoded data of each slice is divided into several screens and circulated, thereby making it possible to substitute for I-pictures and reduce the buffer occupancy of any screen on average. is there.

【０１３７】ＧＯＰヘッダには、図１８に示すように、
ＧＯＰヘッダのスタートコード（group_start_code）の
後に、タイムコード（time_code）、ＧＯＰの独立性を
示すフラグ（closed_gop）、ＧＯＰ内Ｉピクチャ前のＢ
ピクチャの正当性を示すフラグ（broken_link）が含ま
れている。ＧＯＰ内Ｉピクチャ前のＢピクチャの正当性
を示すフラグ（broken_link）は、例えば前ＧＯＰが編
集によりカットされたこと等による、ＧＯＰ内の最初の
Ｉピクチャより前にあるＢピクチャが正確に復号できな
いことを示している。また、ＧＯＰの独立性を示すフラ
グ（closed_gop）は、当該ＧＯＰが他のＧＯＰ内の画像
に依存しないを示している。これらのコードはランダム
アクセス時の不当なＢピクチャ表示の回避や、編集でＧ
ＯＰをカットした際のフラグとして用いられる。As shown in FIG. 18, in the GOP header,
After the start code (group_start_code) of the GOP header, a time code (time_code), a flag (closed_gop) indicating the independence of the GOP, and a B before the I picture in the GOP
A flag (broken_link) indicating the validity of the picture is included. The flag (broken_link) indicating the validity of the B picture before the I picture in the GOP indicates that the B picture before the first I picture in the GOP cannot be decoded correctly due to, for example, the previous GOP being cut by editing. It is shown that. A flag (closed_gop) indicating the independence of a GOP indicates that the GOP does not depend on an image in another GOP. These codes can be used to avoid illegal B picture display at random access and
Used as a flag when the OP is cut.

【０１３８】拡張部＆ユーザデータ（２）には、図１９
に示すように、拡張データ（２）（extension_data
(2)）、ユーザデータ（）（user_data()）、ユーザデー
タ（user_data）が含まれ、ＧＯＰレベルでのユーザデ
ータの設定がされる。The extension part and user data (2) include FIG.
As shown in, the extension data (2) (extension_data
(2)), user data () (user_data ()), and user data (user_data) are included, and user data is set at the GOP level.

【０１３９】続いて、ピクチャ層は、図１４に示すよう
に、ＧＯＰ層の下の階層に位置し、各画面に相当する層
である。ピクチャ層は、ピクチャヘッダ、ピクチャ符号
化拡張部、拡張部＆ユーザデータ（３）、ピクチャデー
タが含まれる。このピクチャ層は図１２で示したよう
に、１個以上のスライス層に分割されている。Subsequently, the picture layer is a layer located below the GOP layer and corresponding to each screen, as shown in FIG. The picture layer includes a picture header, a picture coding extension, an extension & user data (3), and picture data. This picture layer is divided into one or more slice layers as shown in FIG.

【０１４０】ピクチャヘッダには、図２０に示すよう
に、ピクチャスタートコード（picture_start_code）、
テンポラルリファレンス（temporal_reference）、ピク
チャ符号化タイプ情報（picture_coding_type）、符号
開始までのＶＢＶ遅延量情報（vbv_delay）が含まれて
いる。テンポラルリファレンス（temporal_reference）
は、ＧＯＰ内における当該ピクチャの表示順序を示す画
像番号で、ＧＯＰ毎に初期値が０にリセットされる。ピ
クチャの符号化タイプ情報（picture_coding_type）に
は、ピクチャ毎のＩ，Ｐ，Ｂの区別が設定される。ま
た、符号開始までのＶＢＶ遅延量情報（vbv_delay）
は、先に述べたＶＢＶバッファによる復号開始時刻まで
の遅延量が設定される。The picture header includes a picture start code (picture_start_code), as shown in FIG.
It includes a temporal reference (temporal_reference), picture coding type information (picture_coding_type), and VBV delay amount information (vbv_delay) until the start of coding. Temporal reference (temporal_reference)
Is an image number indicating the display order of the picture in the GOP, and the initial value is reset to 0 for each GOP. In the picture coding type information (picture_coding_type), the distinction between I, P, and B for each picture is set. Also, VBV delay amount information until code start (vbv_delay)
Is set to the amount of delay until the decoding start time by the VBV buffer described above.

【０１４１】ピクチャ符号化拡張部には、図２１に示す
ように、動きベクトル範囲情報（f_code[s][t]）、イン
トラマクロブロックのＤＣ係数精度情報（intra_dc_pre
cision）、ピクチャ構造情報（picture_structure）、
表示フィールドの指定情報（top_field_first）、フレ
ーム予測及びフレームＤＣＴフラグ（frame_pred_frame
_dct）、イントラマクロブロック用コンシールメント動
きベクトルフラグ（concealment_motion_vector）、量
子化スケールタイプ情報（q_scale_type）、イントラマ
クロブロック用可変長符号タイプ情報（intra_vlc_form
at）、スキャンタイプ情報（alternate_scan）、２：３
プルダウン用フィールドリピート情報（repeat_first_f
ield）、４２０タイプ情報（chroma_420_type）、順次
走査フラグ（progressive_frame）が含まれている。動
きベクトル範囲情報（f_code[s][t]）には、前・後方向
及び水平・垂直方向の動きベクトル範囲が設定される。
ピクチャ構造情報（picture_structure）には、フィー
ルド毎に符号化・復号化処理を行うか（以下、フィール
ド構造と呼ぶ）、フレーム毎に符号化・復号化処理を行
うか（以下、フレーム構造と呼ぶ）が設定される。ま
た、イントラマクロブロック用可変長符号タイプ情報
（intra_vlc_format）には、線形・非線型量子化スケー
ルの設定がされ、スキャンタイプ情報（alternate_sca
n）には、そのピクチャの可変長符号化処理のタイプの
選択（オルタネートスキャンか、ジグザグスキャンかの
選択）の設定がされる。As shown in FIG. 21, the picture coding extension unit includes motion vector range information (f_code [s] [t]) and DC coefficient precision information (intra_dc_pre
cision), picture structure information (picture_structure),
Display field designation information (top_field_first), frame prediction and frame DCT flag (frame_pred_frame)
_dct), concealment motion vector flag for intra macroblock (concealment_motion_vector), quantization scale type information (q_scale_type), variable length code type information for intra macroblock (intra_vlc_form)
at), scan type information (alternate_scan), 2: 3
Field repeat information for pulldown (repeat_first_f
ield), 420 type information (chroma_420_type), and a progressive scan flag (progressive_frame). In the motion vector range information (f_code [s] [t]), the motion vector ranges in the forward / backward and horizontal / vertical directions are set.
In the picture structure information (picture_structure), whether encoding / decoding processing is performed for each field (hereinafter, referred to as a field structure) or encoding / decoding processing is performed for each frame (hereinafter, referred to as a frame structure) Is set. In addition, the linear / non-linear quantization scale is set in the variable length code type information for intra macroblock (intra_vlc_format), and the scan type information (alternate_sca
In n), the selection of the type of the variable-length encoding process for the picture (selection between alternate scan and zigzag scan) is set.

【０１４２】拡張部＆ユーザデータ（３）には、図２２
に示すように、拡張データ（３）（extension_data
(3)）、量子化行列拡張部（）（quant_matrix_extensio
n()）、イントラマクロブロック量子化行列（intra_qua
ntiser_matrix[64]）、インターマクロブロック量子化
行列（non_intra_quantiser_matrix[64]）、色差イント
ラマクロブロック量子化行列（chroma_intra_quantiser
_matrix[64]）、色差インターマクロブロック量子化行
列（chroma_non_intra_quantiser_matrix[64]）、著作
権拡張部（）（copyright_extension()）、ピクチャ表
示拡張部（）（picture_display_extension()）、ピク
チャ空間スケーラブル拡張部（）（picture_spatial_sc
alable_extention()）、アップサンプル用空間重み付け
テーブル（spatial_temporal_weight_code_table_inde
x）、下位レイヤ順次走査画像フラグ（lower_layer_pro
gressive_frame）、下位レイヤのフィールド選択情報
（lower_layer_deinterlaced_field_select）、ピクチ
ャテンポラルスケーラブル拡張部（）（picture_tempor
al_scalable_extention()）、参照画面の選択コード（r
eference_select_code）、前方向予測用の下位レイヤの
画像番号（forward_temporal_reference）、後方向予測
用の下位レイヤの画像番号（backward_tempral_referen
ce）、ユーザデータ（）（user_data()）、ユーザデー
タ（user_data）が含まれる。The extension part & user data (3) includes FIG.
As shown in, the extension data (3) (extension_data
(3)), quantization matrix extension part () (quant_matrix_extensio
n ()), intra macroblock quantization matrix (intra_qua
ntiser_matrix [64]), inter macroblock quantization matrix (non_intra_quantiser_matrix [64]), chroma intra macroblock quantization matrix (chroma_intra_quantiser)
_matrix [64]), chroma inter-macroblock quantization matrix (chroma_non_intra_quantiser_matrix [64]), copyright extension () (copyright_extension ()), picture display extension () (picture_display_extension ()), picture space scalable extension ( ) (Picture_spatial_sc
alable_extention ()), spatial weighting table for upsampling (spatial_temporal_weight_code_table_inde
x), lower layer progressive scan image flag (lower_layer_pro
gressive_frame), lower layer field selection information (lower_layer_deinterlaced_field_select), picture temporal scalable extension () (picture_tempor
al_scalable_extention ()), reference screen selection code (r
eference_select_code), the image number of the lower layer for forward prediction (forward_temporal_reference), the image number of the lower layer for backward prediction (backward_tempral_referen
ce), user data () (user_data ()), and user data (user_data).

【０１４３】量子化行列拡張部（）（quant_matrix_ext
ension()）には、量子化マトリクスの設定がされる。こ
のため、ピクチャ毎にイントラ、インター、輝度、色差
ブロックの量子化行列を変更することも可能であり、各
画面の特性に応じた量子化を行うことが可能となる。著
作権拡張部（）（copyright_extension()）には、著作
権番号の設定がされる。ピクチャ表示拡張部（）（pict
ure_display_extension()）には、表示領域の設定がさ
れる。これにより、例えば１６：９の高解像度画像（Ｈ
ＤＴＶ）を復号し、最も興味にある部分に対して４：３
の標準解像度テレビサイズでその領域を指定して、標準
解像度テレビで高解像度画像の一部表示を行うといった
パン−スキャン機能を実現することができる。ピクチャ
空間スケーラブル拡張部（）（picture_spatial_scalab
le_extention()）やピクチャテンポラルスケーラブル拡
張部（）（picture_temporal_scalable_extention()）
は、空間スケーラビリティやテンポラルスケーラビリテ
ィで、上位レイヤの予測に用いる下位レイヤの画像番号
などの情報が設定される。Quantization matrix extension () (quant_matrix_ext)
In the extension (), a quantization matrix is set. Therefore, the quantization matrix of the intra, inter, luminance, and chrominance blocks can be changed for each picture, and quantization according to the characteristics of each screen can be performed. A copyright number is set in the copyright extension () (copyright_extension ()). Picture display extension () (pict
In ure_display_extension ()), a display area is set. Thus, for example, a 16: 9 high-resolution image (H
DTV) and decode the 4: 3
, A pan-scan function such as displaying a part of a high-resolution image on a standard resolution television by designating the area with the standard resolution television size can be realized. Picture space scalable extension () (picture_spatial_scalab
le_extention ()) and picture temporal scalable extension () (picture_temporal_scalable_extention ())
Is a space scalability or a temporal scalability, and information such as an image number of a lower layer used for prediction of an upper layer is set.

【０１４４】続いて、スライス層は、図１４に示すよう
に、ピクチャ層の下の階層に位置する層である。スライ
ス層は、図２３のように、画面内で分割された横長の帯
状の領域（図２３のＡ，Ｂ，Ｃ・・・Ｑ領域）から構成
され、図２４に示すようなスライスデータとして定義さ
れる。画面を複数のスライス層で構成することにより、
あるスライス層でエラーが発生しても、次のスライス層
の開始コード（slice_start_code）からの同期でエラー
回復が可能となる。スライス層は、１個以上のマクロブ
ロックから構成され、ラスタスキャンオーダで、左から
右、上から下へと並ぶ。その長さや開始位置は自由で、
画面毎に変更可能である。但し、並列処理や効果的なエ
ラー耐性を目的として、一つのスライスは右方向にのみ
に伸び、下方にまで伸びることはない。Subsequently, the slice layer is a layer located at a lower layer than the picture layer, as shown in FIG. As shown in FIG. 23, the slice layer is composed of a horizontally long band-like area (A, B, C... Q area in FIG. 23) divided in the screen, and is defined as slice data as shown in FIG. Is done. By configuring the screen with multiple slice layers,
Even if an error occurs in a certain slice layer, the error can be recovered by synchronization from the start code (slice_start_code) of the next slice layer. The slice layer is composed of one or more macroblocks, and is arranged in a raster scan order from left to right and from top to bottom. Its length and starting position are free,
It can be changed for each screen. However, for the purpose of parallel processing and effective error tolerance, one slice extends only to the right and does not extend downward.

【０１４５】スライスデータには、図２５に示すよう
に、スライス層のスタートコード（slice_start_cod
e）、スライス垂直位置拡張情報（slice_vertical_posi
tion_extension）、データパーティショニング用区分点
情報（priority_breakpoint）、量子化スケールコード
（quantiser_scale_code）、イントラスライスフラグ
（intra_slice）、マクロブロックデータ（macrobloc
k()）とが含まれる。スライス層のスタートコード（sli
ce_start_code）、スライス垂直位置拡張情報（slice_v
ertical_position_extension）には、スライスの開始の
垂直位置が設定される。量子化スケールコード（quanti
ser_scale_code）には、図９に示した量子化スケールコ
ードが設定される。イントラスライスフラグ（intra_sl
ice）には、スライス内のマクロブロックが全てイント
ラマクロブロックであるかどうかが設定される。The slice data includes a slice layer start code (slice_start_cod) as shown in FIG.
e), slice vertical position extension information (slice_vertical_posi)
tion_extension), partitioning point information for data partitioning (priority_breakpoint), quantization scale code (quantizer_scale_code), intra slice flag (intra_slice), macroblock data (macrobloc)
k ()). Start code of slice layer (sli
ce_start_code), slice vertical position extension information (slice_v
In “ertical_position_extension”, a vertical position at the start of the slice is set. Quantization scale code (quanti
In ser_scale_code), the quantization scale code shown in FIG. 9 is set. Intra slice flag (intra_sl
In ice), it is set whether or not all the macroblocks in the slice are intra macroblocks.

【０１４６】続いて、マクロブロック層は、図１４に示
すように、スライス層の下の階層に位置する層である。
マクロブロック層には、マクロブロック情報、ブロック
データが含まれている。Subsequently, the macroblock layer is a layer located at a lower layer than the slice layer, as shown in FIG.
The macroblock layer includes macroblock information and block data.

【０１４７】マクロブロックデータには、図２６に示す
ように、マクロブロックのアドレス拡張データ（macrob
lock_escape）、現マクロブロックと前マクロブロック
アドレスの差分量（macroblock_address_increment）、
マクロブロックモード（）（macroblock_modes()）、マ
クロブロック符号化タイプ情報（macroblock_type）、
アップサンプル用時空間重み付けコード（spatial_temp
oral_weight_code）、フレーム構造の動き補償タイプ情
報（frame_motion_type）、フィールド構造の動き補償
タイプ情報（field_motion_type）、ＤＣＴタイプ情報
（dct_type）、マクロブロック量子化スケールコード
（quantiser_scale_code）、動きベクトル（ｓ）（moti
on_vectors[s]）、予測に用いる参照フィールドの選択
情報（motion_vertical_field_select[r][s]）、動きベ
クトル（ｒ，ｓ）（motion_vectors[s][r]）、基本差分
動きベクトル（motion_code[r][s][t] ）、残差ベクト
ル（motion_residual[r][s][t]）デュアルプライム用差
分ベクトル（dmvector[t]）、ＣＢＰ（coded_block_pat
tern）、ブロックデータ（block(i)）が含まれている。As shown in FIG. 26, the macro block data includes address extension data (macrob
lock_escape), the difference between the current and previous macroblock addresses (macroblock_address_increment),
Macroblock modes () (macroblock_modes ()), macroblock coding type information (macroblock_type),
Spatio-temporal weighting code for upsampling (spatial_temp
oral_weight_code), frame structure motion compensation type information (frame_motion_type), field structure motion compensation type information (field_motion_type), DCT type information (dct_type), macroblock quantization scale code (quantiser_scale_code), motion vector (s) (moti
on_vectors [s]), reference field selection information used for prediction (motion_vertical_field_select [r] [s]), motion vectors (r, s) (motion_vectors [s] [r]), basic difference motion vectors (motion_code [r]) [s] [t]), residual vector (motion_residual [r] [s] [t]) dual-prime difference vector (dmvector [t]), CBP (coded_block_pat
tern) and block data (block (i)).

【０１４８】現マクロブロックと前マクロブロックアド
レスの差分量（macroblock_address_increment）には、
現在のマクロブロックの位置（macroblock_address）
が、直前のマクロブロックの位置（previous_macrobloc
k_address）との差分として設定される。この現マクロ
ブロックと前マクロブロックアドレスの差分量（macrob
lock_address_increment）が１より大きい場合には、現
在のマクロブロックと直前の符号化マクロブロックとの
間のマクロブロックはスキップマクロブロックとなり、
画像圧縮情報（ビットストリーム）中に動きベクトルや
離散コサイン変換係数などの符号化情報が存在しない。
スキップマクロブロックとなる条件は、Ｐピクチャでは
動きベクトル値が水平、垂直成分ともに０且つ離散コサ
イン変換係数が全て０であることであり、Ｂピクチャで
は一つ前のマクロブロックと動き補償予測モード（前方
向、後方向、双方向）及び動きベクトルが同じで且つ離
散コサイン変換係数が全て０であることである。ＭＰＥ
Ｇ２では、スキップマクロブロックの導入により、不要
なマクロブロックの情報の削減を図っている。なお、Ｉ
ピクチャではスキップマクロブロックは存在せず、ま
た、Ｐ，Ｂピクチャにおいても、スライスの最初と最後
のマクロブロックはスキップマクロブロックとはならず
に必ず符号化される。The difference between the current macroblock and the previous macroblock address (macroblock_address_increment) includes:
Current macroblock location (macroblock_address)
Is the position of the previous macroblock (previous_macrobloc
k_address). The difference between the current macroblock and the previous macroblock address (macrob
If lock_address_increment is greater than 1, the macroblock between the current macroblock and the immediately preceding coded macroblock is a skip macroblock,
Encoding information such as a motion vector and a discrete cosine transform coefficient does not exist in the image compression information (bit stream).
The condition for a skipped macroblock is that the motion vector value of the P picture is 0 for both the horizontal and vertical components and the discrete cosine transform coefficients are all 0. For the B picture, the previous macroblock and the motion compensation prediction mode ( (Forward, backward, bidirectional) and the motion vector are the same, and the discrete cosine transform coefficients are all zero. MPE
In G2, unnecessary macroblock information is reduced by introducing skipped macroblocks. Note that I
There are no skip macroblocks in pictures, and even in P and B pictures, the first and last macroblocks of a slice are always encoded without being skipped macroblocks.

【０１４９】マクロブロック符号化タイプ情報（macrob
lock_type）には、動き補償予測モード、及び、離散コ
サイン変換係数符号化モードが決定される。ピクチャタ
イプによって可能なモードは異なるが、基本的には、動
きベクトルが０かどうかを示すＭＣ／Ｎｏ＿ＭＣ選択モ
ード（Ｐピクチャの場合）、動き補償予測の方向（Ｂピ
クチャの場合で、前方向、後方向若しくは双方向）、Ｃ
ｏｄｅｄ／Ｎｏｔ＿Ｃｏｄｅｄ（離散コサイン変換係数
の有無を示す）、フレーム内符号化及び量子化スケール
コードの更新が設定される。The macroblock coding type information (macrob
In lock_type), a motion compensation prediction mode and a discrete cosine transform coefficient encoding mode are determined. Although possible modes differ depending on the picture type, basically, an MC / No_MC selection mode indicating whether a motion vector is 0 (in the case of a P picture), a direction of motion compensation prediction (in the case of a B picture, a forward direction, Backward or bidirectional), C
oded / Not_Coded (indicating presence / absence of discrete cosine transform coefficients), intra-frame encoding, and updating of the quantization scale code are set.

【０１５０】フレーム構造の動き補償タイプ情報（fram
e_motion_type）及びフィールド構造の動き補償タイプ
情報（field_motion_type）には、フィールド単位の動
き補償を行うか、フレーム単位の動き補償を行うかが設
定される。以下、前者をフィールド動き補償モード、後
者をフレーム動き補償モードと呼ぶ。The frame type motion compensation type information (fram
In the “e_motion_type” and the motion compensation type information of the field structure (field_motion_type), it is set whether to perform motion compensation on a field basis or on a frame basis. Hereinafter, the former is called a field motion compensation mode, and the latter is called a frame motion compensation mode.

【０１５１】動きベクトル（ｓ）（motion_vectors
[s]）では、動きベクトルが設定される。アップサンプ
ル用時空間重み付けコード（spatial_temporal_weight_
code）では、下位レイヤをアップサンプルして上位レイ
ヤを予測するための時空間重み付け係数が設定される。
マクロブロック量子化スケールコード（quantiser_scal
e_code）では、マクロブロック毎の量子化スケールのサ
イズが設定されている。なお、量子化スケールの制御に
関しては、先述のように、符号量制御装置７によって行
われる。The motion vectors (s) (motion_vectors)
In [s]), a motion vector is set. Spatio-temporal weighting code for upsampling (spatial_temporal_weight_
In code), a spatio-temporal weighting coefficient for up-sampling the lower layer and predicting the upper layer is set.
Macroblock quantization scale code (quantiser_scal
In e_code), the size of the quantization scale for each macroblock is set. Note that the quantization scale is controlled by the code amount control device 7 as described above.

【０１５２】続いて、ブロック層は、図１４に示すよう
に、マクロブロック層の下の階層に位置する層である。
ブロック層には、量子化された離散コサイン変換係数か
ら構成されたブロックデータ（block(i)）が定義され
る。ブロックデータ（block(i)）は、輝度信号若しくは
色差信号の８ライン×８画素から構成され、離散コサイ
ン変換及び逆離散コサイン変換はこの単位で行われる。Subsequently, as shown in FIG. 14, the block layer is a layer located at a lower layer than the macro block layer.
Block data (block (i)) composed of quantized discrete cosine transform coefficients is defined in the block layer. The block data (block (i)) is composed of 8 lines × 8 pixels of a luminance signal or a color difference signal, and discrete cosine transform and inverse discrete cosine transform are performed in this unit.

【０１５３】ブロックデータには、図２７に示すよう
に、ＤＣＴ輝度直流係数差分サイズ（dct_dc_size_lumi
nance）、ＤＣＴ輝度直流係数差分値（dct_dc_differen
cial）、ＤＣＴ色差直流係数差分サイズ（dct_dc_size_
chrominance）、ＤＣＴ色差直流係数差分値（dct_dc_di
fferencial）、非イントラブロックの第一非零係数（Fi
rst_DCT_coefficients）、後続のＤＣＴ係数（Subsquen
ce_DCT_coefficients）、ブロック内のＤＣＴ係数終了
フラグ（End_of_Block）が含まれている。As shown in FIG. 27, DCT luminance DC coefficient difference size (dct_dc_size_lumi)
nance), DCT luminance DC coefficient difference value (dct_dc_differen
cial), DCT color difference DC coefficient difference size (dct_dc_size_
chrominance), DCT color difference DC coefficient difference value (dct_dc_di
fferencial), the first nonzero coefficient of the non-intra block (Fi
rst_DCT_coefficients) and subsequent DCT coefficients (Subsquen
ce_DCT_coefficients), and a DCT coefficient end flag (End_of_Block) in the block.

【０１５４】ＤＣＴ輝度直流係数差分サイズ（dct_dc_s
ize_luminance）には、輝度成分のイントラマクロブロ
ック内の直流成分係数と、隣のブロックとの差分の大き
さが与えられる。ＤＣＴ輝度直流係数差分値（dct_dc_d
ifferencial）には、輝度成分のイントラマクロブロッ
クの直流成分の係数の差分情報が与えられる。ＤＣＴ色
差直流係数差分サイズ（dct_dc_size_chrominance）、
色差成分のイントラマクロブロック内の直流成分係数
と、隣のブロックとの差分の大きさが与えられる。ＤＣ
Ｔ色差直流係数差分値（dct_dc_differencial）では、
色差成分のイントラマクロブロックの直流成分の係数の
差分情報が与えられる。非イントラブロックの第一非零
係数（First_DCT_coefficients）では、インターマクロ
ブロックの非０量子化離散コサイン変換係数までの０係
数の長さ（以下、ラン長と呼ぶ。）が与えられる。後続
のＤＣＴ係数（Subsquence_DCT_coefficients）では、
インターマクロブロックの非０係数の大きさが与えられ
る。ブロック内のＤＣＴ係数終了フラグ（End_of_Bloc
k）では、各ブロックでの離散コサイン変換係数が終了
情報が与えられる。DCT luminance DC coefficient difference size (dct_dc_s
ize_luminance) is given the magnitude of the difference between the DC component coefficient in the intra macroblock of the luminance component and the adjacent block. DCT luminance DC coefficient difference value (dct_dc_d
Ifferencial), difference information of the coefficient of the DC component of the intra macroblock of the luminance component is given. DCT color difference DC coefficient difference size (dct_dc_size_chrominance),
The magnitude of the difference between the DC component coefficient in the intra macroblock of the color difference component and the adjacent block is given. DC
In the T color difference DC coefficient difference value (dct_dc_differencial),
The difference information of the coefficient of the DC component of the intra macroblock of the color difference component is provided. In the first non-zero coefficient (First_DCT_coefficients) of the non-intra block, the length of a zero coefficient up to the non-zero quantized discrete cosine transform coefficient of the inter macroblock (hereinafter, referred to as a run length) is given. Subsequent DCT coefficients (Subsquence_DCT_coefficients)
The magnitude of the non-zero coefficient of the inter macroblock is given. DCT coefficient end flag in block (End_of_Bloc
In k), the end information is given to the discrete cosine transform coefficient in each block.

【０１５５】以上のように構成されるＭＰＥＧ２画像圧
縮情報（ビットストリーム）は、ＭＰＥＧ−２画像情報
符号化装置により符号化され復号装置等に送出される。The MPEG2 image compression information (bit stream) configured as described above is encoded by the MPEG-2 image information encoding device and transmitted to a decoding device or the like.

【０１５６】（本発明の第１の実施の形態の画像情報変
換装置）つぎに、本発明を適用した第１の実施の形態の
画像情報変換装置について説明する。(Image Information Converter of First Embodiment of the Present Invention) Next, an image information converter of the first embodiment to which the present invention is applied will be described.

【０１５７】図２８に本発明の第１の実施の形態の画像
情報変換装置のブロック図を示す。この画像情報変換装
置は、ＭＰＥＧ−２方式で符号化された画像圧縮情報
（ビットストリーム）の符号量（ビットレート）を削減
して、低ビットレートの画像圧縮情報を出力する装置で
ある。FIG. 28 is a block diagram showing an image information conversion apparatus according to the first embodiment of the present invention. This image information conversion apparatus is an apparatus that outputs a low bit rate image compression information by reducing the code amount (bit rate) of the image compression information (bit stream) encoded by the MPEG-2 system.

【０１５８】図２８に示す画像情報変換装置２０は、符
号バッファ２３と、圧縮情報解析装置２４と、可変長復
号化装置２５と、逆量子化装置２６と、帯域制限装置２
７と、量子化装置２８と、可変長符号化装置２９と、符
号バッファ３０と、符号量制御装置３１とから構成され
る。An image information conversion device 20 shown in FIG. 28 includes a code buffer 23, a compression information analysis device 24, a variable length decoding device 25, an inverse quantization device 26, and a band limiting device 2.
7, a quantization device 28, a variable length coding device 29, a code buffer 30, and a code amount control device 31.

【０１５９】符号バッファ２３には、多くの符号量（高
いビットレート）の画像圧縮情報（ビットストリーム）
が入力される。画像圧縮情報（ビットストリーム）は、
上述したようにＶＢＶバッファの拘束条件を満たすよう
符号化されているので、符号バッファ２３でオーバーフ
ロー、アンダーフローを起こすことはない。符号バッフ
ァ２３に格納された画像圧縮情報（ビットストリーム）
は、圧縮情報解析装置２４に送られる。The code buffer 23 stores image compression information (bit stream) having a large code amount (high bit rate).
Is entered. Image compression information (bit stream)
As described above, since the encoding is performed so as to satisfy the constraint condition of the VBV buffer, no overflow or underflow occurs in the code buffer 23. Image compression information (bit stream) stored in the code buffer 23
Is sent to the compression information analyzer 24.

【０１６０】圧縮情報解析装置２４は、図１５〜図２７
に示したような、ＭＰＥＧ−２の構文（シンタクス）に
従って画像圧縮情報（ビットストリーム）中から情報を
抽出して、抽出した情報を可変長復号化装置２５及び以
後の各装置に送る。[0160] The compression information analysis device 24 has a
The information is extracted from the compressed image information (bit stream) according to the MPEG-2 syntax (syntax) as shown in (1), and the extracted information is sent to the variable length decoding device 25 and each of the subsequent devices.

【０１６１】可変長復号化装置２５は、まず、イントラ
マクロブロックの直流成分に対しては隣のブロックとの
差分値として符号化されているデータを可変長復号し、
その他の係数に対してはランとレベルにより符号化され
たデータを可変長復号し、量子化された一次元の離散コ
サイン変換係数を検出する。次に、可変長復号化装置２
５は、圧縮情報解析装置２４により抽出された走査方式
（ジグザグスキャン若しくはオルタネートスキャン）に
関する情報に基づき、一次元配列された離散コサイン変
換係数を逆スキャンして、量子化された二次元の離散コ
サイン変換係数に再配列する。二次元配列された量子化
された離散コサイン変換係数は、逆量子化装置２６に送
られる。The variable-length decoding device 25 first performs variable-length decoding on the DC component of the intra macroblock, which is encoded as a difference value from the adjacent block.
For other coefficients, the data encoded by the run and the level are subjected to variable length decoding, and quantized one-dimensional discrete cosine transform coefficients are detected. Next, the variable length decoding device 2
Reference numeral 5 denotes an inverse scan of the discrete cosine transform coefficients arranged one-dimensionally on the basis of information on the scanning method (zigzag scan or alternate scan) extracted by the compression information analysis device 24, thereby obtaining a quantized two-dimensional discrete cosine. Rearrange them into transform coefficients. The two-dimensionally arranged quantized discrete cosine transform coefficients are sent to the inverse quantization device 26.

【０１６２】逆量子化装置２６は、圧縮情報解析装置２
４により抽出された量子化幅及び量子化行列に関する情
報に基づき、量子化されている離散コサイン変換係数を
逆量子化する。逆量子化された離散コサイン変換係数
は、帯域制限装置２７に送られる。The inverse quantization device 26 is a compression information analysis device 2
4, the quantized discrete cosine transform coefficients are inversely quantized based on the information on the quantization width and the quantization matrix extracted in step 4. The inversely quantized discrete cosine transform coefficients are sent to the band limiting device 27.

【０１６３】帯域制限装置は、逆量子化装置２６から送
られた離散コサイン変換係数に対して、ＤＣＴブロック
毎に、水平方向高周波成分係数の帯域制限をする。The band limiting device limits the band of the high frequency component coefficient in the horizontal direction for each DCT block with respect to the discrete cosine transform coefficient sent from the inverse quantization device.

【０１６４】図２９に、帯域制限装置２７における水平
方向高周波成分の帯域制限処理の一例を示す。例えば、
帯域制限装置２７は、輝度信号に関しては、図２９
（Ａ）に示すように８×８の離散コサイン変換係数のう
ち、水平方向低域成分である６×８係数のみの値を保存
し、残りを０と置きかえる。また、帯域制限装置２７
は、色差信号に関しては、図２９（Ｂ）に示すように、
８×８の離散コサイン変換係数のうち、水平方向低域成
分である４×８係数のみの値を保存し、残りを０と置き
かえる。FIG. 29 shows an example of the band limiting process of the high frequency component in the horizontal direction in the band limiting device 27. For example,
The band limiting device 27 controls the luminance signal as shown in FIG.
As shown in (A), of the 8 × 8 discrete cosine transform coefficients, only the values of the 6 × 8 coefficients, which are the low-frequency components in the horizontal direction, are stored, and the rest are replaced with 0. Also, the band limiting device 27
With respect to the color difference signal, as shown in FIG.
Of the 8 × 8 discrete cosine transform coefficients, only the values of 4 × 8 coefficients, which are low-frequency components in the horizontal direction, are stored, and the rest are replaced with 0.

【０１６５】このように離散コサイン変換係数の高周波
成分を帯域制限することで、周波数領域において符号量
（ビットレート）の削減をすることができる。By limiting the band of the high-frequency component of the discrete cosine transform coefficient, the amount of code (bit rate) can be reduced in the frequency domain.

【０１６６】また、入力となる画像圧縮情報（ビットス
トリーム）が、飛び越し走査画像のものである場合に
は、フィールド間の時間差に関する情報を、離散コサイ
ン変換係数の垂直方向高域成分が含むことになる。その
ため、垂直方向の離散コサイン変換係数の帯域制限を行
うことは大幅な画質劣化に繋がる。従って、この帯域制
限装置２７では、垂直方向の帯域制限は行わない。If the input image compression information (bit stream) is that of an interlaced image, the information on the time difference between fields is included in the vertical high-frequency component of the discrete cosine transform coefficient. Become. Therefore, limiting the band of the discrete cosine transform coefficient in the vertical direction leads to a significant deterioration in image quality. Therefore, the band limiting device 27 does not perform band limiting in the vertical direction.

【０１６７】また、この帯域制限装置２７では、劣化が
より人間の目に付きやすい輝度信号に比べ、より人間の
目に付きにくい色差信号に対して、より大きく帯域制限
を行っている。このことにより、この帯域制限装置２７
では、画質劣化を最小限に抑えながら、再量子化の歪み
を低減することができる。なお、削減する符号量（ビッ
トレート）が少ない場合や回路的な制限がある場合等
は、輝度信号と色差信号との帯域制限を同一にしてもよ
い。The band limiting device 27 limits the band of a color difference signal which is less likely to be seen by humans than a luminance signal which is more easily seen by humans. As a result, this band limiting device 27
Thus, distortion of requantization can be reduced while minimizing image quality deterioration. When the amount of code (bit rate) to be reduced is small or when there is a circuit limitation, the band limitation of the luminance signal and the color difference signal may be the same.

【０１６８】また、帯域制限装置２７における水平方向
の離散コサイン変換係数の帯域制限処理は、この図２９
に示したような係数を０と置く処理に限らない。例え
ば、０と置き換える代わりに、予め用意した重み係数を
離散コサイン変換の水平方向高域成分に乗じることで同
様に符号量（ビットレート）を削減することが可能であ
る。The band limiting process of the discrete cosine transform coefficient in the horizontal direction in the band limiting device 27 is the same as that shown in FIG.
Is not limited to the process of setting the coefficient to 0 as shown in FIG. For example, instead of replacing with 0, it is possible to similarly reduce the code amount (bit rate) by multiplying a previously prepared weight coefficient by the horizontal high-frequency component of the discrete cosine transform.

【０１６９】帯域制限装置２７により帯域制限がされた
離散コサイン変換係数は、量子化装置２８に送られる。The discrete cosine transform coefficient band-limited by the band limiting device 27 is sent to the quantization device 28.

【０１７０】量子化装置２８は、帯域制限装置２７から
送られた８×８離散コサイン変換係数を量子化する。そ
の際用いられる量子化幅は、以下に説明するように符号
量制御装置３１により制御される。また、量子化の際に
用いる量子化行列は、入力された高い符号量（高ビット
レート）の画像圧縮情報（ビットストリーム）で用いら
れている量子化行列をそのまま用いても良いが、再量子
化に適した行列を新たに設定して用いてもよい。The quantizing device quantizes the 8 × 8 discrete cosine transform coefficients sent from the band limiting device 27. The quantization width used at that time is controlled by the code amount control device 31 as described below. As the quantization matrix used for quantization, the quantization matrix used in the input image compression information (bit stream) having a high code amount (high bit rate) may be used as it is. A new matrix suitable for conversion may be used.

【０１７１】以下符号化制御装置３１における量子化幅
の制御方法について説明する。A method of controlling the quantization width in the coding control device 31 will be described below.

【０１７２】図１０において説明したように、ＭＰＥＧ
−２画像情報符号化装置において適用されるＭＰＥＧ−
２ＴｅｓｔＭｏｄｅｌ５（ＩＳＯ／ＩＥＣＪＴ
Ｃ１／ＳＣ２９／ＷＧ１１Ｎ０４００）で用いられて
いる手法では、まず、ＧＯＰを構成するピクチャ（Ｉピ
クチャ，Ｐピクチャ，Ｂピクチャ）に関する情報に基づ
きピクチャ毎の目標符号量（ターゲットビットレート）
が算出され、符号化処理が行われる。しかしながら、本
実施の形態の画像情報変換装置２０では、画像圧縮情報
（ビットストリーム）が入力されるので、ヘッダ情報か
らＧＯＰ情報を読み取ることができなず、ＭＰＥＧ−２
ＴｅｓｔＭｏｄｅｌ５で用いられている手法をそ
のまま適用することができない。As described with reference to FIG.
-2 MPEG applied in image information coding apparatus
2 Test Model 5 (ISO / IEC JT
In the method used in C1 / SC29 / WG11 N0400), first, a target code amount (target bit rate) for each picture based on information about pictures (I picture, P picture, B picture) constituting a GOP.
Is calculated, and an encoding process is performed. However, in the image information conversion device 20 of the present embodiment, since image compression information (bit stream) is input, GOP information cannot be read from header information, and MPEG-2
The technique used in Test Model 5 cannot be applied as it is.

【０１７３】そのため、画像情報変換装置２０では、図
３０に示すように、まず、圧縮情報解析装置２４におい
て、１ピクチャに割り当てられた符号量を予め解析する
（ステップＳ１１）。そして、解析した１ピクチャに割
り当てられた符号量の情報を、符号量制御装置３１に与
える。符号量制御装置３１は、与えられたこの符号量を
Ｂ₁とし、入力ビットストリームのビットレートをＲ₁と
し、出力ビットストリームのビットレートをＲ₂とし
て、当該ピクチャの目標符号量（ターゲットビットレー
ト）Ｔを、以下の式（２６）によって算出して、目標符
号量を設定する（ステップＳ１２）。Therefore, in the image information conversion device 20, as shown in FIG. 30, first, the compression information analysis device 24 analyzes the code amount assigned to one picture in advance (step S11). Then, information on the code amount assigned to the analyzed one picture is provided to the code amount control device 31. The code amount control device 31 sets the given code amount to B ₁ , the bit rate of the input bit stream to R ₁ , the bit rate of the output bit stream to R ₂ , and sets the target code amount (target bit rate) of the picture. ) T is calculated by the following equation (26), and a target code amount is set (step S12).

【０１７４】[0174]

【数２６】 (Equation 26)

【０１７５】なお、Ｒ₁、Ｒ₂は、図１５に示したシーケ
ンスヘッダ内の情報を解析することにより抽出すること
ができる。Note that R ₁ and R ₂ can be extracted by analyzing the information in the sequence header shown in FIG.

【０１７６】そして、符号量制御装置３１は、以下、図
１０に示したステップＳ１、ステップＳ２、ステップＳ
３と同一の処理を行う。Then, the code amount control device 31 performs the following steps S1, S2, S2 shown in FIG.
The same processing as in step 3 is performed.

【０１７７】このように画像情報変換装置２０では、１
ピクチャに割り当てられた符号量を予め解析し、この解
析した符号量に入出力される画像圧縮情報のビットレー
トの比率を乗じることで、ＧＯＰに関する情報が無くて
も、ＭＰＥＧ−２ＴｅｓｔＭｏｄｅｌ５で定められ
た方式での符号量の制御を行うことができる。As described above, in the image information conversion apparatus 20, 1
The code amount assigned to the picture is analyzed in advance, and the analyzed code amount is multiplied by the bit rate ratio of the input / output image compression information, so that even if there is no information on the GOP, the code amount is determined by MPEG-2 Test Model 5. It is possible to control the code amount in the specified method.

【０１７８】また、符号量制御装置３１は、符号バッフ
ァ３０において、先述のＶＢＶバッファの拘束条件が満
たされるよう、各マクロブロックに対して量子化幅の割
当も行う。The code amount control device 31 also allocates a quantization width to each macro block in the code buffer 30 so that the above-described constraint condition of the VBV buffer is satisfied.

【０１７９】以上のように再量子化された離散コサイン
変換係数は、量子化装置２９から可変長符号化装置２９
に送られる。The discrete cosine transform coefficients requantized as described above are supplied from the quantizer 29 to the variable-length encoder 29.
Sent to

【０１８０】可変長復号化装置２９は、量子化した離散
コサイン変換係数を、平均符号長が短くなるように符号
化する。その際、可変長復号化装置５は、離散コサイン
変換係数の直流成分に関しては、１ブロック前の直流成
分係数を予測値としてその差分を符号化し、その他の成
分に関しては、予め設定された走査方式（ジグザグスキ
ャン若しくはオルタネートスキャン）に基づいて１次元
の配列データに並べ替えた後、連続する０係数の数（ラ
ン）及び非０係数（レベル）のペアを事象とした可変長
符号化を行う。そして、量子化装置４は、ＤＣＴブロッ
ク内のスキャンを行っている際に、それ以降の係数の値
が全て０となった場合、ＥＯＢ（End OfBlock）と呼ば
れる符号を出力し、そのブロックに対する可変長符号化
を終了する。The variable length decoding device 29 encodes the quantized discrete cosine transform coefficients so that the average code length becomes short. At this time, the variable-length decoding device 5 encodes the difference with the DC component coefficient of the previous block as a prediction value for the DC component of the discrete cosine transform coefficient, and encodes the difference for the other components using a preset scanning method. After rearrangement into one-dimensional array data based on (zigzag scan or alternate scan), variable-length coding is performed using a pair of the number of consecutive 0 coefficients (runs) and the non-zero coefficient (level) as an event. Then, the quantization device 4 outputs a code called EOB (End Of Block) when scanning all the coefficients after that during the scan in the DCT block, and outputs a code for the block. End long coding.

【０１８１】なお、可変長符号化装置２９は、入力され
た高い符号量（高ビットレート）の画像圧縮情報のスキ
ャン方式に関わらず、オルタネートスキャン方式により
離散コサイン変換係数を１次元データに配列してもよ
い。オルタネートスキャン方式により離散コサイン変換
係数を１次元データに配列するのは、以下の理由によ
る。Note that the variable-length coding device 29 arranges discrete cosine transform coefficients into one-dimensional data by an alternate scan method regardless of the scan method of the inputted high-code-rate (high bit rate) image compression information. You may. The reason why the discrete cosine transform coefficients are arranged in one-dimensional data by the alternate scan method is as follows.

【０１８２】すなわち、入力となる画像圧縮情報（ビッ
トストリーム）のあるブロックの離散コサイン変換係数
が、例えば、図３１（Ａ）に示すようになっていたとす
る。図３１において、●で示す係数は非０係数であり、
○で示すは０係数である。このような離散コサイン変換
係数に対して離散コサイン変換係数の水平高周波成分を
０としたとすると、非０係数の分布は例えば図３１
（Ｂ）に示すようになる。この図３１（Ｂ）に示す水平
高周波成分を０とした離散コサイン変換係数を、ジグザ
グスキャンで再符号化すると、最後の非０係数のスキャ
ン番号は５０となる（図６（Ａ）参照）。それに対し、
走査変換を行ってオルタネートスキャンで改めて符号化
すると、最後の非０係数のスキャン番号は４４になる
（図６（Ｂ）参照）。このことから、水平高周波成分を
０とした離散コサイン変換係数に対して可変長符号化す
る場合には、オルタネートスキャン方式によりスキャン
をすれば、ジグザグスキャンの場合より早いスキャン番
号でＥＯＢ信号を設定することが可能となる。そのた
め、量子化幅としてより細かな値を割り当てることがで
き、再量子化に伴う量子化歪みを低減することができ
る。That is, it is assumed that the discrete cosine transform coefficient of a certain block of the input image compression information (bit stream) is as shown in FIG. 31A, for example. In FIG. 31, coefficients indicated by ● are non-zero coefficients,
The circles indicate the 0 coefficient. Assuming that the horizontal high frequency component of the discrete cosine transform coefficient is 0 for such a discrete cosine transform coefficient, the distribution of the non-zero coefficient is, for example, as shown in FIG.
The result is as shown in FIG. When the discrete cosine transform coefficient in which the horizontal high frequency component shown in FIG. 31B is set to 0 is re-encoded by zigzag scan, the scan number of the last non-zero coefficient becomes 50 (see FIG. 6A). For it,
When the scan conversion is performed and the encoding is performed again by the alternate scan, the scan number of the last non-zero coefficient becomes 44 (see FIG. 6B). From this, when performing variable length coding on the discrete cosine transform coefficient with the horizontal high frequency component set to 0, if the scan is performed by the alternate scan method, the EOB signal is set with a scan number earlier than that in the zigzag scan. It becomes possible. Therefore, a finer value can be assigned as the quantization width, and quantization distortion accompanying re-quantization can be reduced.

【０１８３】そして、可変長符号化装置５により可変長
符号化された離散コサイン変換係数は符号バッファ６に
送られ、この符号バッファ６に一時格納されたのち、Ｍ
ＰＥＧ−２に規定されたビットストリーム構造とされ
て、圧縮画像情報として出力される。The discrete cosine transform coefficients subjected to variable length coding by the variable length coding device 5 are sent to the code buffer 6 and temporarily stored in the code buffer 6.
A bit stream structure defined in PEG-2 is output as compressed image information.

【０１８４】以上のように、本発明の第１の実施の形態
の画像情報変換装置２０では、周波数領域で各ブロック
のデータの受け渡しを行って符号量（ビットレート）を
削減することができるので、ベースバンドのビデオデー
タまで復号した後符号化する従来の画像情報変換装置に
比べて、演算量が少なくなり、また、回路構成を大幅に
削減することができる。As described above, in the image information conversion apparatus 20 according to the first embodiment of the present invention, the code amount (bit rate) can be reduced by transferring the data of each block in the frequency domain. As compared with a conventional image information conversion apparatus that decodes and encodes baseband video data, the amount of calculation is reduced, and the circuit configuration can be significantly reduced.

【０１８５】なお、上記画像情報変換装置２０では、逆
量子化装置２６と量子化装置２８との間に帯域制限装置
２７を設けているが、符号量（ビットレート）の削減量
によってはこの帯域制限装置２７を設けなくても良い。In the image information converter 20, a band limiting device 27 is provided between the inverse quantization device 26 and the quantization device 28. However, depending on the amount of code (bit rate) reduction, this band The restriction device 27 need not be provided.

【０１８６】（本発明の第２の実施の形態の画像情報変
換装置）つぎに、本発明を適用した第２の実施の形態の
画像情報変換装置について説明する。(Image Information Conversion Apparatus According to Second Embodiment of the Present Invention) Next, an image information conversion apparatus according to a second embodiment of the present invention will be described.

【０１８７】図３２に本発明の第２の実施の形態の画像
情報変換装置のブロック図を示す。なお、この第２の実
施の形態の画像情報変換装置を説明するのにあたり、上
記第１の実施の形態の画像情報変換装置２０と同一の構
成要素には図面中に同一の符号を付け、その詳細な説明
を省略する。FIG. 32 is a block diagram showing an image information conversion apparatus according to the second embodiment of the present invention. In describing the image information conversion device of the second embodiment, the same components as those of the image information conversion device 20 of the first embodiment are denoted by the same reference numerals in the drawings. Detailed description is omitted.

【０１８８】図３２に示す画像情報変換装置４０は、符
号バッファ２３と、圧縮情報解析装置２４と、可変長復
号化装置２５と、逆量子化装置２６と、加算器４１と、
帯域制限装置２７と、量子化装置２８と、可変長符号化
装置２９と、符号バッファ３０と、符号量制御装置３１
と、動き補償誤差補正装置４２とから構成される。The image information converter 40 shown in FIG. 32 comprises a code buffer 23, a compression information analyzer 24, a variable length decoder 25, an inverse quantizer 26, an adder 41,
Band limiting device 27, quantizing device 28, variable length coding device 29, code buffer 30, code amount control device 31
And a motion compensation error correction device 42.

【０１８９】加算器４１は、逆量子化装置２６と帯域制
限装置２７との間に設けられる。加算器４１は、逆量子
化装置２６が逆量子化して得られた離散コサイン変換係
数から、動き補償誤差補正装置４２により生成された動
き補償誤差補正係数を減算する。The adder 41 is provided between the inverse quantization device 26 and the band limiting device 27. The adder 41 subtracts the motion compensation error correction coefficient generated by the motion compensation error correction device 42 from the discrete cosine transform coefficient obtained by the inverse quantization performed by the inverse quantization device 26.

【０１９０】動き補償誤差補正装置４２は、逆量子化装
置２６により逆量子化した離散コサイン変換係数を、量
子化装置２８により再量子化する際に生じる動き補償誤
差を補正する動き補償誤差補正係数を生成する。A motion compensation error correction device 42 corrects a motion compensation error generated when the quantized device 28 requantizes the discrete cosine transform coefficient inversely quantized by the inverse quantizer 26. Generate

【０１９１】まず、動き補償誤差が生じる原因について
述べる。First, the cause of the motion compensation error will be described.

【０１９２】原画像の画素値をＯとし、入力された高い
符号量（高ビットレート）の画像圧縮情報（ビットスト
リーム）のこの原画像の画素値Ｏに対する量子化幅をＱ
₁とし、再符号化後の低い符号量（低ビットレート）の
画像圧縮情報（ビットストリーム）のこの原画像の画素
値Ｏに対する量子化幅をＱ₂とする。また、量子化幅
Ｑ₁、量子化幅Ｑ₂で復号された参照画像の画素値を、そ
れぞれＬ（Ｑ₁），Ｌ（Ｑ₂）とする。Let the pixel value of the original image be O, and let Q be the quantization width of the input image compression information (bit stream) of a high code amount (high bit rate) with respect to the pixel value O of this original image.
_1, and the quantization width with respect to the pixel value O in the original image of the image compression information of a low code amount after re-encoding (low bit rate) (bit stream) is referred to as Q _2. Also, let the pixel values of the reference image decoded with the quantization width Q ₁ and the quantization width Q ₂ be L (Q ₁ ) and L (Q ₂ ), respectively.

【０１９３】インターマクロブロックの画素は、符号化
時において、例えば図１に示したＭＰＥＧ−２画像情報
符号化装置の加算器２により差分値“Ｏ−Ｌ（Ｑ₁）”
が計算され、この差分値“Ｏ−Ｌ（Ｑ₁）”に離散コサ
イン変換が施される。このように符号化されたインター
マクロブロックの画素は、復号時においては、差分値
“Ｏ−Ｌ（Ｑ₁）”に逆離散コサイン変換が施され、こ
の差分値“Ｏ−Ｌ（Ｑ₁）”から動き補償により生成さ
れた参照画像“Ｌ（Ｑ₁）”が減算され、原画像の画素
値Ｏが復号される。At the time of encoding, the difference value “OL (Q ₁ )” of the pixel of the inter macroblock is obtained by the adder 2 of the MPEG-2 image information encoding apparatus shown in FIG.
Is calculated, and a discrete cosine transform is applied to the difference value “OL (Q ₁ )”. Pixels of the thus coded inter macroblock, during decoding, inverse discrete cosine transform is applied to the difference value _{"O-L (Q 1)} ", the difference value "O-L (Q ₁₎ Is subtracted from the reference image “L (Q ₁ )” generated by the motion compensation, and the pixel value O of the original image is decoded.

【０１９４】一方、インターマクロブロックの画素は、
図２８に示した画像情報変換装置２０による符号量（ビ
ットレート）を削減時において、逆量子化装置２６及び
量子化装置２８により差分値“Ｏ−Ｌ（Ｑ₁）”の量子
化幅がＱ１からＱ２に変換される。このように符号量を
削減したインターマクロブロックの画素は、復号時にお
いては、差分値“Ｏ−Ｌ（Ｑ₂）”が量子化幅Ｑ₂で符号
化されたものと見なされて復号される。On the other hand, the pixels of the inter macroblock are
When the code amount (bit rate) is reduced by the image information conversion device 20 shown in FIG. 28, the quantization width of the difference value “OL (Q ₁ )” is set to Q1 by the inverse quantization device 26 and the quantization device 28. To Q2. The pixels of the inter macro block whose code amount has been reduced in this way are decoded at the time of decoding, assuming that the difference value “OL (Q ₂ )” is coded with the quantization width Q _2. .

【０１９５】ここで、画像情報変換装置２０において量
子化幅を変えて符号量を削減していることからＱ₁＝Ｑ₂
は成立せず、インターマクロブロックの復号時に量子化
誤差が生じる。従って、インターマクロブロックにより
符号化がされているＰピクチャ、Ｂピクチャに、動き補
償に伴う誤差が発生する。Here, since the amount of code is reduced by changing the quantization width in the image information converter 20, Q ₁ = Q ₂
Does not hold, and a quantization error occurs at the time of decoding the inter macroblock. Therefore, an error accompanying the motion compensation occurs in the P picture and the B picture encoded by the inter macro block.

【０１９６】Ｐピクチャで生じた誤差は、以後このＰピ
クチャを参照画像とするＰピクチャやＢピクチャに伝播
し、さらなる画質劣化に繋がる。The error generated in the P picture propagates to a P picture or a B picture using the P picture as a reference picture, which leads to further deterioration of the picture quality.

【０１９７】この第２の実施の形態の画像情報圧縮装置
４０の動き補償誤差補正装置４２では、動き補償誤差補
正係数を生成し、逆量子化装置２６により逆量子化した
離散コサイン変換係数から減算し、以上の動き補償誤差
を補正している。The motion compensation error correction device 42 of the image information compression device 40 according to the second embodiment generates a motion compensation error correction coefficient and subtracts it from the discrete cosine transform coefficient inversely quantized by the inverse quantization device 26. Then, the above-described motion compensation error is corrected.

【０１９８】続いてこの動き補償誤差補正装置４２につ
いて説明する。Next, the motion compensation error correction device 42 will be described.

【０１９９】動き補償誤差補正装置４２は、逆量子化装
置４３と、加算器４４と、逆離散コサイン変換装置４５
と、ビデオメモリ４６と、動き補償予測装置４７と、離
散コサイン変換装置４８とから構成されている。The motion compensation error correction device 42 includes an inverse quantization device 43, an adder 44, and an inverse discrete cosine transform device 45.
, A video memory 46, a motion compensation prediction device 47, and a discrete cosine transform device 48.

【０２００】逆量子化装置４３は、量子化装置２８によ
り再量子化された離散コサイン変換係数を、上記量子化
装置２８で用いられた量子化行列に基づき逆量子化す
る。逆量子化装置４３により逆量子化された離散コサイ
ン変換係数は、加算器４４に送られる。The inverse quantization device 43 inversely quantizes the discrete cosine transform coefficients requantized by the quantization device 28 based on the quantization matrix used in the quantization device 28. The discrete cosine transform coefficient inversely quantized by the inverse quantization device 43 is sent to the adder 44.

【０２０１】加算器４４は、逆量子化装置４３により逆
量子化された離散コサイン変換係数から、加算器４１に
より動き補償誤差補正係数が減算された離散コサイン変
換係数を減算して、周波数領域の量子化誤差係数を生成
し、逆離散コサイン変換装置４５に送る。The adder 44 subtracts the discrete cosine transform coefficient, from which the motion compensation error correction coefficient has been subtracted by the adder 41, from the discrete cosine transform coefficient inversely quantized by the inverse quantizer 43, to obtain a frequency domain. A quantization error coefficient is generated and sent to the inverse discrete cosine transform unit 45.

【０２０２】逆離散コサイン変換装置４５は、加算器４
４から送られた周波数領域の量子化誤差係数に対して、
逆離散コサイン変換を施す。逆離散コサイン変換を施し
て得らた結果は、空間領域の量子化誤差値として、ビデ
オメモリ４６に格納される。The inverse discrete cosine transform unit 45 includes the adder 4
4 for the quantization error coefficient in the frequency domain sent from
Perform inverse discrete cosine transform. The result obtained by performing the inverse discrete cosine transform is stored in the video memory 46 as a quantization error value in the spatial domain.

【０２０３】動き補償予測装置４７は、入力された高い
符号量（高ビットレート）の画像圧縮情報（ビットスト
リーム）内における動き補償予測モード情報（フィール
ド動き補償予測モード或いはフレーム動き補償予測モー
ド、及び、前方向予測モード、後方向予測モード、或い
は、双方向予測モード）及び、動きベクトル情報に基づ
き、ビデオメモリ４６内の空間領域の量子化誤差値に対
して動き補償を行う。動き補償がされたデータが、空間
領域での動き補償誤差補正値となる。この動き補償誤差
補正値は、離散コサイン変換装置４８に送られる。The motion-compensated prediction device 47 calculates the motion-compensated prediction mode information (field motion-compensated prediction mode or frame motion-compensated prediction mode) in the input image compression information (bit stream) having a high code amount (high bit rate). , Forward prediction mode, backward prediction mode, or bidirectional prediction mode) and motion vector information, and performs motion compensation on the quantization error value in the spatial domain in the video memory 46. The data subjected to the motion compensation becomes a motion compensation error correction value in the spatial domain. This motion compensation error correction value is sent to the discrete cosine transform unit 48.

【０２０４】離散コサイン変換装置４８は、送られた空
間領域での動き補償誤差補正値に対して離散コサイン変
換を施し、周波数領域での動き補償誤差補正係数を生成
する。この動き補償誤差補正係数は、加算器４１に送ら
れる。The discrete cosine transform unit 48 performs a discrete cosine transform on the sent motion compensation error correction value in the spatial domain, and generates a motion compensation error correction coefficient in the frequency domain. The motion compensation error correction coefficient is sent to the adder 41.

【０２０５】そして、この加算器４１において、逆量子
化装置２６により逆量子化された離散コサイン変換係数
から、この動き補償誤差補正係数を減算することによっ
て、動き補償に起因する誤差の補正がされる。In the adder 41, the motion compensation error is corrected by subtracting the motion compensation error correction coefficient from the discrete cosine transform coefficient dequantized by the dequantizer 26. You.

【０２０６】以上のように、本発明の第２の実施の形態
の画像情報変換装置４０では、周波数領域で各ブロック
のデータの受け渡しを行って符号量（ビットレート）を
削減することができるので、ベースバンドのビデオデー
タまで復号した後符号化する従来の画像情報変換装置に
比べて、演算量が少なくなり、また、回路構成を大幅に
削減することができる。これとともに、画像情報変換装
置４０では、動き補償誤差の蓄積に起因する画質劣化を
生じさせずに、符号量を削減することができる。As described above, in the image information conversion device 40 according to the second embodiment of the present invention, the code amount (bit rate) can be reduced by transferring the data of each block in the frequency domain. As compared with a conventional image information conversion apparatus that decodes and encodes baseband video data, the amount of calculation is reduced, and the circuit configuration can be significantly reduced. At the same time, the image information conversion device 40 can reduce the code amount without deteriorating image quality due to accumulation of motion compensation errors.

【０２０７】なお、上記動き補償誤差補正装置４２の逆
離散コサイン変換装置４５及び離散コサイン変換装置４
８では、高速アルゴリズムを適用することが可能であ
る。The inverse discrete cosine transform unit 45 and the discrete cosine transform unit 4 of the motion compensation error correction unit 42
8, a fast algorithm can be applied.

【０２０８】図３３に、文献”Ａｆａｓｔｃｏｍｐ
ｕｔａｔｉｏｎａｌａｌｇｏｒｉｔｈｍｆｏｒｔ
ｈｅｄｉｓｃｒｅｔｅｃｏｓｉｎｅｔｒａｎｓｆ
ｏｒｍ”（ＩＥＥＥＴｒａｎｓ．Ｃｏｍｍｕｎ．，ｖ
ｏｌ．２５，ｎｏ．９ｐｐ．１００４−１００９，１
９７７）に示されている高速アルゴリズムを適用した場
合のフロー図を示す。FIG. 33 shows a document "A fast comp"
national algorithm fort
he discrete cosine transf
orm "(IEEE Trans. Commun., v
ol. 25, no. 9 pp. 1004-109,1
977) is a flowchart when the high-speed algorithm shown in FIG.

【０２０９】この方式は、図３３中の矢印の方向（変数
［ｘ（０）・・・ｘ（７）］から変数［Ｘ（０）・・・
Ｘ（７）］へ向かう方向）に処理することで離散コサイ
ン変換を実行することが可能である。また、図３３中の
矢印と反対の方向（変数［Ｘ（０）・・・Ｘ（７）］か
ら変数［ｘ（０）・・・ｘ（７）］へ向かう方向）に処
理することで逆離散コサイン変換を実行することが可能
である。なお、この図３３で示す“Ｃ_l ⁱ”は、ＣＯＳ
（ｉπ／ｌ）を意味する。In this method, the variables [X (0),..., X (0),.
X (7)], it is possible to execute the discrete cosine transform. In addition, by processing in the direction opposite to the arrow in FIG. 33 (direction from variable [X (0)... X (7)] to variable [x (0)... X (7)]). It is possible to perform an inverse discrete cosine transform. Note that “C _l ⁱ ” shown in FIG.
(Iπ / l).

【０２１０】また、逆離散コサイン変換装置４５及び離
散コサイン変換装置４８では、帯域制限装置３７におい
て水平高域成分の係数が０と置き換えられている場合、
０と置き換えられている係数に対する逆離散コサイン変
換及び離散コサイン変換を省くことで、回路規模及び演
算処理量を削減することが可能である。In the inverse discrete cosine transform unit 45 and the discrete cosine transform unit 48, when the coefficient of the horizontal high frequency component is replaced with 0 in the band limiting unit 37,
By omitting the inverse discrete cosine transform and the discrete cosine transform for the coefficient replaced with 0, it is possible to reduce the circuit scale and the amount of arithmetic processing.

【０２１１】更に、画像における色差信号の劣化は、輝
度信号の劣化に比べ、人間の目には分かり難いという特
色を有しているため、上記の動き補償誤差補正を、輝度
信号のみに適用することで、画質劣化を最小に保ちなが
ら回路規模及び演算処理量を大幅に削減することもでき
る。また、Ｐピクチャにおける誤差はＢピクチャに伝播
するが、Ｂピクチャにおける誤差はそれ以上伝播しな
い。一方、Ｂピクチャには双方向予測モードを含み、多
大なる演算処理量を要する。そこで、Ｐピクチャにのみ
動き補償誤差補正を行うことで、画質劣化を最小に保ち
ながら回路規模及び演算処理量を大幅に削減することも
考えられる。Ｂピクチャにおける処理を行わないこと
で、ビデオメモリ４６の容量を削減することも可能とな
る。Furthermore, since the deterioration of the color difference signal in the image has a characteristic color that is hard to be recognized by human eyes as compared with the deterioration of the luminance signal, the above-described motion compensation error correction is applied only to the luminance signal. Thus, the circuit scale and the amount of arithmetic processing can be significantly reduced while keeping the image quality deterioration to a minimum. Further, the error in the P picture propagates to the B picture, but the error in the B picture does not propagate any more. On the other hand, a B picture includes a bidirectional prediction mode and requires a large amount of calculation processing. Therefore, by performing the motion compensation error correction only on the P picture, it is conceivable to significantly reduce the circuit scale and the amount of arithmetic processing while keeping the image quality deterioration to a minimum. By not performing the processing on the B picture, the capacity of the video memory 46 can be reduced.

【０２１２】また、以上、入力として、ＭＰＥＧ２によ
る画像圧縮情報（ビットストリーム）を対象としてきた
が、直交変換と動き補償によって符号化された画像圧縮
情報（ビットストリーム）では、本装置と同様の構成で
符号量（ビットレート）の削減を行うことが可能であ
る。Although the above description has been directed to image compression information (bit stream) by MPEG2 as input, image compression information (bit stream) encoded by orthogonal transformation and motion compensation has the same configuration as that of the present apparatus. Thus, it is possible to reduce the code amount (bit rate).

【０２１３】（本発明の第３の実施の形態の画像情報変
換装置）つぎに、本発明を適用した第３の実施の形態の
画像情報変換装置について説明する。(Image Information Conversion Apparatus According to Third Embodiment of the Present Invention) Next, an image information conversion apparatus according to a third embodiment of the present invention will be described.

【０２１４】図３２に本発明の第３の実施の形態の画像
情報変換装置のブロック図を示す。なお、この第３の実
施の形態の画像情報変換装置を説明するのにあたり、上
記第１の実施の形態の画像情報変換装置２０及び上記第
２の実施の形態の画像情報変換装置４０と同一の構成要
素には図面中に同一の符号を付け、その詳細な説明を省
略する。FIG. 32 is a block diagram showing an image information conversion apparatus according to the third embodiment of the present invention. In describing the image information conversion apparatus according to the third embodiment, the same image information conversion apparatus 20 as the image information conversion apparatus 20 according to the first embodiment and the image information conversion apparatus 40 according to the second embodiment will be described. The same reference numerals are given to the components in the drawings, and detailed description thereof will be omitted.

【０２１５】図３４に示す画像情報変換装置５０は、符
号バッファ２３と、圧縮情報解析装置２４と、可変長復
号化装置２５と、逆量子化装置２６と、加算器４１と、
帯域制限装置２７と、量子化装置２８と、可変長符号化
装置２９と、符号バッファ３０と、符号量制御装置３１
と、動き補償誤差補正装置５１とから構成される。An image information conversion device 50 shown in FIG. 34 includes a code buffer 23, a compression information analysis device 24, a variable length decoding device 25, an inverse quantization device 26, an adder 41,
Band limiting device 27, quantizing device 28, variable length coding device 29, code buffer 30, code amount control device 31
And a motion compensation error correction device 51.

【０２１６】加算器４１は、逆量子化装置２６と帯域制
限装置２７との間に設けられる。加算器４１は、逆量子
化装置２６が逆量子化して得られた離散コサイン変換係
数から、動き補償誤差補正装置５１により生成された動
き補償誤差補正係数を減算する。The adder 41 is provided between the inverse quantization device 26 and the band limiting device 27. The adder 41 subtracts the motion compensation error correction coefficient generated by the motion compensation error correction device 51 from the discrete cosine transform coefficient obtained by the inverse quantization performed by the inverse quantization device 26.

【０２１７】動き補償誤差補正装置５１は、逆量子化装
置２６により逆量子化した離散コサイン変換係数を、量
子化装置２８により再量子化する際に生じる動き補償誤
差を補正する動き補償誤差補正係数を生成する。そし
て、この動き補償誤差補正装置５１は、この動き補償誤
差補正係数を、逆量子化装置２６により逆量子化した離
散コサイン変換係数から減算し、動き補償誤差を補正し
ている。The motion compensation error correction device 51 corrects a motion compensation error generated when the quantized device 28 requantizes the discrete cosine transform coefficient inversely quantized by the inverse quantizer 26. Generate The motion compensation error correction device 51 corrects the motion compensation error by subtracting the motion compensation error correction coefficient from the discrete cosine transform coefficient inversely quantized by the inverse quantization device 26.

【０２１８】続いてこの動き補償誤差補正装置５１につ
いて説明する。Next, the motion compensation error correction device 51 will be described.

【０２１９】動き補償誤差補正装置５１は、逆量子化装
置４３と、加算器４４と、４×８逆離散コサイン変換装
置５２と、ビデオメモリ５３と、補間装置５４と、動き
補償予測装置５５と、４×８離散コサイン変換装置５６
とから構成されている。The motion compensation error correction device 51 includes an inverse quantization device 43, an adder 44, a 4 × 8 inverse discrete cosine transform device 52, a video memory 53, an interpolation device 54, and a motion compensation prediction device 55. 4 × 8 discrete cosine transform unit 56
It is composed of

【０２２０】逆量子化装置４３は、量子化装置２８によ
り再量子化された離散コサイン変換係数を、上記量子化
装置２８で用いられた量子化行列に基づき逆量子化す
る。逆量子化装置４３により逆量子化された離散コサイ
ン変換係数は、加算器４４に送られる。[0220] The inverse quantization device 43 inversely quantizes the discrete cosine transform coefficients requantized by the quantization device 28 based on the quantization matrix used in the quantization device 28. The discrete cosine transform coefficient inversely quantized by the inverse quantization device 43 is sent to the adder 44.

【０２２１】加算器４４は、逆量子化装置４３により逆
量子化された離散コサイン変換係数から、加算器４１に
より動き補償誤差補正係数が減算された離散コサイン変
換係数を減算し、周波数領域の量子化誤差係数を生成し
て、４×８逆離散コサイン変換装置５２に送る。The adder 44 subtracts the discrete cosine transform coefficient, from which the motion compensation error correction coefficient has been subtracted by the adder 41, from the discrete cosine transform coefficient inversely quantized by the inverse quantizer 43, and performs frequency domain quantization. A generated error coefficient is generated and sent to the 4 × 8 inverse discrete cosine transform device 52.

【０２２２】４×８逆離散コサイン変換装置５２は、加
算器４４から送られた量子化誤差係数に対して、４×８
の逆離散コサイン変換を施し、空間領域の４×８量子化
誤差値を算出する。４×８逆離散コサイン変換装置５２
は、逆離散コサイン変換を施して得らた結果である空間
領域の４×８量子化誤差値を、ビデオメモリ５３に格納
する。The 4 × 8 inverse discrete cosine transform unit 52 converts the quantization error coefficient sent from the adder 44 into a 4 × 8
Is performed, and a 4 × 8 quantization error value in the spatial domain is calculated. 4 × 8 inverse discrete cosine transform device 52
Stores in the video memory 53 the 4 × 8 quantization error value in the spatial domain obtained as a result of performing the inverse discrete cosine transform.

【０２２３】ビデオメモリ５３は、空間領域の４×８量
子化誤差値を格納する。この４×８量子化誤差値は、水
平方向のデータが１／２間引かれているので、入力とな
る画像圧縮情報の解像度に対して１／２の解像度のデー
タが格納できる容量を有していればよい。The video memory 53 stores 4 × 8 quantization error values in the spatial domain. This 4 × 8 quantization error value has a capacity capable of storing data having a resolution of に対して of the resolution of the input image compression information, since the horizontal data is decimated by １／. It should just be.

【０２２４】補間装置５４は、ビデオメモリ５３に格納
されている４×８量子化誤差値の水平方向のデータの補
間処理を行う。この補間装置５４により水平方向の４×
８量子化誤差値の補間を行っておくことにより、後段の
動き補償予測装置５５で水平方向の情報に対して１／４
画素精度の動き予測を行うことができる。補間装置５４
は、例えば線形内挿若しくはハーフバンドフィルタ等の
数タップのデジタルフィルタを用いた内挿を行うことで
補間を行う。なお、この補間装置５４は、動きベクトル
の値に応じて、必要な値のみを補間して生成すればよ
い。[0224] The interpolation device 54 performs an interpolation process on the horizontal data of the 4x8 quantization error value stored in the video memory 53. This interpolator 54 allows the horizontal 4 ×
By interpolating the 8 quantization error values, the motion compensation prediction unit 55 at the subsequent stage
Pixel-precision motion prediction can be performed. Interpolator 54
Performs interpolation by performing linear interpolation or interpolation using a digital filter having several taps such as a half-band filter. It should be noted that the interpolating device 54 may generate only necessary values by interpolating according to the value of the motion vector.

【０２２５】図３５に、線形内挿により、水平方向の１
／４画素精度の補間を行う場合の処理内容を示す。動き
ベクトルが元画素と同位相の画素位置を示している場合
には、図３５（Ａ）に示すように、元画素の値が補間し
た画素の値となる。動きベクトルが元画素と１／４位相
ずれた画素位置を示している場合には、図３５（Ｂ）、
図３５（Ｃ）及び図３５（Ｄ）に示すように、水平方向
に近接する２つの画素値に、その画素との距離の割合を
乗じて、補間した画素の値を求める。FIG. 35 shows that 1 in the horizontal direction is obtained by linear interpolation.
The processing content when performing interpolation with / 4 pixel accuracy is shown. When the motion vector indicates a pixel position having the same phase as the original pixel, the value of the original pixel becomes the value of the interpolated pixel as shown in FIG. When the motion vector indicates a pixel position shifted by 1/4 phase from the original pixel, FIG.
As shown in FIG. 35 (C) and FIG. 35 (D), the value of the interpolated pixel is obtained by multiplying the two pixel values adjacent in the horizontal direction by the ratio of the distance to the pixel.

【０２２６】動き補償予測装置５５は、入力された高い
符号量（高ビットレート）の画像圧縮情報（ビットスト
リーム）内における動き補償予測モード情報（フィール
ド動き補償予測モード或いはフレーム動き補償予測モー
ド、及び、前方向予測モード、後方向予測モード、或い
は、双方向予測モード）及び、動きベクトル情報に基づ
き、ビデオメモリ５３内の空間領域の４×８量子化誤差
値に対して動き補償を行う。このとき、水平方向には補
間装置５４で補間した値を用いて１／４画素精度で動き
補償を行い、垂直方向には１／２画素精度で動き補償を
行う。The motion-compensated prediction device 55 calculates the motion-compensated prediction mode information (field motion-compensated prediction mode or frame motion-compensated prediction mode) in the input compressed image information (bit stream) having a high code amount (high bit rate). , Forward prediction mode, backward prediction mode, or bidirectional prediction mode), and performs motion compensation on a 4 × 8 quantization error value in a spatial region in the video memory 53 based on the motion vector information. At this time, motion compensation is performed in the horizontal direction with 1/4 pixel accuracy using the value interpolated by the interpolation device 54, and motion compensation is performed in the vertical direction with 1/2 pixel accuracy.

【０２２７】動き補償がされたデータが、空間領域での
４×８動き補償誤差補正値となる。この４×８動き補償
誤差補正値は、４×８離散コサイン変換装置５６に送ら
れる。The data subjected to the motion compensation becomes a 4 × 8 motion compensation error correction value in the spatial domain. This 4 × 8 motion compensation error correction value is sent to a 4 × 8 discrete cosine transform device 56.

【０２２８】４×８離散コサイン変換装置５６は、送ら
れた４×８動き補償誤差補正値に対して、４×８の離散
コサイン変換を施し、周波数領域の動き補償誤差補正係
数を生成する。この動き補償誤差補正係数は、加算器４
１に送られる。The 4 × 8 discrete cosine transform unit 56 performs a 4 × 8 discrete cosine transform on the received 4 × 8 motion compensation error correction value, and generates a motion compensation error correction coefficient in the frequency domain. This motion compensation error correction coefficient is calculated by the adder 4
Sent to 1.

【０２２９】そして、この加算器４１において、逆量子
化装置２６により逆量子化された離散コサイン変換係数
から、この動き補償誤差補正係数を減算することによっ
て、動き補償に起因する誤差の補正がされる。In the adder 41, the motion compensation error is corrected by subtracting the motion compensation error correction coefficient from the discrete cosine transform coefficient dequantized by the dequantizer 26. You.

【０２３０】このように画像情報変換装置５０では、８
×８の離散コサイン係数として表される動き補償に起因
する誤差成分のうち、誤差補正に対して大きく寄与する
水平方向の低域成分及び垂直方向の全周波数成分に対し
てのみ演算処理を行い、水平方向の高周波成分に対して
は演算処理を行わない。水平方向の誤差の高周波成分
は、元々誤差の分布の頻度は低く、さらに、帯域制限装
置２７が高周波成分を削減する処理を行っているので、
演算処理を行わなくてもほとんど画質の劣化が生じな
い。もっとも、垂直方向の誤差成分は、例えば入力とな
る画像圧縮情報が飛び越し走査によるものである場合に
は、フィールド間の時間差に関する情報が高域成分に含
まれているので、全周波数成分に対して演算処理を行
う。As described above, in the image information conversion apparatus 50, 8
Of the error components due to motion compensation represented as × 8 discrete cosine coefficients, the arithmetic processing is performed only on the horizontal low-frequency components and all the vertical frequency components that greatly contribute to the error correction, The arithmetic processing is not performed on the high frequency component in the horizontal direction. As for the high frequency component of the error in the horizontal direction, the frequency of distribution of the error is originally low, and the band limiting device 27 performs a process of reducing the high frequency component.
Even if the arithmetic processing is not performed, the image quality hardly deteriorates. However, the error component in the vertical direction is, for example, when the image compression information to be input is based on interlaced scanning, since information on the time difference between fields is included in the high frequency component, Perform arithmetic processing.

【０２３１】以上のように、本発明の第３の実施の形態
の画像情報変換装置５０では、周波数領域で各ブロック
のデータの受け渡しを行って符号量（ビットレート）を
削減することができるので、ベースバンドのビデオデー
タまで復号した後符号化する従来の画像情報変換装置に
比べて、演算量が少なくなり、また、回路構成を大幅に
削減することができる。これとともに、画像情報変換装
置５０では、動き補償誤差の蓄積に起因する画質劣化を
生じさせずに、符号量を削減することができる。また、
動き補償誤差補正係数を求める動き補償誤差補正装置５
１が、水平方向の高周波成分に対しては演算処理を行わ
ないので、さらに演算量を少なくすることができる。As described above, in the image information conversion apparatus 50 according to the third embodiment of the present invention, the code amount (bit rate) can be reduced by transferring the data of each block in the frequency domain. As compared with a conventional image information conversion apparatus that decodes and encodes baseband video data, the amount of calculation is reduced, and the circuit configuration can be significantly reduced. At the same time, the image information conversion device 50 can reduce the code amount without causing image quality deterioration due to accumulation of motion compensation errors. Also,
Motion compensation error correction device 5 for obtaining motion compensation error correction coefficient
1, the arithmetic processing is not performed on the high frequency component in the horizontal direction, so that the amount of operation can be further reduced.

【０２３２】また、４×８逆離散コサイン変換装置５２
及び４×８離散コサイン変換装置５６の垂直方向に対す
る処理としては、通常の８次の逆離散コサイン変換及び
８次の離散コサイン変換を施す。この際、４×８逆離散
コサイン変換装置５２及び４×８離散コサイン変換装置
５６では、高速アルゴリズムを適用することが可能であ
る。The 4 × 8 inverse discrete cosine transform unit 52
As the processing in the vertical direction of the 4 × 8 discrete cosine transform device 56, normal 8th-order inverse discrete cosine transform and 8th-order discrete cosine transform are performed. At this time, the 4 × 8 inverse discrete cosine transform device 52 and the 4 × 8 discrete cosine transform device 56 can apply a high-speed algorithm.

【０２３３】図３６に、文献”Ａｆａｓｔｃｏｍｐ
ｕｔａｔｉｏｎａｌａｌｇｏｒｉｔｈｍｆｏｒｔ
ｈｅｄｉｓｃｒｅｔｅｃｏｓｉｎｅｔｒａｎｓｆ
ｏｒｍ”（ＩＥＥＥＴｒａｎｓ．Ｃｏｍｍｕｎ．，ｖ
ｏｌ．２５，ｎｏ．９ｐｐ．１００４−１００９，１
９７７）に示されている高速アルゴリズムを適用した場
合のフロー図を示す。FIG. 36 shows a document “A fast comp”.
national algorithm fort
he discrete cosine transf
orm "(IEEE Trans. Commun., v
ol. 25, no. 9 pp. 1004-109,1
977) is a flowchart when the high-speed algorithm shown in FIG.

【０２３４】この方式は、図３６中の矢印の方向（変数
［ｘ（０）・・・ｘ（７）］から変数［Ｘ（０）・・・
Ｘ（７）］へ向かう方向）に処理することで離散コサイ
ン変換を実行することが可能である。また、図３６中の
矢印と反対の方向（変数［Ｘ（０）・・・Ｘ（７）］か
ら変数［ｘ（０）・・・ｘ（７）］へ向かう方向）に処
理することで逆離散コサイン変換を実行することが可能
である。なお、この図３６で示す“Ｃ_l ⁱ”は、ＣＯＳ
（ｉπ／ｌ）を意味する。In this method, the variables [X (0),..., X (0),...
X (7)], it is possible to execute the discrete cosine transform. Also, by processing in the direction opposite to the arrow in FIG. 36 (the direction from the variable [X (0)... X (7)] to the variable [x (0)... X (7)]). It is possible to perform an inverse discrete cosine transform. Note that “C _l ⁱ ” shown in FIG.
(Iπ / l).

【０２３５】また、４×８逆離散コサイン変換装置５２
及び４×８離散コサイン変換装置５６の水平方向に対す
る処理としては、以下の２つの方式がある。The 4 × 8 inverse discrete cosine transform unit 52
As the processing in the horizontal direction of the 4 × 8 discrete cosine transform device 56, there are the following two methods.

【０２３６】第１の方式は、４×８逆離散コサイン変換
装置５２において、８次の離散コサイン係数である周波
数領域の量子化誤差係数のうち、低域４係数のみに対し
て４次の逆離散コサイン変換を施する。４×８離散コサ
イン変換装置５６において、動き補償によって生成され
た４×８の空間領域の動き補償誤差補正値に対して、水
平方向の４次の離散コサイン変換を施す。このことによ
り、４×８の周波数領域の動き補償誤差補正係数を出力
することができる。ここで、４次の逆離散コサイン変換
及び４次の離散コサイン変換に高速アルゴリズムを適用
することが可能である。図３７に、４次の高速アルゴリ
ズムを適用した場合のフロー図を示す。この４次の高速
アルゴリズムは、Ｆ（０）〜Ｆ（３）を入力とし、ｆ
（０）〜ｆ（３）を出力とすることにより、逆離散コサ
イン変換を実行することが可能である。また、ｆ（０）
〜ｆ（３）を入力とし、Ｆ（０）〜Ｆ（３）を出力とす
ることにより、離散コサイン変換を実行することが可能
である。In the first method, in the 4 × 8 inverse discrete cosine transform device 52, of the quantization error coefficients in the frequency domain, which are the 8th-order discrete cosine coefficients, only the 4th order inverse Perform discrete cosine transform. The 4 × 8 discrete cosine transform unit 56 performs a fourth-order discrete cosine transform in the horizontal direction on the motion compensation error correction value in the 4 × 8 spatial region generated by the motion compensation. This makes it possible to output a motion compensation error correction coefficient in the 4 × 8 frequency domain. Here, a high-speed algorithm can be applied to the fourth-order inverse discrete cosine transform and the fourth-order discrete cosine transform. FIG. 37 shows a flowchart in the case where the fourth-order high-speed algorithm is applied. This fourth-order high-speed algorithm takes F (0) to F (3) as inputs and f
By setting (0) to f (3) as outputs, it is possible to execute inverse discrete cosine transform. Also, f (0)
To f (3) as input and F (0) to F (3) as output, it is possible to execute discrete cosine transform.

【０２３７】第２の方式は、４×８逆離散コサイン変換
装置５２において、８次の離散コサイン係数である周波
数領域の量子化誤差係数のうち、高域４係数を０と置き
換えて８次の離散コサイン変換を施して、８次の空間領
域の量子化誤差値を得る。続いて、４×８逆離散コサイ
ン変換装置５２は、８次の空間領域の量子化誤差値に対
して、水平方向に平均化処理又は間引き処理を行い、空
間領域の４次の量子化誤差値を生成する。４×８離散コ
サイン変換装置５６は、動き補償によって得られた画素
領域の４点の誤差補正値を、補間処理によって８点にす
る。続いて、４×８離散コサイン変換装置５６は、補間
処理によって８点にした空間領域の動き補償画素補正値
に対して離散コサイン変換を施した後、４次までの低域
成分を取り出す。このことにより４×８の周波数領域で
の誤差補正係数を出力することができる。In the second method, the 4 × 8 inverse discrete cosine transform unit 52 replaces the high-frequency 4 coefficients of the frequency-domain quantization error coefficients, which are the 8th-order discrete cosine coefficients, with 0, and replaces them with the 8th-order discrete cosine coefficients. A discrete cosine transform is performed to obtain a quantization error value of an eighth-order spatial domain. Subsequently, the 4 × 8 inverse discrete cosine transform device 52 performs an averaging process or a thinning process in the horizontal direction on the quantization error value of the eighth-order spatial domain to obtain a fourth-order quantization error value of the spatial domain. Generate The 4 × 8 discrete cosine transform device 56 converts the four error correction values of the pixel area obtained by the motion compensation into eight points by interpolation processing. Subsequently, the 4 × 8 discrete cosine transform device 56 performs discrete cosine transform on the motion compensation pixel correction values in the spatial region that has been made into eight points by the interpolation process, and then extracts low-order components up to the fourth order. This makes it possible to output an error correction coefficient in a 4 × 8 frequency domain.

【０２３８】なお、この第２の方式において、４×８逆
離散コサイン変換装置５２は、逆離散コサイン変換と平
均化処理、又は、逆離散コサイン変換と間引き処理を、
一連の処理として１つの行列で演算しても良い。逆離散
コサイン変換と平均化処理とを１つの行列で演算する場
合の演算式（ｉＤ₄＿ａｖｅ）を図３８に示す。また、
逆離散コサイン変換と間引き処理とを１つの行列で演算
する場合の演算式（ｉＤ₄＿ｄｅｃｉ）を図３９に示
す。In the second method, the 4 × 8 inverse discrete cosine transform unit 52 performs the inverse discrete cosine transform and the averaging process or the inverse discrete cosine transform and the thinning process.
A series of processes may be performed using one matrix. FIG. 38 shows an arithmetic expression (iD ₄ _ave) in the case where the inverse discrete cosine transform and the averaging process are performed using one matrix. Also,
FIG. 39 shows an arithmetic expression (iD ₄ _deci) in the case where the inverse discrete cosine transform and the thinning-out process are performed with one matrix.

【０２３９】また、４×８離散コサイン変換装置５６
も、同様に、補間処理と離散コサイン変換とを一連の処
理として１つの行列で演算しても良い。平均化処理に対
応する補間処理と離散コサイン変換とを１つの行列で演
算する場合の演算式（Ｄ₄＿ａｖｅ）、及び、間引き処
理に対応する補間処理と逆離散コサイン変換とを１つの
行列で演算する場合の演算式（Ｄ₄＿ｄｅｃｉ）は、以
下の式で示される。なお、^t（）は転置行列を示してい
る。The 4 × 8 discrete cosine transform unit 56
Similarly, the interpolation process and the discrete cosine transform may be calculated as a series of processes using one matrix. An arithmetic expression (D ₄ _ave) in the case where the interpolation process corresponding to the averaging process and the discrete cosine transform are calculated using one matrix, and the interpolation process corresponding to the thinning process and the inverse discrete cosine transform are calculated using one matrix. An arithmetic expression (D ₄ _deci) for performing the arithmetic operation is represented by the following expression. Note that ^t () indicates a transposed matrix.

【０２４０】[0240]

【数２７】 [Equation 27]

【０２４１】また、画像における色差信号の劣化は、輝
度信号の劣化に比べ、人間の目には分かり難いという特
色を有している。そのため、４×８逆離散コサイン変換
装置５２では、図４０に示すように、色差信号の離散コ
サイン係数の垂直方向の高域成分４係数を０に置き換え
て、逆直交変換をして演算量の削減を行っても良い。Further, the deterioration of the color difference signal in the image has a characteristic feature that it is difficult for the human eye to understand the deterioration of the luminance signal compared to the deterioration of the luminance signal. For this reason, as shown in FIG. 40, the 4 × 8 inverse discrete cosine transform device 52 replaces the vertical high-frequency component 4 coefficients of the discrete cosine coefficients of the color difference signal with 0, performs inverse orthogonal transform, and reduces the amount of operation. Reduction may be performed.

【０２４２】また、以上、入力として、ＭＰＥＧ２によ
る画像圧縮情報（ビットストリーム）を対象としてきた
が、直交変換と動き補償によって符号化された画像圧縮
情報（ビットストリーム）では、本装置と同様の構成で
符号量（ビットレート）の削減を行うことが可能であ
る。Although the above description has been directed to image compression information (bit stream) by MPEG2 as input, image compression information (bit stream) encoded by orthogonal transformation and motion compensation has the same configuration as that of the present apparatus. Thus, it is possible to reduce the code amount (bit rate).

【０２４３】[0243]

【発明の効果】本発明にかかる画像情報変換装置及び方
法では、直交変換係数を逆量子化した後量子化幅を変え
て再量子化する。このことにより本発明では、画像信号
をベースバンドで処理をせずに周波数領域で処理をする
ことができ、そのため、少ない演算量及び回路規模でビ
ットレートの削減をすることができる。また、本発明で
は、再量子化による歪みを低減でき画質劣化を少なくし
てビットレートの削減をすることができる。In the image information conversion apparatus and method according to the present invention, the orthogonal transform coefficients are inversely quantized and then requantized by changing the quantization width. As a result, according to the present invention, the image signal can be processed in the frequency domain without being processed in the baseband. Therefore, the bit rate can be reduced with a small amount of calculation and a small circuit scale. Further, according to the present invention, distortion due to requantization can be reduced, image quality degradation can be reduced, and the bit rate can be reduced.

【０２４４】また、本発明にかかる画像情報変換装置及
び方法では、直交変換係数を逆量子化した後、直交変換
係数の水平方向の高周波成の値を制限し、量子化幅を変
えて再量子化する。このことにより、本発明では、再量
子化による歪みを低減でき画質劣化を少なくしてビット
レートの削減をすることができる。また、本発明では、
フィールド間の差分情報を含む垂直方向の情報を制限し
ないので、画質劣化を少なくしてビットレートの削減を
することができる。In the image information conversion apparatus and method according to the present invention, after the orthogonal transform coefficient is dequantized, the value of the high frequency component of the orthogonal transform coefficient in the horizontal direction is limited, and the quantization width is changed to change the requantization. Become As a result, in the present invention, distortion due to requantization can be reduced, image quality degradation can be reduced, and the bit rate can be reduced. In the present invention,
Since information in the vertical direction including difference information between fields is not limited, it is possible to reduce image quality degradation and reduce the bit rate.

【０２４５】また、本発明にかかる画像情報変換装置及
び方法では、直交変換係数を逆量子化した後量子化幅を
変えて再量子化するとともに、入力された直交変換係数
と再量子化した直交変換係数との差分を動き補償して動
き補償による誤差を補正する。このことにより、再量子
化にともなう動き補償の誤差の蓄積が減少し、画質の劣
化を少なくしてビットレートの削減をすることができ
る。In the image information conversion apparatus and method according to the present invention, the orthogonal transform coefficient is inversely quantized and then requantized by changing the quantization width, and the input orthogonal transform coefficient and the requantized orthogonal The difference from the transform coefficient is motion-compensated to correct the error due to the motion compensation. As a result, the accumulation of motion compensation errors due to requantization is reduced, and the bit rate can be reduced with less deterioration in image quality.

【０２４６】また、本発明にかかる画像情報変換装置及
び方法では、直交変換係数を逆量子化した後直交変換係
数の水平方向の高周波成の値を制限し量子化幅を変えて
再量子化するとともに、入力された直交変換係数と再量
子化した直交変換係数との差分の水平方向の低域４係数
及び垂直方向の８係数とを、動き補償して動き補償によ
る誤差を補正する。このことにより、再量子化にともな
う動き補償の誤差の蓄積が減少し、画質の劣化を少なく
してビットレートの削減をすることができるとともに、
回路規模を削減することができる。Further, in the image information converting apparatus and method according to the present invention, after the orthogonal transform coefficient is dequantized, the value of the high frequency component of the orthogonal transform coefficient in the horizontal direction is limited, and the quantization width is changed to perform requantization. At the same time, the four low-frequency coefficients in the horizontal direction and the eight coefficients in the vertical direction of the difference between the input orthogonal transform coefficient and the requantized orthogonal transform coefficient are motion-compensated to correct an error due to motion compensation. As a result, the accumulation of errors in motion compensation due to requantization is reduced, and the bit rate can be reduced by reducing the deterioration of image quality.
The circuit scale can be reduced.

[Brief description of the drawings]

【図１】ＭＰＥＧ−２画像情報符号化装置のブロック図
である。FIG. 1 is a block diagram of an MPEG-2 image information encoding device.

【図２】上記ＭＰＥＧ−２画像情報符号化装置の画面並
べ替えバッファによるフレーム順序の並べ替え処理を説
明する図である。（Ａ）は並べ替え前のフレーム順序を
示す図であり、（Ｂ）は並べ替え後のフレーム順序を示
す図である。FIG. 2 is a diagram illustrating a frame order rearrangement process performed by a screen rearrangement buffer of the MPEG-2 image information encoding apparatus. (A) is a diagram showing a frame order before rearrangement, and (B) is a diagram showing a frame order after rearrangement.

【図３】ビデオデータが４：２：０信号の場合のマクロ
ブロックの構成を説明する図である。FIG. 3 is a diagram illustrating a configuration of a macroblock when video data is a 4: 2: 0 signal.

【図４】フレーム離散コサイン変換モードの際のＤＣＴ
ブロックを説明する図である。FIG. 4 shows DCT in the frame discrete cosine transform mode
It is a figure explaining a block.

【図５】フィールド離散コサイン変換モードの際のＤＣ
Ｔブロックを説明する図である。FIG. 5 shows DC in field discrete cosine transform mode
It is a figure explaining a T block.

【図６】可変長符号化をする際の離散コサイン変換係数
のスキャン順序を示す図である。（Ａ）はジグザグスキ
ャンのスキャン順序を示す図であり、（Ｂ）はオルタネ
ートスキャンのスキャン順序を示す図である。FIG. 6 is a diagram illustrating a scan order of discrete cosine transform coefficients when performing variable length coding. (A) is a figure which shows the scan order of a zigzag scan, (B) is a figure which shows the scan order of an alternate scan.

【図７】イントラＤＣ精度に対する逆量子化係数の関係
を示す図である。FIG. 7 is a diagram illustrating a relationship between an inverse quantization coefficient and intra DC precision.

【図８】量子化行列のデフォルト値を示す図である。
（Ａ）はイントラマクロブロックについて用いられるデ
フォルトに設定された量子化行列を示す図である。
（Ｂ）はインターマクロブロックについて用いられるデ
フォルト値に設定された量子化行列を示す図である。FIG. 8 is a diagram showing default values of a quantization matrix.
(A) is a figure which shows the quantization matrix set by default used about an intra macroblock.
(B) is a diagram showing a quantization matrix set to a default value used for an inter macroblock.

【図９】ＭＰＥＧ２画像圧縮情報（ビットストリーム）
の構文中で定義されるｑｕａｎｔｉｓｅｒ＿ｓｃａｌｅ
（量子化スケール）を説明する図である。FIG. 9 shows MPEG2 image compression information (bit stream).
Quantizer_scale defined in the syntax of
It is a figure explaining (quantization scale).

【図１０】上記ＭＰＥＧ−２画像情報符号化装置の符号
量制御装置の動作内容を示すフローチャートである。FIG. 10 is a flowchart showing an operation content of a code amount control device of the MPEG-2 image information encoding device.

【図１１】ＶＢＶバッファのビット占有量の変化を示す
図である。FIG. 11 is a diagram illustrating a change in a bit occupancy of a VBV buffer.

【図１２】ＭＰＥＧ２画像圧縮情報（ビットストリー
ム）の階層構成を示す図である。FIG. 12 is a diagram showing a hierarchical structure of MPEG2 image compression information (bit stream).

【図１３】ＭＰＥＧ２画像圧縮情報（ビットストリー
ム）の各階層におけるスタートコードのコード値を示す
図である。FIG. 13 is a diagram showing a code value of a start code in each layer of MPEG2 image compression information (bit stream).

【図１４】ＭＰＥＧ２画像圧縮情報（ビットストリー
ム）の各階層のブロック構成を示す図である。FIG. 14 is a diagram illustrating a block configuration of each layer of MPEG2 image compression information (bit stream).

【図１５】シーケンスヘッダに含まれる情報を説明する
図である。FIG. 15 is a diagram illustrating information included in a sequence header.

【図１６】シーケンス拡張部に含まれる情報を説明する
図である。FIG. 16 is a diagram illustrating information included in a sequence extension unit.

【図１７】拡張部＆ユーザデータ（１）に含まれる情報
を説明する図である。FIG. 17 is a diagram illustrating information included in an extension part and user data (1).

【図１８】ＧＯＰヘッダに含まれる情報を説明する図で
ある。FIG. 18 is a diagram illustrating information included in a GOP header.

【図１９】拡張部＆ユーザデータ（２）に含まれる情報
を説明する図である。FIG. 19 is a diagram illustrating information included in an extension part and user data (2).

【図２０】ピクチャヘッダに含まれる情報を説明する図
である。FIG. 20 is a diagram illustrating information included in a picture header.

【図２１】ピクチャ符号化拡張部に含まれる情報を説明
する図である。FIG. 21 is a diagram illustrating information included in a picture coding extension unit.

【図２２】拡張部＆ユーザデータ（３）に含まれる情報
を説明する図である。FIG. 22 is a diagram illustrating information included in an extension part and user data (3).

【図２３】画面内に分割されたスライス層を説明する図
である。FIG. 23 is a diagram illustrating a slice layer divided in a screen.

【図２４】スライス層を定義するスライスデータを説明
する図である。FIG. 24 is a diagram illustrating slice data defining a slice layer.

【図２５】スライスデータに含まれる情報を説明する図
である。FIG. 25 is a diagram illustrating information included in slice data.

【図２６】マクロブロックデータに含まれる情報を説明
する図である。FIG. 26 is a diagram illustrating information included in macroblock data.

【図２７】ブロックデータに含まれる情報を説明する図
である。FIG. 27 is a diagram illustrating information included in block data.

【図２８】本発明の第１の実施の形態の画像情報変換装
置のブロック図である。FIG. 28 is a block diagram of an image information conversion device according to the first embodiment of the present invention.

【図２９】上記画像情報変換装置の帯域制限装置による
離散コサイン変換係数の水平高周波成分の帯域制限例を
説明する図である。（Ａ）は輝度信号に対する離散コサ
イン変換係数の帯域制限例を示す図であり、（Ｂ）は色
差信号に対する離散コサイン変換係数の帯域制限例を示
す図である。FIG. 29 is a diagram illustrating an example of band limitation of horizontal high-frequency components of discrete cosine transform coefficients by the band limitation device of the image information conversion device. (A) is a figure which shows the example of a band limitation of the discrete cosine transform coefficient with respect to a luminance signal, (B) is a figure which shows the example of a band limitation of the discrete cosine transform coefficient with respect to a chrominance signal.

【図３０】上記画像情報変換装置の符号量制御装置の動
作内容を示すフローチャートである。FIG. 30 is a flowchart showing the operation of a code amount control device of the image information conversion device.

【図３１】入力された信号のスキャン方式に関わらず、
オルタネートスキャン方式により離散コサイン変換係数
をスキャンすることを説明する図である。（Ａ）は帯域
制限まえの離散コサイン変換係数を示す図であり、
（Ｂ）は帯域制限後の離散コサイン変換係数を示す図で
ある。FIG. 31 regardless of the scanning method of the input signal.
FIG. 5 is a diagram illustrating scanning of a discrete cosine transform coefficient by an alternate scan method. (A) is a diagram showing discrete cosine transform coefficients before band limitation,
(B) is a diagram showing discrete cosine transform coefficients after band limitation.

【図３２】本発明の第２の実施の形態の画像情報変換装
置のブロック図である。FIG. 32 is a block diagram of an image information conversion device according to a second embodiment of the present invention.

【図３３】離散コサイン変換及び逆離散コサイン変換に
高速アルゴリズムを適用した場合の処理フローを示す図
である。FIG. 33 is a diagram showing a processing flow when a high-speed algorithm is applied to discrete cosine transform and inverse discrete cosine transform.

【図３４】本発明の第３の実施の形態の画像情報変換装
置のブロック図である。FIG. 34 is a block diagram of an image information conversion device according to a third embodiment of the present invention.

【図３５】線形内挿により水平方向の１／４画素精度の
補間を行うときの処理内容を示す図である。FIG. 35 is a diagram showing processing contents when performing interpolation with １／ pixel accuracy in the horizontal direction by linear interpolation.

【図３６】４×８離散コサイン変換及び４×８逆離散コ
サイン変換に高速アルゴリズムを適用した場合の処理フ
ローを示す図である。FIG. 36 is a diagram showing a processing flow when a high-speed algorithm is applied to 4 × 8 discrete cosine transform and 4 × 8 inverse discrete cosine transform.

【図３７】４×８離散コサイン変換及び４×８逆離散コ
サイン変換に他の高速アルゴリズムを適用した場合の処
理フローを示す図である。FIG. 37 is a diagram showing a processing flow when another high-speed algorithm is applied to 4 × 8 discrete cosine transform and 4 × 8 inverse discrete cosine transform.

【図３８】逆離散コサイン変換と平均化処理とを１つの
行列で演算する場合の演算式を説明する図である。FIG. 38 is a diagram illustrating an arithmetic expression in a case where the inverse discrete cosine transform and the averaging process are performed using one matrix.

【図３９】逆離散コサイン変換と間引き処理とを１つの
行列で演算する場合の演算式を説明する図である。FIG. 39 is a diagram illustrating an arithmetic expression in the case where the inverse discrete cosine transform and the thinning-out process are performed using one matrix.

【図４０】色差信号の離散コサイン係数の垂直方向の高
域成分４係数を０に置き換える処理を説明する図であ
る。FIG. 40 is a diagram illustrating a process of replacing 4 coefficients in the vertical direction of the discrete cosine coefficient of the color difference signal with 0.

【図４１】従来の画像情報変換装置のブロック図であ
る。FIG. 41 is a block diagram of a conventional image information conversion device.

[Explanation of symbols]

２０，４０，５０画像情報変換装置、２４圧縮情報
解析装置、２５、可変長復号化装置、２６逆量子化装
置、２７帯域制限装置、２８量子化装置、２９可
変長符号化装置、３１符号量制御装置、４１，４４
加算器、４２，５１動き補償誤差補正装置、４３逆量
子化装置、４５逆離散コサイン変換装置、４６，５３
ビデオメモリ、４７，５５動き補償予測装置、４８
離散コサイン変換装置、５２４×８逆離散コサイン変
換装置、５４補間装置、５６４×８離散コサイン変換
装置20, 40, 50 image information conversion device, 24 compression information analysis device, 25, variable length decoding device, 26 inverse quantization device, 27 band limiting device, 28 quantization device, 29 variable length coding device, 31 code amount Control device, 41, 44
Adder, 42, 51 motion compensation error correction device, 43 inverse quantization device, 45 inverse discrete cosine transform device, 46, 53
Video memory, 47, 55 motion compensation prediction device, 48
Discrete Cosine Transformer, 52 4 × 8 Inverse Discrete Cosine Transformer, 54 Interpolator, 564 × 8 Discrete Cosine Transformer

フロントページの続き (72)発明者岡田紳太郎東京都品川区北品川６丁目７番35号ソニー株式会社内 (72)発明者リュウイク東京都品川区北品川６丁目７番35号ソニー株式会社内 (72)発明者柳原尚史東京都品川区北品川６丁目７番35号ソニー株式会社内Ｆターム(参考） 5C059 KK01 KK22 MA00 MA23 MC11 MC38 ME01 NN01 PP04 PP16 SS01 SS02 SS13 TA53 TB08 TC10 TD02 TD03 UA02 UA05 UA11 Continued on the front page (72) Inventor Shintaro Okada 6-7-35 Kita-Shinagawa, Shinagawa-ku, Tokyo Inside Sony Corporation (72) Inventor Ryuik 6-7-35 Kita-Shinagawa, Shinagawa-ku, Tokyo Sony Corporation In-house (72) Inventor Naofumi Yanagihara 6-35, Kita-Shinagawa, Shinagawa-ku, Tokyo F-term in Sony Corporation (reference) 5C059 KK01 KK22 MA00 MA23 MC11 MC38 ME01 NN01 PP04 PP16 SS01 SS02 SS13 TA53 TB08 TC10 TD02 TD03 UA02 UA05 UA11

Claims

[Claims]

1. An image signal is compressed and encoded by performing orthogonal transformation in a predetermined pixel block (orthogonal transformation block) unit and quantizing orthogonal transformation coefficients in the orthogonal transformation block obtained by orthogonal transformation. An image information conversion apparatus for converting the first compressed image information of the first bit rate into second compressed image information of a second bit rate lower than the first bit rate. An inverse quantization means for inversely quantizing the orthogonal transform coefficient according to the quantization width of the orthogonal transform coefficient of the image compression information; and a quantizer for outputting the second image compression information having the second bit rate. An image information conversion device comprising: a quantization unit that requantizes the orthogonal transform coefficient inversely quantized by the inverse quantization unit according to the quantization width.

2. The image processing apparatus according to claim 1, further comprising a band limiting unit configured to limit a value of a high-frequency component in a horizontal direction of the orthogonal transform coefficient inversely quantized by the inverse quantization unit, wherein the quantization unit outputs second image compression information to be output. 2. The method according to claim 1, wherein the orthogonal transform coefficient whose horizontal high-frequency component is limited by the band limiting means is re-quantized in accordance with a quantization width at which the second bit rate becomes the second bit rate. Image information conversion device.

3. The apparatus according to claim 2, wherein said band limiting means limits the orthogonal transform coefficient of the chrominance signal to the orthogonal transform coefficient of a lower component than the orthogonal transform coefficient of the luminance component.
The image information conversion device described in the above.

4. An image compression information analyzing means for analyzing a code amount of each picture of the input first image compression information, and a second image compression means for controlling and outputting a quantization width of the quantization means. Code amount control means for controlling the code amount of the information, wherein the code amount control means sets the code amount per picture of the input first image compression information to B, the first bit rate to R1, When the second bit rate is R2, a target code amount (target bit) T of each picture of the second image compression information is obtained by T = B (R2 / R1) to control the quantization width. The image information conversion device according to claim 1, wherein:

5. The orthogonal quantizer inversely quantized by the inverse quantizer using a quantization matrix different from a quantization matrix obtained by quantizing an orthogonal transform coefficient of the first image compression information. 2. The image information conversion device according to claim 1, wherein the conversion coefficient is requantized.

6. Variable-length decoding means for performing variable-length decoding on the input first image compression information, and variable-length coding means for performing variable-length coding on the image compression information requantized by the quantization means. 2. The image information conversion apparatus according to claim 1, wherein the variable length coding means converts the orthogonal transform block into a one-dimensional signal by an alternate scan method and performs variable length coding.

7. An image signal is compressed and encoded by performing orthogonal transformation in units of a predetermined pixel block (orthogonal transformation block) and quantizing orthogonal transformation coefficients in the orthogonal transformation block obtained by orthogonal transformation. An image information conversion apparatus for converting the first compressed image information of the first bit rate into second compressed image information of a second bit rate lower than the first bit rate. First inverse quantizing means for inversely quantizing the orthogonal transform coefficient according to the quantization width of the orthogonal transform coefficient of the image compression information; and orthogonal transform coefficient inversely quantized by the first inverse quantizing means. Means for adding the motion compensation error coefficient and the motion compensation error correction coefficient, and the motion compensation error coefficient is added by the addition means in accordance with the quantization width such that the output second image compression information has the second bit rate. Sa Quantizing means for requantizing the orthogonal transform coefficient obtained, second inverse quantizing means for inversely quantizing the orthogonal transform coefficient requantized by the quantizing means, and second inverse quantizing means Subtraction means for generating a quantization error coefficient in the frequency domain by subtracting the orthogonal transformation coefficient to which the motion compensation correction coefficient has been added by the addition means from the orthogonal transformation coefficient inversely quantized by the addition means; Motion compensation based on a motion vector by orthogonally transforming the orthogonal transformation coefficient, and a motion compensation error correction means for generating the motion compensation error correction coefficient by performing an inverse orthogonal transformation on a value obtained by the motion compensation. Image information conversion device.

8. A band limiting means for limiting a value of a high frequency component in a horizontal direction of the orthogonal transform coefficient dequantized by the first dequantizing means, wherein the quantizing means outputs The orthogonal transform coefficient whose horizontal high-frequency component is limited by the band limiting means is re-quantized according to a quantization width at which image compression information becomes the second bit rate. 7. The image information conversion device according to 7.

9. The apparatus according to claim 8, wherein said band limiting means limits the orthogonal transform coefficient of the chrominance signal to the orthogonal transform coefficient of a lower component than the orthogonal transform coefficient of the luminance component.
The image information conversion device described in the above.

10. The image information conversion apparatus according to claim 8, wherein said motion compensation error correction means does not perform inverse orthogonal transform and orthogonal transform on the orthogonal transform coefficient set to 0 by said band limiting means.

11. An image compression information analysis means for analyzing the code amount of each picture of the input first image compression information, and a second image compression means for controlling and outputting a quantization width of the quantization means. Code amount control means for controlling the code amount of the information, wherein the code amount control means sets the code amount per picture of the input first image compression information to B, the first bit rate to R1, When the second bit rate is R2, a target code amount (target bit) T of each picture of the second image compression information is obtained by T = B (R2 / R1) to control the quantization width. The image information conversion device according to claim 7, wherein:

12. The orthogonal quantizer inversely quantized by the inverse quantizer using a quantization matrix different from a quantization matrix obtained by quantizing the orthogonal transform coefficient of the first image compression information. The image information conversion device according to claim 7, wherein the conversion coefficient is requantized.

13. Variable-length decoding means for performing variable-length decoding on the input first image compression information, and variable-length coding means for performing variable-length encoding on the image compression information requantized by the quantization means. Wherein the variable length coding means converts the orthogonal transform block into a one-dimensional signal by an alternate scan method and performs variable length coding irrespective of the scan method of the input first image compression information. The image information conversion device according to claim 7, wherein

14. The image information according to claim 7, wherein said motion compensation error correction means generates a motion compensation error correction coefficient for a P picture, and generates a motion compensation error correction coefficient for a B picture. Conversion device.

15. The image information according to claim 7, wherein said motion compensation error correction means generates a motion compensation error correction coefficient for the luminance signal and a motion compensation error correction coefficient for the color difference signal. Conversion device.

16. The image information conversion apparatus according to claim 7, wherein said motion compensation error correction means performs inverse orthogonal transformation and orthogonal transformation based on a high-speed algorithm.

17. The orthogonal transformation coefficient comprises 8 × 8 coefficients of 8 coefficients in a horizontal direction and 8 coefficients in a vertical direction, and the motion compensation error correction means comprises a quantization error comprising 8 coefficients in a horizontal direction and 8 coefficients in a vertical direction. A 4 × 8 inverse orthogonal transform unit for performing a 4 × 8 inverse orthogonal transform on the coefficients to generate a quantization error value in the spatial domain; and a quantum in the spatial domain generated by the 4 × 8 inverse orthogonal transform unit. Motion compensating unit for performing motion compensation on the quantization error value in the horizontal direction with １／ pixel accuracy and motion compensation in the vertical direction with 画素 pixel accuracy to generate a quantization error correction value in the spatial domain A 4 × 8 orthogonal transform unit that performs 4 × 8 orthogonal transformation on the spatial domain quantization error correction value generated by the motion compensation unit, and generates the motion compensation error correction coefficient in the frequency domain. 8. The image information conversion device according to claim 7, comprising: .

18. The inverse orthogonal transform unit according to claim 17, wherein the 4 × 8 inverse orthogonal transform unit performs an inverse orthogonal transform by replacing a value of a vertical high-frequency component of an orthogonal transform coefficient of the color difference signal with 0. Image information conversion device.

19. The motion compensation error correction means according to claim 4, wherein
An interpolation unit that performs pixel interpolation with 1/4 pixel accuracy in the horizontal direction on the quantization error value in the spatial domain generated by the x8 inverse orthogonal transform unit; 18. The image information conversion device according to claim 17, wherein horizontal motion compensation is performed using the interpolated interpolation value.

20. The 4 × 8 inverse orthogonal transform unit performs a fourth-order inverse orthogonal transform on only four low-frequency coefficients among the eighth-order quantization error coefficients in the frequency domain in the horizontal direction, Is 8
The following inverse orthogonal transform is performed. The 4 × 8 orthogonal transform unit performs a fourth-order orthogonal transform on the motion compensation error correction value in the spatial domain in the horizontal direction and an eighth-order orthogonal transform on the vertical direction. 18. The image information conversion device according to claim 17, wherein:

21. The 4 × 8 inverse orthogonal transform section, in the horizontal direction, outputs a high-frequency 4th-order quantized error coefficient in the frequency domain.
After the coefficient is replaced with 0 and an 8th-order inverse orthogonal transform is performed, a thinning process or an averaging process is performed to generate a 4th-order spatial domain quantization error value. Has 4 in the spatial domain
18. The image information conversion device according to claim 17, wherein discrete cosine transform is performed after the motion compensation error correction values of the points are interpolated into eight motion compensation error correction values.

22. The 4 × 8 inverse orthogonal transform unit performs an inverse orthogonal transform and thinning process or an inverse orthogonal transform and an averaging process by one matrix, and the 4 × 8 orthogonal transform unit performs an interpolation process. 22. The image information conversion device according to claim 21, wherein the image information and the orthogonal transform are calculated using one matrix.

23. An image signal is compressed and encoded by performing orthogonal transformation in units of a predetermined pixel block (orthogonal transformation block) and quantizing orthogonal transformation coefficients in the orthogonal transformation block obtained by orthogonal transformation. An image information conversion method for converting the first compressed image information of the first bit rate into second compressed image information of a second bit rate lower than the first bit rate. Is input according to the quantization width of the orthogonal transformation coefficient of the input first image compression information, and the second image compression information to be output is The dequantized orthogonal transform coefficient is requantized in accordance with a quantization width that becomes the second bit rate, and the second image compression information generated by requantization is output. Do Image information conversion method.

24. A value of a high-frequency component in the horizontal direction of the inversely quantized orthogonal transform coefficient is limited, and the second image compression information to be output is determined according to a quantization width such that the second bit rate is obtained. 24. The image information conversion method according to claim 23, further comprising requantizing the orthogonal transform coefficient in which high frequency components in the horizontal direction are limited.

25. Compression encoding of an image signal by performing orthogonal transformation on a predetermined pixel block (orthogonal transformation block) basis and quantizing orthogonal transformation coefficients in the orthogonal transformation block obtained by orthogonal transformation. An image information conversion method for converting the first compressed image information of the first bit rate into second compressed image information of a second bit rate lower than the first bit rate. Is input, and the orthogonal transform coefficient is inversely quantized according to the quantization width of the orthogonal transform coefficient of the input first image compressed information. And the motion compensation error correction coefficient, and the orthogonal transform coefficient to which the motion compensation error coefficient has been added is reproduced according to the quantization width such that the output second image compression information has the second bit rate. And outputs the second image compression information generated by requantization, dequantizes the requantized orthogonal transform coefficient, and adds a motion compensation correction coefficient from the dequantized orthogonal transform coefficient. Subtracting the orthogonal transformation coefficient, orthogonally transforming the subtracted orthogonal transformation coefficient, performing motion compensation based on the motion vector, and performing inverse orthogonal transformation on the value subjected to the motion compensation to generate the motion compensation error correction coefficient. Image information conversion method.

26. A value of a high-frequency component in the horizontal direction of the inversely quantized orthogonal transform coefficient is limited, and the second image compression information to be output is set in accordance with a quantization width such that the second bit rate is obtained. 26. The image information conversion method according to claim 25, further comprising requantizing an orthogonal transform coefficient in which high frequency components in the horizontal direction are restricted.

27. The orthogonal transform coefficient is composed of 8 × 8 coefficients of 8 coefficients in the horizontal direction and 8 coefficients in the vertical direction, and is 4 × 8 with respect to a quantization error coefficient composed of 8 coefficients in the horizontal direction and 8 coefficients in the vertical direction. And generates a quantization error value in the spatial domain. The quantization error value in the spatial domain is 1 /
Motion compensation is performed with 4-pixel accuracy, and motion compensation is performed with 1 / 2-pixel accuracy in the vertical direction to generate a quantization error correction value in the spatial domain. 26. The image information conversion method according to claim 25, wherein an orthogonal transformation of x8 is performed to generate the motion compensation error correction coefficient in the frequency domain.