JP2006086863A

JP2006086863A - Image stream transforming apparatus

Info

Publication number: JP2006086863A
Application number: JP2004270105A
Authority: JP
Inventors: Motoharu Ueda; 基晴上田
Original assignee: Victor Company of Japan Ltd
Current assignee: Victor Company of Japan Ltd
Priority date: 2004-09-16
Filing date: 2004-09-16
Publication date: 2006-03-30

Abstract

PROBLEM TO BE SOLVED: To provide an image stream transforming apparatus (transcoder) capable of generating a satisfactory encoded stream when re-encoding during image stream transformation processing.

SOLUTION: A header extracting circuit 4 extracts header information from the image stream to be transformed, and a syntax evaluation circuit 5 refers to the header information to determine whether the encoding syntax of the image stream is correct. If the encoding syntax is determined to be incorrect, the apparatus evaluates, for example, changes in encoding characteristics such as a change in picture type in order to search for a proper encoding syntax. A syntax re-constructing circuit 6 and an encoding information transformation circuit 9 then re-construct the re-encoding conditions so as to generate a more satisfactory re-encoded stream.

COPYRIGHT: (C)2006, JPO&NCIPI

Description

The present invention relates to an image stream conversion apparatus (also called a transcoder) that re-converts an encoded image bitstream (also called an image bitstream, image stream, or encoded stream) to change its resolution, encoding bit rate, encoding method, and so on. In particular, it relates to an image stream conversion apparatus that can realize a good conversion method that takes the characteristics of the image into account.

In recent years, services have come into practical use that distribute information over various transmission paths such as satellite, terrestrial broadcast, and telephone lines, using digitized image signals compressed by high-efficiency coding. When distributing moving images and audio, such services use the international standard MPEG2 (Moving Picture Experts Group Phase 2) as the high-efficiency coding method. MPEG2 compresses the information content of an image signal by exploiting the correlation between adjacent pixels (spatial direction) and the correlation between adjacent frames or adjacent fields (temporal direction).

Image encoding under the MPEG2 standard proceeds as follows. First, temporally consecutive image frames are divided into reference frames and predicted frames. A reference frame is encoded using only spatial correlation, so the original image can be restored from that frame's encoded data alone. A predicted frame is encoded using both temporal correlation with a reference frame and spatial correlation, which gives higher coding efficiency than a reference frame. A predicted frame is restored from the restored reference frame together with the predicted frame's own encoded data.

Next, the coding structure used in MPEG2 image encoding is described with reference to FIG. 4. In FIG. 4, picture types are numbered where necessary so that they can be distinguished. The I picture (I frame), the reference frame labeled "I" in FIG. 4(A), occurs periodically and serves as the reference for decoding. There are two kinds of predicted frame: the P picture (P frame), labeled "P" in FIG. 4(A), which is encoded using prediction only from a temporally preceding (past) reference frame, and the B picture (B frame), labeled "B" in FIG. 4(A), which is predictively encoded from two reference frames, one before and one after it in time (past and future). The arrows in FIG. 4(A) indicate the prediction directions for P and B pictures. A P picture is itself a predicted frame, but it is also used as a reference frame for other P pictures and for B pictures.

The image signal of an I picture is divided into processing units called macroblocks, each 16 horizontal pixels by 16 vertical pixels on the basis of the luminance signal. Each macroblock is further divided into two-dimensional blocks of 8 × 8 pixels, and each block undergoes DCT (Discrete Cosine Transform) processing, a type of orthogonal transform.

The signal after DCT processing represents the frequency components of the two-dimensional block, so for typical images the components are concentrated in the low-frequency range. In addition, degradation of high-frequency components is visually less noticeable than degradation of low-frequency components. The amount of information is therefore compressed by quantizing low-frequency components finely and high-frequency components coarsely, and then variable-length coding each nonzero coefficient together with the run length of the zero-valued coefficients.
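
The quantization and run-length step described above can be sketched as follows. This is a minimal illustration, not the MPEG2 reference procedure: the quantization matrix values, the truncating division, and the use of a zigzag scan order are assumptions made for the example, and the names `zigzag_order`, `quantize_block`, and `run_level_pairs` are hypothetical.

```python
def zigzag_order(n=8):
    """Return (row, col) positions of an n x n block in zigzag scan order."""
    return sorted(
        ((r, c) for r in range(n) for c in range(n)),
        # Walk the anti-diagonals; alternate direction on odd and even ones.
        key=lambda rc: (rc[0] + rc[1], rc[0] if (rc[0] + rc[1]) % 2 else rc[1]),
    )


def quantize_block(coeffs, matrix, scale=1):
    """Divide each DCT coefficient by its matrix entry times the scale.

    Truncation toward zero is an assumption made to keep the sketch short.
    """
    return [[int(c / (m * scale)) for c, m in zip(crow, mrow)]
            for crow, mrow in zip(coeffs, matrix)]


def run_level_pairs(levels):
    """Scan quantized levels in zigzag order and emit (zero_run, level) pairs.

    Trailing zeros produce no pair; a real bitstream would signal them
    with an end-of-block code.
    """
    pairs, run = [], 0
    for r, c in zigzag_order(len(levels)):
        if levels[r][c] == 0:
            run += 1
        else:
            pairs.append((run, levels[r][c]))
            run = 0
    return pairs


# Hypothetical matrix: the divisor grows with frequency (r + c), so
# high-frequency coefficients are quantized more coarsely.
MATRIX = [[8 + 2 * (r + c) for c in range(8)] for r in range(8)]

coeffs = [[0] * 8 for _ in range(8)]
coeffs[0][0], coeffs[0][1], coeffs[2][0] = 160, -33, 40

levels = quantize_block(coeffs, MATRIX)
pairs = run_level_pairs(levels)
```

Only the three nonzero coefficients survive as pairs; every other position contributes to a zero run or is dropped at the end of the block, which is where the compression comes from.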

Like an I picture, the image signal of a P picture is divided into macroblocks of 16 horizontal × 16 vertical pixels on the basis of the luminance signal. For a P picture, a motion vector relative to the reference frame is calculated for each macroblock. Motion vectors are generally detected by block matching. For each candidate motion vector, block matching computes the sum of absolute differences (or the sum of squared differences) between the pixels of the macroblock and the pixels of the 16 × 16 block taken from the reference frame at the macroblock's horizontal and vertical position displaced by that candidate vector. The candidate that gives the minimum sum is output as the detected motion vector.
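
As a rough sketch of the block matching described above, the following exhaustive search evaluates the sum of absolute differences (SAD) for every candidate displacement in a small window and keeps the minimum. The function names, the search range, and the tie-breaking rule (first minimum in scan order) are assumptions for illustration; frames are plain lists of rows.

```python
def sad(cur, ref, bx, by, dx, dy, size):
    """Sum of absolute differences between the current block at (bx, by)
    and the reference block displaced by (dx, dy)."""
    total = 0
    for y in range(size):
        for x in range(size):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def block_match(cur, ref, bx, by, size=16, search=2):
    """Full search over displacements in [-search, search].

    Returns (dx, dy, cost) for the candidate with the smallest SAD.
    """
    height, width = len(ref), len(ref[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip candidates whose block would fall outside the reference frame.
            if by + dy < 0 or by + dy + size > height:
                continue
            if bx + dx < 0 or bx + dx + size > width:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, size)
            if best is None or cost < best[2]:
                best = (dx, dy, cost)
    return best


# Synthetic frames: the current frame is the reference shifted by (2, 1),
# so the block at (4, 4) should match the reference at displacement (2, 1).
ref = [[x * x + 13 * y * y for x in range(12)] for y in range(12)]
cur = [[(x + 2) ** 2 + 13 * (y + 1) ** 2 for x in range(12)] for y in range(12)]

vector = block_match(cur, ref, 4, 4, size=4, search=2)
```

A full search like this is the simplest correct form; its cost grows with the square of the search range, which is why reusing motion vectors from an earlier encoding pass is attractive for a transcoder.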

The difference is then taken between each pixel of the macroblock and the corresponding pixel of the two-dimensional block extracted using the motion vector. When an accurate motion vector has been detected, the difference block carries far less information than the original macroblock, so coarser quantization than for an I picture becomes possible. In practice, the encoder selects whether to encode the difference block or the non-difference block (intra block) (prediction mode decision), and the selected block undergoes the same DCT and variable-length coding as an I picture to compress the amount of information.

B pictures are processed in the same way as P pictures, except that the reference frames (I and P pictures) exist both before and after the B picture in time, and motion vectors are detected against each of them. A B picture has three prediction options: prediction from the preceding reference frame (forward prediction), prediction from the following reference frame (backward prediction), and the pixel-by-pixel average of the two prediction blocks (average prediction). The prediction mode decision chooses among four methods in total: these three plus the intra method, in which the block is decoded from its own data alone.
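
The four-way prediction mode decision for a B picture block can be sketched as below. The text does not say how the modes are compared, so the cost measures here are assumptions: SAD against each prediction for the three inter modes, and the sum of absolute deviations from the block mean as a stand-in cost for intra coding. Blocks are flat lists of pixel values and `choose_b_mode` is a hypothetical name.

```python
def choose_b_mode(block, fwd_pred, bwd_pred):
    """Pick the cheapest of forward, backward, average, and intra.

    Returns (mode_name, costs). Cost definitions are illustrative only.
    """
    # Pixel-by-pixel average of the two prediction blocks, rounded half up.
    avg_pred = [(f + b + 1) // 2 for f, b in zip(fwd_pred, bwd_pred)]

    def sad(pred):
        return sum(abs(p - q) for p, q in zip(block, pred))

    mean = sum(block) / len(block)
    costs = {
        "forward": sad(fwd_pred),
        "backward": sad(bwd_pred),
        "average": sad(avg_pred),
        # Assumed intra cost: how far the block deviates from its own mean.
        "intra": sum(abs(p - mean) for p in block),
    }
    # On a tie, the mode listed first in the dictionary wins.
    return min(costs, key=costs.get), costs


block = [10, 20, 30, 40]
mode_avg, _ = choose_b_mode(block, [8, 18, 28, 38], [12, 22, 32, 42])
mode_fwd, _ = choose_b_mode(block, [10, 20, 30, 40], [0, 0, 0, 0])
mode_intra, _ = choose_b_mode([50, 50, 50, 50], [0, 0, 0, 0], [0, 0, 0, 0])
```

In the first case the two predictions err in opposite directions, so their average reproduces the block exactly, which illustrates why bidirectional prediction can outperform either single direction.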

Because a B picture can be predicted from reference frames both before and after it in time, its prediction efficiency is higher than that of a P picture. B pictures are therefore generally quantized even more coarsely than P pictures. The block selected in a B picture undergoes the same encoding process as in I and P pictures.

Decoding a B picture involves prediction from a temporally later reference frame, so that reference frame must be encoded before the B picture. During encoding, the input image signal is therefore reordered, as shown in FIG. 4(B), so that each B picture is placed after the I or P picture that serves as its reference frame, and is encoded in that order. In other words, the encoder rearranges the input order of the original images to match the order required for decoding. The decoder, as shown in FIG. 4(C), applies the inverse rearrangement to the order of FIG. 4(B) before output, so that the decoded images are reproduced in the order of the input image signal.
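
The reordering between display order and coding order can be sketched as follows. FIG. 4 is not reproduced here, so the picture pattern `I B B P B B P` is only an assumed example, and the function names are hypothetical. The sketch also assumes the sequence ends with a reference picture (I or P).

```python
def display_to_coding(types):
    """Given picture types in display order, return display indices in coding order.

    Each B picture is held back until the next I or P picture (its later
    reference) has been emitted.
    """
    order, pending_b = [], []
    for index, picture_type in enumerate(types):
        if picture_type == "B":
            pending_b.append(index)
        else:
            order.append(index)
            order.extend(pending_b)
            pending_b = []
    # Assumption: no trailing B pictures; any that remain are appended as-is.
    order.extend(pending_b)
    return order


def coding_to_display(coded):
    """Decoder-side inverse: coded is a list of (label, type) in coding order.

    B pictures are output immediately; each reference picture is held
    until the next reference picture arrives.
    """
    out, held = [], None
    for label, picture_type in coded:
        if picture_type == "B":
            out.append(label)
        else:
            if held is not None:
                out.append(held)
            held = label
    if held is not None:
        out.append(held)
    return out


display_types = ["I", "B", "B", "P", "B", "B", "P"]
coding_order = display_to_coding(display_types)
restored = coding_to_display([(i, display_types[i]) for i in coding_order])
```

The decoder needs to hold only one reference picture back at a time, which matches the role of the output frame memory described later in the text.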

Next, a typical encoding device and decoding device for MPEG2 image encoding are described, beginning with a typical conventional encoding device. FIG. 5 is a block diagram showing an example of a typical conventional encoding device. In FIG. 5, the digital image signal (input image signal) input from the input terminal 201 is supplied to and stored in the input image memory 202, where it is delayed so that it can be rearranged into the order in which it is encoded according to the encoding syntax. The digital image signal output from the input image memory 202 is then cut into macroblocks by the two-dimensional block conversion circuit 203.

Macroblock data for a reference frame is supplied through the subtractor 204 to the orthogonal transform circuit 205, where DCT processing is performed in units of 8 horizontal × 8 vertical pixels to calculate DCT coefficients. The DCT coefficients are grouped into macroblock units of 16 horizontal × 16 vertical pixels on the basis of the luminance signal and sent to the quantization circuit 206. The quantization circuit 206 quantizes the coefficients by, for example, dividing each DCT coefficient by a different value according to a quantization matrix that has a different value for each frequency component. The quantized DCT coefficients are sent to the encoding circuit 214, which performs variable-length or fixed-length encoding by referring to the address in the encoding table 215 that corresponds to each coefficient. The multiplexer 216 then multiplexes the encoded data from the encoding circuit 214 with additional information from the two-dimensional block conversion circuit 203, such as the location of the macroblock within the picture. The result is stored temporarily in the image stream buffer 218 and then output from the output terminal 219 as a bitstream (output image bitstream).

The DCT coefficients quantized by the quantization circuit 206 also undergo inverse quantization and inverse DCT in the inverse quantization circuit 212 and the inverse orthogonal transform circuit 213, which decodes them. The decoded data is supplied through the adder 210 and the deblocking circuit 211 to the reference image memory 209 and stored there. The image stored in the reference image memory 209 is used when encoding predicted frames.

For a predicted frame, the motion vector detection circuit 207 finds the motion vector between the macroblock data cut out from the input image memory 202 and the image stored in the reference image memory 209. The detected motion vector is supplied to the motion compensation prediction circuit 208, which cuts prediction blocks out of the reference image from the reference image memory 209. The motion compensation prediction circuit 208 selects the optimum prediction mode based on the extracted prediction blocks and sends the difference signal relative to the input image block to be encoded to the orthogonal transform circuit 205. This difference signal is processed in the same way as the blocks of a reference frame described above: the DCT coefficients are quantized and, together with the motion vector and prediction mode, output as the output image bitstream from the multiplexer 216 through the image stream buffer 218 and the output terminal 219.

For code amount control, the code amount control circuit 217 compares the code amount of the bitstream output from the multiplexer 216 with the target code amount and controls the quantization fineness (quantization scale) of the quantization circuit 206 so that the output approaches the target. The target code amount for each frame is calculated, for the three picture types (frame types) described above, which differ in information content, from the nature and frequency of occurrence of each picture type at the configured encoding bit rate.

The target code amount is also constrained so that the stream buffer that virtually simulates the decoding device (called the VBV (Video Buffer Verifier) buffer) neither overflows nor underflows. Because the quantization scale and the output code amount are in general roughly inversely proportional, the quantization scale value for the target code amount is calculated for each frame type, and quantization is performed with it. The quantization scale is then varied block by block in the direction that brings the output closer to the target code amount, so that the encoded stream stays within the target.
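
A minimal sketch of these two ideas follows: clamping a picture's target code amount against a simplified decoder buffer model, and estimating a quantization scale from the roughly inverse relationship between scale and code amount. The buffer model, the function names, and the scale range of 1 to 31 are assumptions for illustration and do not reproduce any particular rate control algorithm.

```python
def clamp_target_to_vbv(target_bits, fullness, buffer_size, refill_bits):
    """Keep the picture's target inside what a simplified decoder buffer allows.

    fullness:     bits in the buffer just before the picture is removed
    refill_bits:  bits that arrive before the next picture is removed
    """
    # Underflow: the picture cannot use more bits than have arrived.
    upper = fullness
    # Overflow: the picture must remove enough bits to make room for the refill.
    lower = max(0, fullness + refill_bits - buffer_size)
    return min(max(target_bits, lower), upper)


def estimate_scale(prev_scale, prev_bits, target_bits, lo=1, hi=31):
    """Estimate a quantization scale assuming scale * bits stays roughly constant."""
    if target_bits <= 0:
        raise ValueError("target_bits must be positive")
    complexity = prev_scale * prev_bits  # measured on an earlier picture of this type
    return min(max(round(complexity / target_bits), lo), hi)


# A picture previously coded at scale 8 used 400000 bits. Halving the
# budget roughly doubles the scale.
scale = estimate_scale(8, 400_000, 200_000)

# A target larger than the buffer contents is cut back to avoid underflow.
target = clamp_target_to_vbv(500_000, fullness=400_000,
                             buffer_size=1_835_008, refill_bits=200_000)
```

The per-block adjustment mentioned in the text would sit on top of this: after each macroblock, the running bit count is compared with the expected share of the target and the scale is nudged accordingly.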

Next, a typical conventional decoding device is described. FIG. 6 is a block diagram showing an example of a typical conventional decoding device. In FIG. 6, the image bitstream (image stream) input from the input terminal 101 is first stored in the image stream buffer 102. A virtually simulated buffer value is written in the image bitstream; by starting the decoding process described below only after that amount of the bitstream has accumulated in the image stream buffer 102, the decoder can prevent the buffer from failing and decoding from stopping. The variable-length decoding circuit 103 separates additional information such as the quantization scale, prediction mode, and motion vector from the bitstream output by the image stream buffer 102 and decodes the quantized DCT coefficients.

The decoded DCT coefficients are processed in the same way as in the inverse quantization circuit 212 and inverse orthogonal transform circuit 213 of the encoding device shown in FIG. 5: the inverse quantization circuit 105 and the inverse orthogonal transform circuit 111 perform inverse quantization and inverse DCT, and the resulting intra block or difference block is supplied to the adder 107. For a predicted block, the motion compensation prediction circuit 106 uses the prediction mode and motion vector decoded by the variable-length decoding circuit 103 to cut a prediction block out of the reference image signal read from the reference image memory 109 (the image signal of an I or P picture stored earlier). The adder 107 adds the decoded intra block or difference block to the prediction block cut out by the motion compensation prediction circuit 106, restoring the image signal of the macroblock.

The macroblock data (macroblock image signal) restored by the adder 107 is supplied to the deblocking circuit 108 and converted back into an image signal in image scan order. For an I or P picture, the data is written to the reference image memory 109; for a B picture, it is stored temporarily in the output frame memory 110 and then output as the image signal (output image signal). The I or P picture data held in the reference image memory 109 is transferred to the output frame memory 110 according to the image output timing shown in FIGS. 4(A) to 4(C) and then output as the image signal (output image signal) in the same way as a B picture.

In systems that distribute image information using the image encoding techniques described above, it is sometimes necessary to decode an encoded bitstream and then encode it again. For example, information may be transmitted from the place where it was gathered and recorded to the place from which it is distributed over a wired or wireless communication line or on a recording medium. If the bandwidth of that transmission path differs from the bandwidth of the transmission path of the distribution system, the bitstream must be re-encoded to change its bit rate. A broadcasting station that gathers material, edits image data recorded on a VTR or similar device, and then broadcasts it is one such case. Likewise, recording an encoded stream received by broadcast onto a given recording medium may require re-encoding the image stream to suit the recording capacity and recording rate of the medium. A device that performs such re-encoding is called a transcoder (image stream conversion apparatus). The basic configuration of the decoding device and encoding device in a transcoder can be realized, for example, by directly connecting the image output of the decoding device (for example, the output terminal 112 shown in FIG. 6) to the image input of the encoding device (for example, the input terminal 201 shown in FIG. 5).

To reduce coding degradation at re-encoding, a scheme has been considered for such transcoders in which frame information such as the picture type used in the original encoding, together with macroblock additional information (hereinafter collectively called encoding information), is superimposed on the image signal (decoded baseband signal) carried within the transcoder. In MPEG2 encoding, for example, image quality generally degrades in the order I picture, P picture, B picture. An I picture is a reference frame, so it is quantized more finely than other frames, and because it does not refer to other images it is unaffected by reference frame degradation. P and B pictures, in that order, are quantized more coarsely and are more susceptible to reference frame degradation.

By making effective use of this encoding information, a transcoder can improve the quality of the re-encoded data and make re-encoding more efficient. For example, by referring to the encoding information during re-encoding and matching the picture type used at re-encoding to the picture type of the encoded data before decoding, the transcoder can reduce the degradation factors described above. Referring to the macroblock additional information reduces the processing needed for motion vector detection, and referring to the quantization scale and the number of encoded bits gives an indication of the information content of each macroblock, enabling good code amount control during bit rate conversion.

The configuration of the decoding device and encoding device in a transcoder that passes on such encoding information is described next. FIG. 7 is a block diagram showing an example of a typical conventional stream conversion recording apparatus. The apparatus shown in FIG. 7 is basically the encoding device of FIG. 5 connected to the decoding device of FIG. 6, so the following describes only the functional blocks added in FIG. 7 and only the parts related to image encoding.

The variable-length decoding circuit 2 (corresponding to the variable-length decoding circuit 103 in FIG. 6) of the decoding device 100, a component of the image stream conversion apparatus, obtains the frame information, the macroblock additional information, and the number of encoded bits of each macroblock when decoding the image stream, and supplies this data to the encoding information generation circuit 7. The encoding information generation circuit 7 collects and formats the frame information and macroblock additional information frame by frame and stores them in the encoding information memory 8.

The encoding information memory 8 has storage capacity for as many frames as the output frame memory 3 (corresponding to the output frame memory 110 in FIG. 6) holds, so that it can accommodate the output reordering of I, P, and B pictures. The encoding information stored in the encoding information memory 8, consisting of frame information and macroblock additional information, is supplied to the encoding information superimposing circuit 10. There, the order in which encoding information is taken out is changed according to picture type, and the information is output in synchronization with the image data (output image signal) output from the output frame memory 3 in the decoding device 100. The data output from the encoding information superimposing circuit 10 is thus the encoding information that accompanies the image data output from the output frame memory 3. This encoding information is stored in the encoding information memory 11. At the same time, in the encoding device 200, a component of the image stream conversion apparatus, the image data (image signal) output from the output frame memory 3 is stored in the input frame memory 14 (corresponding to the input image memory 202 in FIG. 5).

The encoding information stored in the encoding information memory 11 is read by the encoding information separation circuit 12, which separates it into frame information and macroblock additional information and sends these to the encoding syntax control circuit 13 and the macroblock information generation circuit 15. The encoding syntax control circuit 13 detects the frame type from the frame information it receives and controls the rearrangement of the input images into encoding order. If the encoding information separation circuit 12 cannot extract the encoding information for a frame, it sends a non-detection signal to the encoding syntax control circuit 13; the encoding syntax is then constructed within the encoding device 200 as in ordinary encoding, and encoding proceeds on that basis. The macroblock information generation circuit 15, for its part, receives from the encoding information separation circuit 12 the macroblock additional information, reordered in the encoding information separation circuit 12 according to the encoding syntax.

The macroblock information generation circuit 15 supplies the encoding information received from the encoding information separation circuit 12 to the motion compensation prediction circuit 16 (corresponding to the motion compensation prediction circuit 208 in FIG. 5) and the code amount control circuit 17 (corresponding to the code amount control circuit 217 in FIG. 5) in the encoding device 200. Using the motion vector and prediction mode contained in the encoding information, the motion compensation prediction circuit 16 cuts a prediction block out of the reference image memory 18 (corresponding to the reference image memory 209 in FIG. 5) and supplies it to the subtractor 19 (corresponding to the subtractor 204 in FIG. 5). The subtractor 19 generates the difference signal relative to the input image block to be encoded from the input frame memory 14 and sends it to the orthogonal transform circuit 20 (corresponding to the orthogonal transform circuit 205 in FIG. 5). If the encoding information separation circuit 12 cannot extract the encoding information for a picture or for an individual macroblock, it sends a non-detection signal to the macroblock information generation circuit 15, and the encoding device 200 performs motion vector detection and prediction mode selection as in ordinary encoding.
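
The fallback behavior described above (reuse the inherited motion vector and prediction mode when they are available, otherwise run ordinary motion detection) can be sketched as follows. The `MacroblockInfo` type and `motion_for_macroblock` function are hypothetical, and `None` stands in for the non-detection signal.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass(frozen=True)
class MacroblockInfo:
    """Hypothetical container for inherited per-macroblock encoding information."""
    motion_vector: Tuple[int, int]
    prediction_mode: str


def motion_for_macroblock(
    inherited: Optional[MacroblockInfo],
    search: Callable[[], MacroblockInfo],
) -> Tuple[MacroblockInfo, bool]:
    """Return (info, reused).

    inherited is None when the encoding information could not be extracted;
    in that case the ordinary motion search is run instead.
    """
    if inherited is not None:
        return inherited, True
    return search(), False


search_calls = []


def full_search() -> MacroblockInfo:
    # Stand-in for ordinary motion vector detection and mode selection.
    search_calls.append(1)
    return MacroblockInfo(motion_vector=(0, 0), prediction_mode="forward")


reused_info, reused = motion_for_macroblock(
    MacroblockInfo(motion_vector=(3, -1), prediction_mode="average"), full_search)
fallback_info, fallback_reused = motion_for_macroblock(None, full_search)
```

Passing the search as a callable means the expensive path runs only when the inherited information is missing, which reflects the processing saving the text attributes to reusing macroblock additional information.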

The code amount control circuit 17 is supplied, as encoded information, with the quantization scale of each macroblock at the time of the original encoding and the number of bits that encoding required. It compares this information with the code amount set for the encoding bit rate that is the current control target. The code amount control circuit 17 thereby determines an appropriate quantization scale, and the encoding process is performed. In the above description the encoded information memories 8 and 11 are provided separately, but in a transcoder in which the decoding device 100 and the encoding device 200 are integrated, the encoded information memories 8 and 11 can be combined into one.
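
The text does not give the rule by which the code amount control circuit 17 derives the new quantization scale. As a rough illustration only, the sketch below assumes a first-order model in which the number of bits is inversely proportional to the quantization scale. The function name `rescale_quantizer` and the model itself are assumptions made for this example and are not taken from the patent; the range 1 to 31 is the range of the MPEG-2 quantiser scale code.

```python
def rescale_quantizer(orig_q: int, orig_bits: int, target_bits: int) -> int:
    """Estimate a quantization scale for re-encoding one macroblock.

    Assumption (not from the patent): bits are roughly proportional to
    1 / quantization scale, so halving the bit budget doubles the scale.
    """
    if target_bits <= 0:
        raise ValueError("target_bits must be positive")
    estimate = orig_q * orig_bits / target_bits
    # Clamp to the MPEG-2 quantiser scale code range 1..31.
    return max(1, min(31, round(estimate)))


if __name__ == "__main__":
    # Original macroblock used scale 8 and 4000 bits; the new budget is 2000 bits.
    print(rescale_quantizer(8, 4000, 2000))
```

A real rate controller would also track buffer fullness and picture complexity; this sketch only shows how the inherited scale and bit count can seed the decision.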

In contrast to the transcoder processing described above, there is also a method that does not extract encoded information but instead makes it possible always to inherit the picture type of the input image stream as the picture type for re-encoding, thereby reducing the reordering processing in the decoding circuit and the encoding circuit in MPEG coding. This method is disclosed, for example, in Patent Document 1 below.
Patent Document 1: Japanese Unexamined Patent Publication No. 2000-92497 (paragraphs 0019 to 0025, FIG. 1)

When a conventional image stream conversion apparatus performs re-encoding, it generally assumes that the input image stream was encoded in a nearly ideal manner, treats the encoded information, picture types, and other data extracted from the image stream as correct, and reflects them unchanged in the re-encoding. The technique disclosed in Patent Document 1 above, for example, can be said to rest on this idea.

However, if the input image stream is of poor quality and was not encoded in the best possible way, carrying its encoded information over into re-encoding also carries over the adverse effects (such as quality degradation) present in the input image stream, so that an inaccurate image stream conversion may result. In other words, when the parameters of the original image stream are inherited and reflected unchanged during re-encoding, as in the conventional technique, any unfavorable parameters of the original image stream may be inherited as well.

In addition, when the encoding bit rate of the input image stream is changed substantially and re-encoding is performed in a manner that does not suit the new encoding bit rate (particularly when converting to a low encoding bit rate), for example when an image stream received by broadcast is compressed and recorded on a recording medium, appropriate and sufficient compression cannot be achieved and image quality degrades significantly during re-encoding.

In view of the above problems, an object of the present invention is to provide an image stream conversion apparatus that can generate a good encoded stream at the time of re-encoding in image stream conversion processing, in which an encoded stream of an image that has been encoded once is decoded, converted to the required code amount, image size, and so on, and re-encoded.

To achieve the above object, the present invention provides an image stream conversion apparatus for decoding an image stream and re-encoding the image stream decoded by the decoding process, comprising:
header information extraction means for extracting header information relating to the image stream;
syntax evaluation means for referring to the header information extracted by the header information extraction means, determining whether a good encoded stream can be generated if the image stream is re-encoded based on the encoding syntax of the image stream, and, if it is determined that a good encoded stream cannot be generated, determining whether to change the encoding syntax;
syntax reconstruction means for reconstructing the encoding syntax of the image stream based on the determination result of the syntax evaluation means regarding the appropriate encoding syntax; and
encoded information conversion means for converting, based on the encoding syntax reconstructed by the syntax reconstruction means, the encoded information relating to the image stream that is used for re-encoding the image stream.

Further, the present invention provides the image stream conversion apparatus according to claim 1, wherein the syntax evaluation means uses at least one of the following parameters: the number of bits required for each picture in the image stream, the motion vector values, the intra-picture distribution of prediction modes, and the per-frame sum of the quantization scale, to determine whether a good encoded stream can be generated if the image stream is re-encoded based on the picture type of each picture of the image stream, and, if it is determined that a good encoded stream cannot be generated, determines whether to change the picture type.

By re-evaluating the encoded information of the image stream before conversion and reconstructing that information as necessary, the image stream conversion apparatus according to the present invention can generate a good encoded stream at the time of re-encoding.

An image stream conversion apparatus according to an embodiment of the present invention is described below with reference to the drawings. FIG. 1 is a block diagram showing an example of the image stream conversion apparatus according to the embodiment. The image stream conversion apparatus shown in FIG. 1 shares some components with the conventional image stream conversion apparatus described above (see FIG. 7); the description of these common components is omitted here.

When the image stream conversion apparatus shown in FIG. 1 decodes an encoded stream of an image and then re-encodes it, it analyzes the encoded stream before decoding and decides, according to predetermined conditions for re-encoding, whether to change or inherit the various parameters of the encoded information. It is thereby configured so that the encoding syntax control, motion vector detection, code amount control, and other processing at the time of re-encoding can be performed appropriately. The configuration and operation of the image stream conversion apparatus shown in FIG. 1 are described in detail below.

When an input bit stream (image stream) is stored in the image stream buffer 1 (corresponding to the image stream buffer 102 shown in FIG. 6) of the decoding device 100, which is a component of the image stream conversion apparatus, it is supplied immediately to the header extraction circuit 4, before being output to the variable length decoding circuit 2 (corresponding to the variable length decoding circuit 103 shown in FIG. 6) of the decoding device 100. The header extraction circuit 4 extracts the header information of the image stream ahead of (or independently of) the decoding process. This extraction basically consists of extracting or calculating the picture type and the required number of bits of each picture of the MPEG image stream. The required number of bits can be obtained by calculating the interval between picture headers.
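
The header scan described above can be sketched in a few lines. The example below assumes a raw MPEG-2 video elementary stream held in memory; it relies on the MPEG-2 picture start code (00 00 01 00) and on the 3-bit picture_coding_type field that follows the 10-bit temporal_reference in the picture header. The function name `scan_picture_headers` is hypothetical. Because the size is measured as the distance between picture headers, it also counts any sequence or GOP headers that precede the next picture, so it is an approximation of the bits spent on the picture.

```python
PICTURE_START_CODE = b"\x00\x00\x01\x00"
PICTURE_TYPES = {1: "I", 2: "P", 3: "B"}


def scan_picture_headers(stream: bytes):
    """Return a list of (picture_type, size_in_bits), one entry per picture."""
    # Locate every picture start code in the buffer.
    positions = []
    pos = stream.find(PICTURE_START_CODE)
    while pos != -1:
        positions.append(pos)
        pos = stream.find(PICTURE_START_CODE, pos + 4)

    pictures = []
    for i, start in enumerate(positions):
        if start + 6 > len(stream):
            break  # picture header is truncated
        # Byte 5 holds 2 bits of temporal_reference, then the 3-bit
        # picture_coding_type, then the top bits of vbv_delay.
        coding_type = (stream[start + 5] >> 3) & 0x07
        # Size = interval to the next picture header (or to the end of data).
        end = positions[i + 1] if i + 1 < len(positions) else len(stream)
        pictures.append((PICTURE_TYPES.get(coding_type, "?"), (end - start) * 8))
    return pictures


if __name__ == "__main__":
    i_pic = PICTURE_START_CODE + bytes([0x00, 0x08]) + b"\xff" * 10
    p_pic = PICTURE_START_CODE + bytes([0x00, 0x10]) + b"\xff" * 4
    print(scan_picture_headers(i_pic + p_pic))
```

Nothing here requires variable length decoding, which is why this step can run ahead of, or independently of, the decoder.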

The header extraction circuit 4 may also be given a decoding function similar to that of the variable length decoding circuit 2 of the decoding device 100, in which case it can also calculate motion vectors, prediction modes, quantization scales, and so on. The following description assumes that the variable length decoding circuit 2 has a decoding function and that the above parameters can be calculated.

The syntax evaluation circuit 5 evaluates the encoding characteristics of the encoded stream by referring to the header information extracted by the header extraction circuit 4. Basically, an I picture is encoded as an I picture again, with no change to its syntax (also called encoding syntax); that is, the syntax is inherited. For P pictures and B pictures, the validity of the syntax is judged from the header information. Various parameters relating to the encoding characteristics of the image can be obtained from the input encoded stream and its header information, such as the number of bits required for each picture, the motion vector values, the intra-picture distribution and frequency of prediction modes, and the quantization scale values. The syntax evaluation circuit 5 can evaluate the encoding characteristics using any one of these parameters or a combination of them.

The evaluation of P pictures and B pictures in the syntax evaluation circuit 5 can be performed, for example, as follows. Here the evaluation unit is a pair consisting of one reference picture (I picture or P picture) and the B pictures in between.

The correctness determination for a reference picture in the syntax evaluation circuit 5 of the image stream conversion apparatus according to the present invention is now described with reference to FIG. 2. FIG. 2 is a flowchart showing the reference picture correctness determination performed by the syntax evaluation circuit of the image stream conversion apparatus according to the embodiment of the present invention.

First, it is determined whether the reference picture is an I picture or a P picture (step S1001). If the reference picture is an I picture, its position is not changed, so the reference picture determination is judged to be correct (step S1003).

If the reference picture is a P picture, the judgment is made by measuring the prediction accuracy (prediction efficiency) of the P picture. For example, the number of bits Bits(P) required by the P picture and its quantization scale value AvgQ(P) are examined; if Bits(P) is no greater than a threshold α and AvgQ(P) is no greater than a threshold β ("Yes" in step S1005), the reference picture determination is judged to be correct. The thresholds α and β are variables controlled by the encoding bit rate of the input stream. The quantization scale value AvgQ(P) can be regarded as a parameter indicating how likely prediction errors are. When the reference picture determination is judged to be correct in this way, the syntax of the frame is inherited for re-encoding.

If the above test does not judge the determination to be correct ("No" in step S1005), the number of bits Bits(PrevI) and quantization scale value AvgQ(PrevI) of the most recently input I picture (the I picture serving as the reference at decoding) are compared with the number of bits Bits(P) and quantization scale value AvgQ(P) of the P picture. The comparison can use, for example, the product of the number of bits and the quantization scale value.

For example, if the comparison between the P picture and the most recently input I picture satisfies
Bits(P) * AvgQ(P) * γ > Bits(PrevI) * AvgQ(PrevI)
where PrevI denotes the most recent I picture ("Yes" in step S1007), the reference picture determination is judged to be incorrect and the P picture is changed to an I picture (step S1009). The γ in the above expression is a variable calculated from the number of intra modes among the prediction modes of the P picture.

If the above condition is not satisfied ("No" in step S1007), the reference picture determination is judged to be correct, and the syntax of the P picture is kept unchanged and inherited as a P picture (step S1111). The inheritance or change of syntax for the reference picture is decided in this way.
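
The flow of FIG. 2 as described in the text can be written as a short decision function. This is a minimal sketch, not the patented implementation: the class `PictureStats` and the function `judge_reference_picture` are names invented for the example, and α, β, and γ are passed in as plain numbers because the text states only what they depend on, not how they are computed.

```python
from dataclasses import dataclass


@dataclass
class PictureStats:
    picture_type: str  # "I" or "P" for a reference picture
    bits: int          # Bits(x): bits spent on the picture
    avg_q: float       # AvgQ(x): quantization scale value of the picture


def judge_reference_picture(pic: PictureStats, prev_i: PictureStats,
                            alpha: float, beta: float, gamma: float) -> str:
    """Return the picture type to use when re-encoding a reference picture."""
    # S1001 / S1003: an I picture always keeps its type.
    if pic.picture_type == "I":
        return "I"
    # S1005: a cheap, finely quantized P picture means prediction worked.
    if pic.bits <= alpha and pic.avg_q <= beta:
        return "P"
    # S1007: compare the bits * scale products with the most recent I picture.
    if pic.bits * pic.avg_q * gamma > prev_i.bits * prev_i.avg_q:
        return "I"  # S1009: the P picture is changed to an I picture
    return "P"      # otherwise the P picture is inherited


if __name__ == "__main__":
    prev_i = PictureStats("I", 100_000, 8.0)
    costly_p = PictureStats("P", 90_000, 12.0)
    print(judge_reference_picture(costly_p, prev_i, alpha=40_000, beta=10.0, gamma=1.0))
```

In the usage example the P picture costs more than the preceding I picture once the quantization scale is taken into account, so it is re-typed as an I picture.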

Next, the B picture determination in the syntax evaluation circuit 5 of the image stream conversion apparatus according to the present invention is described with reference to FIG. 3. FIG. 3 is a flowchart showing the B picture determination performed by the syntax evaluation circuit of the image stream conversion apparatus according to the embodiment of the present invention.

Like the P picture determination described above, the B picture determination can use the number of bits and the quantization scale value, and additionally the frequency with which each prediction mode appears. The B picture determination is based on the result of the reference picture correctness determination described above: that determination yields a result indicating whether the reference picture determination is correct or incorrect, and the syntax evaluation circuit 5 performs the B picture determination according to that result. The following description takes the case where the reference picture is a P picture as an example, but the same processing is performed when the reference picture is an I picture.

In the B picture determination, the result of the reference picture determination (correct or incorrect) is obtained first (step S2001). If the reference picture determination has been judged correct ("Yes" in step S2001), it is determined whether the B picture is an isolated frame (a frame with no neighboring frame worth predicting from). This isolated frame test can use, for example, the product of the number of bits and the quantization scale value.

For example, if the comparison between the reference picture (here a P picture) and the B picture under test satisfies
Bits(B) * AvgQ(B) * Δ > Bits(P) * AvgQ(P)
and the number of intra modes among the prediction modes is at least a threshold ε (that is, the number of intra predictions is at least ε) ("Yes" in step S2003), the B picture is judged to be an isolated frame and its picture type is changed to I picture (step S2005). The Δ in the above expression is a variable calculated from the number of intra modes among the prediction modes of the B picture.

If the above conditions are not satisfied ("No" in step S2003), the B picture is judged not to be an isolated frame, and its syntax is kept unchanged and inherited as a B picture (step S2007).

If, on the other hand, the reference picture determination for the P picture has been judged incorrect ("No" in step S2001), it is assumed, for example, that a scene change has occurred at a B picture; the switching point is searched for, and the syntax is kept or changed according to the search result (step S2009).
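
The branches of FIG. 3 (steps S2001 to S2009) can be summarized as follows. This is an illustrative sketch under stated assumptions: the function name `judge_b_picture`, its argument names, and the string it returns for the scene change branch are all hypothetical, and Δ and ε are supplied by the caller because the text does not give formulas for them.

```python
def judge_b_picture(b_bits: int, b_avg_q: float, b_intra_count: int,
                    ref_bits: int, ref_avg_q: float, reference_ok: bool,
                    delta: float, epsilon: int) -> str:
    """Return "I", "B", or "search_scene_change" for one B picture."""
    # S2001: if the reference picture was judged incorrect, a scene change
    # is assumed and the switching point search (S2009) takes over.
    if not reference_ok:
        return "search_scene_change"
    # S2003: isolated frame test. Both conditions must hold.
    costly = b_bits * b_avg_q * delta > ref_bits * ref_avg_q
    mostly_intra = b_intra_count >= epsilon
    if costly and mostly_intra:
        return "I"  # S2005: isolated frame, re-typed as an I picture
    return "B"      # S2007: syntax inherited unchanged


if __name__ == "__main__":
    print(judge_b_picture(60_000, 10.0, 900, 50_000, 9.0,
                          reference_ok=True, delta=1.0, epsilon=500))
```

Keeping the scene change branch as a separate return value mirrors the flowchart, where step S2009 is a distinct procedure rather than a picture type decision.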

An example of the switching point search (step S2009) is now described for the case of period M = 3 illustrated in FIG. 4. With M = 3, two B pictures (called B1 and B2) lie between two P pictures (or between an I picture and a P picture). The search begins by comparing, for each of B1 and B2, how often forward and backward prediction modes appear.

If the frequency of forward prediction is at or above a threshold in both B1 and B2 (that is, forward prediction is judged dominant in both), a scene change is expected after B2. In this case it is desirable to change B2 to a P picture.

If the frequency of backward prediction is at or above a threshold in both B1 and B2 (that is, backward prediction is judged dominant in both), a scene change is expected before B1. In this case it is desirable to keep the syntax of both B1 and B2 unchanged. It is also possible to make the GOP (Group of Pictures) a closed GOP at the scene boundary so that B1 and B2 do not depend on the preceding GOP; in that case, information for setting the closed GOP flag is output.

If the frequency of forward prediction is at or above a threshold in B1 and the frequency of backward prediction is at or above a threshold in B2 (that is, forward prediction is judged dominant in B1 and backward prediction dominant in B2), a scene change is expected between B1 and B2. In this case it is desirable to change B1 to a P picture. Here too, the scene boundary can be set as a closed GOP. Under any other condition, it is basically desirable to keep the syntax unchanged.
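
The three cases of the switching point search for M = 3 can be collected into one function. The sketch below is only an illustration: `find_switching_point` and its return format are hypothetical, the frequencies are assumed to be fractions of macroblocks between 0 and 1, and a single threshold above 0.5 is assumed so that at most one prediction direction can be dominant in a picture. The `closed_gop_possible` entry reflects that the text presents the closed GOP setting as an option, not a requirement.

```python
def find_switching_point(b1_forward: float, b1_backward: float,
                         b2_forward: float, b2_backward: float,
                         threshold: float) -> dict:
    """Decide the picture types of B1 and B2 from prediction mode frequencies."""
    result = {"B1": "B", "B2": "B", "closed_gop_possible": False}
    if b1_forward >= threshold and b2_forward >= threshold:
        # Scene change expected after B2: B2 becomes a P picture.
        result["B2"] = "P"
    elif b1_backward >= threshold and b2_backward >= threshold:
        # Scene change expected before B1: types are kept, and the GOP
        # may optionally be closed at the scene boundary.
        result["closed_gop_possible"] = True
    elif b1_forward >= threshold and b2_backward >= threshold:
        # Scene change expected between B1 and B2: B1 becomes a P picture.
        result["B1"] = "P"
        result["closed_gop_possible"] = True
    # Any other combination keeps the syntax unchanged.
    return result


if __name__ == "__main__":
    print(find_switching_point(0.9, 0.05, 0.1, 0.85, threshold=0.6))
```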

The syntax evaluation circuit 5 thus decides whether to keep or change the syntax, and supplies the determination result to the syntax reconstruction circuit 6. The syntax reconstruction circuit 6 updates the syntax according to the determination result and supplies the updated syntax to the encoded information conversion circuit 9.

The encoded information conversion circuit 9 compares the picture type supplied from the encoded information memory 8 with the picture type generated from the updated syntax supplied by the syntax reconstruction circuit 6. If the picture types differ, it converts the motion vectors and prediction modes. For example, if the comparison shows that a B picture has become a P picture, the motion vector is used unchanged when the prediction mode is forward prediction (prediction from the preceding reference image), whereas a macroblock using backward prediction or average prediction is converted to intra mode.
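
The B-to-P conversion rule can be illustrated per macroblock. In this sketch each macroblock is represented as a `(mode, forward_motion_vector)` tuple, which is a data layout assumed for the example; the function name `convert_b_to_p` is likewise hypothetical. The text specifies the handling of forward, backward, and average prediction; leaving intra macroblocks as intra is an assumption of this sketch.

```python
def convert_b_to_p(macroblocks):
    """Convert per-macroblock prediction info when a B picture becomes P.

    macroblocks: list of (mode, forward_mv), where mode is one of
    "forward", "backward", "average", "intra" and forward_mv is an
    (x, y) tuple or None.
    """
    converted = []
    for mode, forward_mv in macroblocks:
        if mode == "forward":
            # Forward prediction is still valid in a P picture: reuse the vector.
            converted.append(("forward", forward_mv))
        else:
            # Backward and average prediction have no P picture equivalent,
            # so the macroblock falls back to intra coding.
            converted.append(("intra", None))
    return converted


if __name__ == "__main__":
    mbs = [("forward", (3, -1)), ("backward", None), ("average", (2, 2))]
    print(convert_b_to_p(mbs))
```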

For a B picture (for example B1) whose immediately following B picture (for example B2) has been changed to a P picture, the backward reference image of B1 becomes the image that was B2 in the input image stream. In this case, if B1 used backward prediction, its motion vectors are scaled down in proportion to the distance between the predicted frames. As noted above, B2 immediately after B1 is changed to a P picture basically when there is a scene change or the like between B1 and B2 and forward prediction is judged dominant for B1 (with almost no backward prediction), so the scaling of motion vectors is expected to cause almost no adverse effect.
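
The proportional scaling of a backward motion vector can be shown with a small helper. The name `scale_motion_vector` and the use of rounding to the nearest integer are assumptions of this sketch; the text states only that the vector is reduced in proportion to the distance between the predicted frames. In the M = 3 case, B1 originally referred backward to a reference picture two frames away, and after B2 becomes a P picture the reference is one frame away.

```python
def scale_motion_vector(mv, old_distance: int, new_distance: int):
    """Scale an (x, y) motion vector by new_distance / old_distance."""
    if old_distance <= 0:
        raise ValueError("old_distance must be positive")
    x, y = mv
    return (round(x * new_distance / old_distance),
            round(y * new_distance / old_distance))


if __name__ == "__main__":
    # B1's backward reference moves from 2 frames away to 1 frame away.
    print(scale_motion_vector((8, -4), old_distance=2, new_distance=1))
```

A scaled vector is only an estimate of the true motion, which is consistent with the text's remark that the approach works because few backward-predicted macroblocks are expected in this situation.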

In this way, the encoded information conversion circuit 9 converts the motion vectors and prediction modes and also updates the encoded information, including rewriting the picture type. The encoded information updated by the encoded information conversion circuit 9 is supplied to the encoded information superimposing circuit 10, after which encoding is performed in the same way as in the conventional technique, based on the updated encoded information.

In the embodiment described above, only the encoded information of the input image stream is used as the input for judgment by the syntax evaluation circuit 5. However, when encoding is to end at a given playback time, when an I picture is to be inserted at a particular frame, or when the GOP is to be switched at a particular frame, frame position designation information specifying these positions can be input from outside, and syntax switching can then use this information in the same way as the judgment input above. In this case the valid portion of the encoded information of the input image stream can be identified and the correctable portion can be corrected, so encoding efficiency is higher than when the encoding device 200 simply switches the syntax by itself.

The present invention can be realized, for example, by performing the syntax evaluation at the same time as the decoding and encoding (conversion) of the input image sequence. Alternatively, when the input image stream to be re-encoded is recorded on a recording medium, for example, the syntax evaluation can be performed on the entire sequence in advance and the resulting information (on whether to keep or change the syntax across the entire sequence) stored in a recording medium, buffer, or the like; this information is then referred to when the input image sequence is converted.

In the embodiment described above, hardware elements such as circuits and schematic blocks are illustrated as examples of the components of the image stream conversion apparatus according to the present invention. As with conventional image stream conversion apparatuses, however, these hardware elements can also be realized by software (a program) executable by a computer.

The image stream conversion apparatus according to the present invention can generate a good encoded stream at the time of re-encoding and is applicable to the technical field of re-converting (transcoding) image-encoded bit streams.

FIG. 1 is a block diagram showing an example of the image stream conversion apparatus according to the embodiment of the present invention.
FIG. 2 is a flowchart showing the reference picture correctness determination performed by the syntax evaluation circuit of the image stream conversion apparatus according to the embodiment of the present invention.
FIG. 3 is a flowchart showing the B picture determination performed by the syntax evaluation circuit of the image stream conversion apparatus according to the embodiment of the present invention.
FIG. 4 is a diagram schematically showing the processing in MPEG2 image coding according to the conventional technique and the order of images at output: (A) shows the coding structure used in MPEG2 image coding, (B) shows the reordering into encoding order at the time of MPEG2 image encoding, and (C) shows the stream arrival order and the decoded image output order at the time of MPEG2 image decoding.
FIG. 5 is a block diagram showing an example of a general encoding device according to the conventional technique.
FIG. 6 is a block diagram showing an example of a general decoding device according to the conventional technique.
FIG. 7 is a block diagram showing an example of a general stream conversion recording apparatus according to the conventional technique.

Explanation of symbols

1, 102, 218 Image stream buffer
2, 103 Variable length decoding circuit
3, 110 Output frame memory
4 Header extraction circuit
5 Syntax evaluation circuit
6 Syntax reconstruction circuit
7 Encoding information generation circuit
8, 11 Encoding information memory
9 Encoding information conversion circuit
10 Encoding information superposition circuit
12 Encoding information separation circuit
13 Encoding syntax control circuit
14 Input frame memory
15 Macroblock information generation circuit
16, 106, 208 Motion compensation prediction circuit
17, 217, 403 Code amount control circuit
18, 109, 209 Reference image memory
19, 204 Subtractor
20, 205 Orthogonal transform circuit
100 Decoding apparatus
101, 201 Input terminal
104, 215 Coding table
105, 212 Inverse quantization circuit
107, 210 Adder
108, 211 Deblock circuit
111, 213 Inverse orthogonal transform circuit
112, 219 Output terminal
200 Encoding apparatus
202, 404 Input image memory
203 Two-dimensional block conversion circuit
206 Quantization circuit
207 Motion vector detection circuit
214 Encoding circuit
216 Multiplexer

Claims

An image stream conversion apparatus that performs a decoding process on an image stream and a re-encoding process on the image stream decoded by the decoding process, the apparatus comprising:
header information extraction means for extracting header information relating to the image stream;
syntax evaluation means for referring to the header information extracted by the header information extraction means, determining whether or not a good encoded stream can be generated when the image stream is re-encoded based on the encoding syntax of the image stream, and, when it is determined that the good encoded stream cannot be generated, determining whether or not to change the encoding syntax;
syntax reconstructing means for reconstructing the encoding syntax of the image stream based on the determination result of the syntax evaluation means regarding the appropriate encoding syntax; and
encoding information conversion means for converting, based on the encoding syntax reconstructed by the syntax reconstructing means, the encoding information relating to the image stream that is used for the re-encoding process of the image stream.
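The four means recited above can be read as a four-stage pipeline. The following is a minimal sketch under stated assumptions, not the claimed implementation: the dictionary layout of a picture, the function names, the `is_acceptable` and `new_type` callbacks, and the rule that motion vectors are reused only when the picture type is unchanged are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Set


@dataclass
class PictureHeader:
    index: int         # position of the picture in stream order
    picture_type: str  # "I", "P", or "B" as signalled in the stream


def extract_headers(stream: List[dict]) -> List[PictureHeader]:
    """Header information extraction: collect the per-picture header fields."""
    return [PictureHeader(i, pic["type"]) for i, pic in enumerate(stream)]


def evaluate_syntax(headers: List[PictureHeader],
                    is_acceptable: Callable[[PictureHeader], bool]) -> Set[int]:
    """Syntax evaluation: indices of pictures whose syntax should be changed.

    `is_acceptable` is a hypothetical stand-in for the judgement of whether
    re-encoding with the original syntax would give a good stream.
    """
    return {h.index for h in headers if not is_acceptable(h)}


def reconstruct_syntax(headers: List[PictureHeader], to_change: Set[int],
                       new_type: Callable[[PictureHeader], str]) -> List[str]:
    """Syntax reconstruction: assign a new picture type where a change was decided."""
    return [new_type(h) if h.index in to_change else h.picture_type
            for h in headers]


def convert_encoding_info(stream: List[dict], new_types: List[str]) -> List[dict]:
    """Encoding information conversion for the re-encoder.

    Assumption: motion vectors are carried over only when the picture type
    is unchanged; otherwise they are dropped so the re-encoder recomputes them.
    """
    converted = []
    for pic, new_t in zip(stream, new_types):
        reusable = pic["type"] == new_t
        converted.append({
            "type": new_t,
            "motion_vectors": pic["motion_vectors"] if reusable else None,
        })
    return converted
```

In this sketch the evaluation and reconstruction policies are passed in as callbacks so that the pipeline structure stays separate from any particular judgement rule.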

The image stream conversion apparatus according to claim 1, wherein the syntax evaluation means uses at least one parameter among the number of bits required for each picture in the image stream, the motion vector values, the intra-screen distribution of prediction modes, and the frame sum of quantization scales to determine whether or not a good encoded stream can be generated when the image stream is re-encoded based on the picture type of each picture, and, when it is determined that the good encoded stream cannot be generated, determines whether or not to change the picture type.
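As an illustration of a per-picture evaluation of this kind, the sketch below proposes a picture type from two of the listed parameters. It is a hypothetical rule, not the one disclosed in the embodiment: the `PictureStats` fields, the threshold values `bits_factor` and `intra_threshold`, and the decision to promote a costly, mostly intra-coded picture to an I picture are assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class PictureStats:
    picture_type: str      # "I", "P", or "B" as signalled in the header
    bits: int              # number of bits the picture required
    mean_abs_mv: float     # mean motion vector magnitude (unused in this sketch)
    intra_mb_ratio: float  # share of macroblocks in intra prediction mode, 0..1
    qscale_sum: int        # frame sum of quantization scales (unused in this sketch)


def propose_picture_type(stats: PictureStats,
                         avg_bits_same_type: float,
                         bits_factor: float = 2.0,
                         intra_threshold: float = 0.6) -> str:
    """Return the picture type to use for re-encoding.

    Hypothetical rule: a predicted picture that needed far more bits than
    is typical for its type and is mostly intra coded gained little from
    prediction, so it is proposed as an I picture. All other pictures keep
    their original type.
    """
    if stats.picture_type == "I":
        return "I"
    too_costly = stats.bits > bits_factor * avg_bits_same_type
    mostly_intra = stats.intra_mb_ratio > intra_threshold
    if too_costly and mostly_intra:
        return "I"
    return stats.picture_type
```

For example, a P picture of 900,000 bits against a P-picture average of 300,000 bits and an intra ratio of 0.8 would be proposed as an I picture under these assumed thresholds, while a P picture close to the average keeps its type.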