JPH1066084A

JPH1066084A - Video data compressing device and its method

Info

Publication number: JPH1066084A
Application number: JP21648096A
Authority: JP
Inventors: Kanji Mihara; 寛司三原
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1996-08-16
Filing date: 1996-08-16
Publication date: 1998-03-06

Abstract

PROBLEM TO BE SOLVED: To make it possible to compress video data continuously including plural scenes and to improve the quality of a decoded image by controlling the compression ratio of non-compressed video data delayed for a prescribed time based on the difficulty of the non-compressed video data calculated from the difficulty of data obtained by preparatively executing the compression encoding of video data. SOLUTION: A host computer 20 receives the data quantity of compressed video data generated by preparatively executing the compression encoding of non-compressed video data by an encoder 162 in a simple two-pass processing part and the value of a DC component and the power value of an AC component of video data obtained after discrete cosine transformation processing through a control signal C16 and calculates the diffiiculty of a compressed video data pattern based on these received values. Then the host compuer 20 allocates the objective data of compressed video data generated by an encoder 18 to each picture through a control signal C18 based on the calculated difficulty, sets up the allocated result in a quantization circuit built in the encoder 18 and adaptively controls the compression ratio of the encoder 18 in each picture.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、非圧縮映像データ
を圧縮符号化する映像データ圧縮装置およびその方法に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a video data compression apparatus for compressing and encoding non-compressed video data and a method thereof.

【０００２】[0002]

【従来の技術および発明が解決しようとする課題】非圧
縮のディジタル映像データをＭＰＥＧ(moving picture
experts group)等の方法により、Ｉピクチャー(intra c
oded picture) 、Ｂピクチャー(bi-directionaly coded
picture) およびＰピクチャー(predictive coded pict
ure)から構成されるＧＯＰ(group of pictures) 単位に
圧縮符号化して光磁気ディスク（ＭＯディスク；magnet
o-oprical disc）等の記録媒体に記録する際には、圧縮
符号化後の圧縮映像データのデータ量（ビット量）を、
伸長復号後の映像の品質を高く保ちつつ記録媒体の記録
容量以下、あるいは、通信回線の伝送容量以下にする必
要がある。2. Description of the Related Art Uncompressed digital video data is stored in a moving picture (MPEG) format.
I-picture (intra c
oded picture), B picture (bi-directionaly coded
picture) and P-picture (predictive coded pict
ure), compression encoded in GOP (group of pictures) units, and a magneto-optical disk (MO disk;
When recording on a recording medium such as an o-oprical disc, the data amount (bit amount) of the compressed video data after compression encoding is
It is necessary to keep the quality of the video after decompression decoding high, while keeping it below the recording capacity of the recording medium or below the transmission capacity of the communication line.

【０００３】このために、まず、非圧縮映像データを予
備的に圧縮符号化して圧縮符号化後のデータ量を見積も
り（１パス目）、次に、見積もったデータ量に基づいて
圧縮率を調節し、圧縮符号化後のデータ量が記録媒体の
記録容量以下になるように圧縮符号化する（２パス目）
方法が採られる（以下、このような圧縮符号化方法を
「２パスエンコード」とも記す）。For this purpose, first, non-compressed video data is preliminarily compression-encoded and the data amount after compression-encoding is estimated (first pass). Next, the compression rate is adjusted based on the estimated data amount. Then, compression encoding is performed so that the data amount after the compression encoding becomes equal to or less than the recording capacity of the recording medium (second pass).
(Hereinafter, such a compression encoding method is also referred to as “two-pass encoding”).

【０００４】しかしながら、２パスエンコードにより圧
縮符号化を行うと、同じ非圧縮映像データに対して同様
な圧縮符号化処理を２回施す必要があり、時間がかかっ
てしまう。また、１回の圧縮符号化処理で最終的な圧縮
映像データを生成することができないために、撮影した
映像データをそのまま実時間的（リアルタイム）に圧縮
符号化し、記録することができない。However, if compression encoding is performed by two-pass encoding, it is necessary to perform the same compression encoding process twice on the same non-compressed video data, which takes time. In addition, since the final compressed video data cannot be generated by one compression encoding process, the captured video data cannot be directly compression-encoded and recorded in real time (real time).

【０００５】また、編集処理により、時間方向に相関し
ない複数の非圧縮映像データ（以下、シーンとも記す）
を連続的に接続して１つの非圧縮映像データ（編集映像
データ）とし、この編集映像データを、例えば、ピクチ
ャータイプシーケンスＩ，Ｂ，Ｐ，Ｂ，Ｐ，Ｂ，Ｐ，
Ｂ，Ｐ，Ｂ，Ｐ，Ｂで圧縮符号化すると、圧縮符号化後
の最初のピクチャーがＰピクチャーになることがある。
この最初のＰピクチャーを伸長復号するためには、他の
シーンから生成された圧縮映像データの直前のピクチャ
ーを参照する必要がある。しかしながら、最初のＰピク
チャーの伸長復号に、相関がない他のシーンから生成さ
れたピクチャーを用いると、動き予測誤差が著しく増大
するため膨大なデータ量が必要となり、限られたデータ
量しか使用できない場合には、伸長復号後の映像が劣化
してしまう。[0005] In addition, a plurality of uncompressed video data (hereinafter, also referred to as scenes) that are not correlated in the time direction due to editing processing.
Are continuously connected to form one uncompressed video data (edited video data), and this edited video data is, for example, a picture type sequence
When compression encoding is performed using B, P, B, P, and B, the first picture after compression encoding may be a P picture.
In order to decompress and decode the first P picture, it is necessary to refer to the picture immediately before the compressed video data generated from another scene. However, when a picture generated from another scene having no correlation is used for the expansion decoding of the first P picture, a huge amount of data is required because a motion prediction error is significantly increased, and only a limited data amount can be used. In such a case, the video after decompression decoding is deteriorated.

【０００６】かかる不具合を解消するために、例えば、
特開平７−１９３８１８号公報に画像処理方法および画
像処理装置が開示されている。特開平７−１９３８１８
号公報に開示された画像処理方法および画像処理装置
は、例えば２つのシーン（第１のシーンと第２のシー
ン）を含む非圧縮の編集映像データを、例えば、上記ピ
クチャータイプシーケンスＩ，Ｂ，Ｐ，Ｂ，Ｐ，Ｂ，
Ｐ，Ｂ，Ｐ，Ｂ，Ｐ，Ｂで圧縮符号化する際に、第２の
シーンを圧縮符号化した第２の圧縮映像データ（下に示
すピクチャータイプシーケンスにおけるＩ₂，Ｂ₂，Ｐ
₂）の先頭のＰピクチャーを、第１のシーンを圧縮符号
化した第１の圧縮映像データ（下に示すピクチャータイ
プシーケンスにおけるＩ₁，Ｂ₁，Ｐ₁）の最後のピク
チャーを参照しないＩピクチャーに変更し、さらに、発
生するデータ量の増大を抑えるために、第１の圧縮映像
データの最後のＩピクチャーをＰピクチャーに変更して
圧縮符号化を行う。In order to solve such a problem, for example,
JP-A-7-193818 discloses an image processing method and an image processing apparatus. JP-A-7-193818
The image processing method and the image processing apparatus disclosed in Japanese Patent Application Laid-Open No. H10-15095 convert uncompressed edited video data including, for example, two scenes (a first scene and a second scene) into, for example, the picture type sequences I, B, P, B, P, B,
When compression encoding is performed using P, B, P, B, P, and B, second compressed video data obtained by compressing and encoding the second scene (I ₂ , B ₂ , P in the picture type sequence shown below)
₂ ) the first P picture is an I picture which does not refer to the last picture of the first compressed video data (I ₁ , B ₁ , P ₁ in the picture type sequence shown below) obtained by compression encoding the first scene. In order to suppress an increase in the amount of generated data, compression encoding is performed by changing the last I picture of the first compressed video data to a P picture.

【０００７】つまり具体的には、特開平７−１９３８１
８号公報に開示された画像処理方法および画像処理装置
は、上記ピクチャータイプシーケンスを変更せずに圧縮
符号化して、第１の圧縮映像データおよび第２の圧縮映
像データが、ピクチャータイプシーケンスＢ₁，Ｉ₁，
Ｂ₁，Ｐ₁，Ｂ₁，Ｐ₁，Ｂ₁，Ｐ₂，Ｂ₂，Ｐ₂，Ｂ
₂，Ｐ₂，Ｂ₂で得られる場合に、第１の圧縮映像デー
タの最後のＩピクチャーをＰピクチャーに変更し、さら
に、第２の圧縮映像データの最初のＰピクチャーをＩピ
クチャーに変更して圧縮符号化し、ピクチャータイプシ
ーケンスＢ₁，Ｐ₁，Ｂ₁，Ｐ₁，Ｂ₁，Ｐ₁，Ｂ₁，
Ｉ₂，Ｂ₂，Ｐ₂，Ｂ₂，Ｐ₂，Ｂ₂の第１の圧縮映像
データおよび第２の圧縮映像データを得るように構成さ
れている。That is, specifically, Japanese Patent Laid-Open No. 7-19381
The image processing method and image processing apparatus disclosed in 8 JP compresses encoded without changing the picture type sequence, the first compressed image data and second compressed image data, picture type sequence B ₁ , I ₁ ,
B ₁ , P ₁ , B ₁ , P ₁ , B ₁ , P ₂ , B ₂ , P ₂ , B
₂ , P ₂ , B ₂ , the last I picture of the first compressed video data is changed to a P picture, and the first P picture of the second compressed video data is changed to an I picture. To compress and encode the picture type sequences B ₁ , P ₁ , B ₁ , P ₁ , B ₁ , P ₁ , B ₁ ,
The first and second compressed video data of I ₂ , B ₂ , P ₂ , B ₂ , P ₂ , and B ₂ are obtained.

【０００８】本発明は上述した従来技術を改良してなさ
れたものであり、２パスエンコードによらずに、複数の
シーンを連続的に含む映像データを所定のデータ量以下
に圧縮符号化して圧縮映像データを生成することがで
き、しかも、連続的な複数のシーンの時間方向における
境界（シーンチェンジ）部分を圧縮符号化した圧縮映像
データを伸長復号して得られる映像の品質を保持するこ
とができる映像データ圧縮装置およびその方法を提供す
ることを目的とする。The present invention has been made by improving the above-mentioned prior art, and compresses and encodes video data continuously including a plurality of scenes to a predetermined data amount or less without performing two-pass encoding. Video data can be generated, and the quality of a video obtained by decompressing and decoding compressed video data obtained by compressing and encoding boundaries (scene changes) in the time direction between a plurality of continuous scenes can be maintained. It is an object of the present invention to provide a video data compression apparatus and a method therefor.

【０００９】[0009]

【課題を解決するための手段】上記目的を達成するため
に、本発明に係る映像データ圧縮装置は、連続して入力
される複数の非圧縮映像データの先頭が、所定の圧縮方
法によりＩピクチャー、ＰピクチャーおよびＢピクチャ
ーの組み合わせで構成される所定のピクチャータイプシ
ーケンスに圧縮された後に、ＩピクチャーまたはＰピク
チャーとなるように、ピクチャーの順序を入れ替える入
れ替え手段と、順序を入れ替えた前記非圧縮映像データ
を、前記所定の圧縮方法により圧縮して、第１の圧縮映
像データを生成する前記第１の圧縮手段と、順序を入れ
替えた前記非圧縮映像データを所定の遅延時間だけ遅延
する遅延手段と、前記所定の遅延時間に対応する前記非
圧縮映像データから生成された前記第１の圧縮映像デー
タのデータ量に基づいて、所定量の未生成の前記第１の
圧縮映像データのデータ量を予測する予測手段と、予測
した前記第１の圧縮映像データのデータ量と、実際に生
成した前記第１の圧縮映像データのデータ量（実際のデ
ータ量）とに基づいて、前記非圧縮映像データの先頭を
検出する先頭検出手段と、検出した前記非圧縮映像デー
タの先頭のピクチャーが、圧縮後に、他の映像データの
ピクチャーと関係を有さないように、前記所定のピクチ
ャータイプシーケンスを変更する変更手段と、生成した
前記第１の圧縮映像データ、および、予測した前記第１
の圧縮映像データのデータ量に基づいて、前記非圧縮映
像データの圧縮後のデータ量の目標値を生成する目標値
生成手段と、圧縮後のデータ量が、生成した前記目標値
になるように、遅延した前記非圧縮映像データを、前記
所定の圧縮方法により、変更した前記所定のピクチャー
タイプシーケンスに圧縮する第２の圧縮手段とを有す
る。In order to achieve the above object, a video data compression apparatus according to the present invention is characterized in that the head of a plurality of non-compressed video data which are continuously input is an I picture , A P picture, and a B picture, after being compressed into a predetermined picture type sequence, the picture sequence is changed to an I picture or a P picture, and the non-compressed video is reordered. First compression means for compressing data by the predetermined compression method to generate first compressed video data, and delay means for delaying the non-compressed video data whose order has been rearranged by a predetermined delay time; , Based on the data amount of the first compressed video data generated from the uncompressed video data corresponding to the predetermined delay time. Prediction means for predicting the data amount of the predetermined amount of the uncompressed first compressed video data, the predicted data amount of the first compressed video data, and the actually generated first compressed video data Head detecting means for detecting the head of the non-compressed video data based on the data amount (actual data amount) of the non-compressed video data. Changing means for changing the predetermined picture type sequence so as not to have a relationship with a picture; the generated first compressed video data; and the predicted first compressed video data.
Target value generating means for generating a target value of the data amount of the uncompressed video data after compression based on the data amount of the compressed video data, so that the data amount after compression becomes the generated target value. And second compression means for compressing the delayed uncompressed video data into the changed predetermined picture type sequence by the predetermined compression method.

【００１０】好適には、前記先頭検出手段は、Ｉピクチ
ャーおよびＰピクチャーの実際のデータ量が、予測した
前記第１の圧縮映像データのＩピクチャーおよびＰピク
チャーに対する比の値が、所定の範囲外になった場合
に、前記データ量が多くなったＰピクチャーに対応する
位置に、前記非圧縮映像データの先頭を検出する。[0010] Preferably, the head detecting means is arranged so that the actual data amount of the I picture and the P picture is such that the predicted ratio of the first compressed video data to the I picture and the P picture is out of a predetermined range. , The head of the uncompressed video data is detected at a position corresponding to the P picture whose data amount has increased.

【００１１】好適には、前記先頭検出手段は、Ｂピクチ
ャーの実際のデータ量が、予測した前記第１の圧縮映像
データのＢピクチャーのデータ量よりも所定の割合以
上、多くなった場合に、前記データ量が多くなったＢピ
クチャーの直前のＩピクチャーの位置に、前記非圧縮映
像データの先頭を検出する。Preferably, when the actual data amount of the B picture is larger than the predicted data amount of the B picture of the first compressed video data by a predetermined ratio or more, The head of the uncompressed video data is detected at the position of the I picture immediately before the B picture whose data amount has increased.

【００１２】好適には、前記変更手段は、前記所定のピ
クチャータイプシーケンスにおいて、前記非圧縮映像デ
ータの先頭がＰピクチャーに圧縮される場合に、前記非
圧縮映像データの先頭がＩピクチャーに圧縮されるよう
に、前記所定のピクチャータイプシーケンスを変更す
る。Preferably, in the predetermined picture type sequence, when the head of the uncompressed video data is compressed into a P picture, the changing unit compresses the head of the uncompressed video data into an I picture. Thus, the predetermined picture type sequence is changed.

【００１３】好適には、前記変更手段は、前記非圧縮映
像データの先頭がＩピクチャーに圧縮されるように前記
所定のピクチャータイプシーケンスを変更した場合に、
近傍の圧縮後にＩピクチャーになる前記非圧縮映像デー
タのピクチャーが、Ｐピクチャーに圧縮されるように、
前記所定のピクチャータイプシーケンスをさらに変更す
る。Preferably, the changing means changes the predetermined picture type sequence so that the head of the uncompressed video data is compressed into an I picture.
A picture of the uncompressed video data, which becomes an I picture after neighboring compression, is compressed into a P picture,
The predetermined picture type sequence is further changed.

【００１４】本発明に係る映像データ圧縮装置におい
て、例えば、非圧縮映像データをピクチャータイプシー
ケンスＩ，Ｂ，Ｂ，Ｐ，Ｂ，Ｂ，…，Ｐ，Ｂ，Ｂ（上記
ピクチャータイプシーケンスに圧縮される非圧縮映像デ
ータのピクチャーそれぞれを、ピクチャーＩ₁，Ｂ₂，
Ｂ₃，Ｐ₄，Ｂ₅，Ｂ₆，…，Ｐ₁₃，Ｂ₁₄，Ｂ₁₅と記
す）に圧縮する場合、入れ替え手段は、連続的に入力さ
れる複数のシーン（非圧縮映像データ）のピクチャーＩ
₁，Ｂ₂，Ｂ₃，Ｐ₄，Ｂ₅，Ｂ₆，Ｐ₇，…，Ｐ ₁₃，
Ｂ₁₄，Ｂ₁₅を、圧縮符号化に適した順序、ピクチャーＩ
₁，Ｂ_-2，Ｂ_-1，Ｐ ₄，Ｂ₁，Ｂ₂，…，Ｐ₁₃，Ｂ₁₁，
Ｂ₁₂に入れ替える。つまり、非圧縮映像データは、例え
ば、ＩピクチャーとＰピクチャーの間に挟まれる１組の
Ｂピクチャーを、直後のＩピクチャーまたはＰピクチャ
ーの後ろに移動させる。In the video data compression apparatus according to the present invention,
For example, if uncompressed video data is
Kens I, B, B, P, B, B, ..., P, B, B (above
Uncompressed video data compressed to a picture type sequence
Each picture of the data₁, B_Two,
B_Three, P_Four, B_Five, B₆, ..., P₁₃, B₁₄, B_FifteenNotation
), The replacement means is input continuously.
I of multiple scenes (uncompressed video data)
₁, B_Two, B_Three, P_Four, B_Five, B₆, P₇, ..., P ₁₃,
B₁₄, B_FifteenIn the order suitable for compression encoding, picture I
₁, B_-2, B_-1, P _Four, B₁, B_Two, ..., P₁₃, B₁₁,
B₁₂Replace with In other words, uncompressed video data
For example, a set of I-picture and P-picture
B picture is replaced with the immediately following I picture or P picture
Move it behind.

【００１５】第１の圧縮手段は、入れ替え手段がピクチ
ャーの順序を入れ替えた複数のシーンを予備的に圧縮符
号化し、圧縮後のピクチャーそれぞれに割り当てるデー
タ量を決めるために必要な難度データを求めるために用
いる第１の圧縮映像データを生成する。具体的には、第
１の圧縮手段は、例えば、ＭＰＥＧ方式により、各シー
ンをピクチャータイプシーケンスＩ，Ｂ，Ｂ，Ｐ，Ｂ，
Ｂ，…，Ｐ，Ｂ，Ｂから構成されるＧＯＰ(group of pi
ctures)単位に圧縮符号化し、第１の圧縮映像データを
生成する。なお、シーンのピクチャーの順序が、上述の
ように入れ替えられているために、シーンチェンジ（複
数のシーンの時間方向の境界）の直後のシーンの先頭の
ピクチャーは、ＩピクチャーまたはＰピクチャーとな
る。The first compression means preliminarily compression-encodes a plurality of scenes in which the order of the pictures has been changed by the replacement means, and obtains difficulty data necessary for determining the amount of data to be allocated to each of the compressed pictures. Of the first compressed video data used for. More specifically, the first compression unit converts each scene into a picture type sequence I, B, B, P, B,
GOP (group of pi) composed of B, ..., P, B, B
compression encoding) to generate first compressed video data. Since the order of the pictures of the scene is changed as described above, the first picture of the scene immediately after the scene change (the boundary in the time direction between a plurality of scenes) is an I picture or a P picture.

【００１６】遅延手段は、例えば、各シーンの所定の枚
数のピクチャーが入力される時間だけ、つまり、各シー
ンを圧縮して得られる圧縮映像データのピクチャーそれ
ぞれに割り当てるデータ量を算出するために充分な量の
難度データの生成に必要な第１の圧縮映像データを得る
ために充分な時間だけ、入力される各シーンを遅延す
る。予測手段は、第１の圧縮手段が生成した第１の圧縮
映像データのデータ量を、例えば、直線近似し、さら
に、近似により得た直線を、第１の圧縮映像データの未
生成の部分に外挿し、未生成の第１の圧縮映像データの
ピクチャーごとのデータ量を、ピクチャータイプ別に予
測する。The delay means is sufficient for calculating, for example, the time during which a predetermined number of pictures of each scene are input, that is, the amount of data allocated to each of the pictures of the compressed video data obtained by compressing each scene. Each scene to be input is delayed by a time sufficient to obtain the first compressed video data required to generate a large amount of difficulty data. The prediction means approximates, for example, a straight line to the data amount of the first compressed video data generated by the first compression means, and further converts a straight line obtained by the approximation into an ungenerated part of the first compressed video data. The extrapolated and ungenerated first compressed video data data amount for each picture is predicted for each picture type.

【００１７】先頭検出手段は、予測した第１の圧縮映像
データの未生成の部分が、後に実際に生成されると、予
測したピクチャーのデータ量と、実際に生成したピクチ
ャーのデータ量とを比較して、シーンの先頭部分（シー
ンチェンジ部分）を検出する。具体的には、先頭検出手
段は、例えば、予測したＩピクチャーおよびＰピクチャ
ーのデータ量と実際に生成したＩピクチャーおよびＰピ
クチャーのデータ量とを比較し、実際のデータ量の予測
した値に対する比の値が、所定の範囲外になった場合
に、これらのＩピクチャーおよびＰピクチャーに対応す
る部分で、シーンチェンジが生じたことを検出する。ま
た、具体的には、先頭検出手段は、シーンチェンジ後の
Ｂピクチャーのデータ量が、Ｐピクチャー並みに増加す
ることを利用して、例えば、予測したＢピクチャーのデ
ータ量と実際に生成したＢピクチャーのデータ量とを比
較し、実際のデータ量が予測した値よりも所定の割合以
上、大きい場合に、このＢピクチャーの直前のＩピクチ
ャーおよびＰピクチャーに対応する部分で、シーンチェ
ンジが生じたことを検出する。このように、Ｉピクチャ
ーおよびＰピクチャーのデータ量のみでなく、Ｂピクチ
ャーのデータ量をも監視することにより、先頭検出手段
は、シーンチェンジ部分を確実に行うことができる。The head detecting means compares the data amount of the predicted picture with the data amount of the actually generated picture when the predicted ungenerated portion of the first compressed video data is actually generated later. Then, the head part (scene change part) of the scene is detected. Specifically, the head detecting means compares, for example, the data amount of the predicted I-picture and P-picture with the data amount of the actually generated I-picture and P-picture, and calculates the ratio of the actual data amount to the predicted value. Is out of the predetermined range, it is detected that a scene change has occurred in portions corresponding to these I-pictures and P-pictures. Also, specifically, the head detecting means uses the fact that the data amount of the B picture after the scene change increases as much as the P picture, and for example, the data amount of the predicted B picture and the actually generated B picture The data amount of the picture is compared with that of the picture, and if the actual data amount is larger than the predicted value by a predetermined ratio or more, a scene change has occurred in a portion corresponding to the I picture and the P picture immediately before the B picture. Detect that. As described above, by monitoring not only the data amount of the I picture and the P picture but also the data amount of the B picture, the head detecting means can surely perform a scene change portion.

【００１８】変更手段は、先頭検出手段が検出したシー
ンの先頭のピクチャーが、前のピクチャー（前のシーン
の最後のピクチャー）と関係を有する（伸長時に前のピ
クチャーのデータを参照する）Ｐピクチャーに圧縮され
る場合に、ピクチャータイプシーケンスを変更し、シー
ンの先頭のピクチャーが、他のピクチャーと関係を有さ
ないＩピクチャーに圧縮されるようにする。また、目標
値生成手段は、生成した第１の圧縮映像データのデータ
量、および、予測した第１の圧縮映像データのデータ
量、またはこれらのいずれかに基づいて、最終的に生成
する圧縮映像データ（第２の圧縮映像データ）のデータ
量の目標値を生成する。[0018] The changing means is a P picture in which the first picture of the scene detected by the first detecting means has a relationship with the previous picture (the last picture of the previous scene) (refers to the data of the previous picture at the time of decompression). , The picture type sequence is changed so that the first picture of the scene is compressed into an I picture having no relation to other pictures. Further, the target value generating means may generate the compressed video finally generated based on the data amount of the generated first compressed video data and / or the predicted data amount of the first compressed video data. A target value of the data amount of the data (second compressed video data) is generated.

【００１９】第２の圧縮手段は、例えば、第１の圧縮手
段と同じＭＰＥＧ方式により、圧縮後のピクチャーそれ
ぞれのデータ量が、対応する目標値が示すデータ量にな
るように、遅延手段が遅延した各シーンを、変更手段変
更したピクチャータイプシーケンスに圧縮し、各シーン
の第２の圧縮映像データを生成する。[0019] The second compressing means uses the same MPEG system as the first compressing means, for example, so that the data amount of each picture after compression becomes equal to the data amount indicated by the corresponding target value. Each of the scenes thus compressed is compressed into a picture type sequence changed by the changing means, and second compressed video data of each scene is generated.

【００２０】また、本発明に係る映像データ圧縮方法
は、連続して入力される複数の非圧縮映像データの先頭
が、所定の圧縮方法によりＩピクチャー、Ｐピクチャー
およびＢピクチャーの組み合わせで構成される所定のピ
クチャータイプシーケンスに圧縮された後に、Ｉピクチ
ャーまたはＰピクチャーとなるように圧縮して、第１の
圧縮映像データを生成し、前記所定の遅延時間に対応す
る前記第１の圧縮映像データのデータ量に基づいて、所
定量の未生成の前記第１の圧縮映像データのデータ量を
予測し、前記第１の圧縮映像データの予測したデータ
量、実際に生成した前記第１の圧縮映像データのデータ
量（実際のデータ量）とに基づいて、前記非圧縮映像デ
ータの先頭のピクチャーを検出する。Further, in the video data compression method according to the present invention, the head of a plurality of non-compressed video data which are continuously input is composed of a combination of I picture, P picture and B picture by a predetermined compression method. After being compressed to a predetermined picture type sequence, the data is compressed to become an I picture or a P picture to generate first compressed video data, and the first compressed video data of the first compressed video data corresponding to the predetermined delay time is generated. A data amount of a predetermined amount of the uncompressed first compressed video data is predicted based on the data amount, and a predicted data amount of the first compressed video data and the actually generated first compressed video data are calculated. The first picture of the uncompressed video data is detected based on the data amount (actual data amount).

【００２１】好適には、前記非圧縮映像データを所定の
遅延時間だけ遅延し、検出した部分のＰピクチャーが、
圧縮後に、Ｉピクチャーになるように前記所定のピクチ
ャータイプシーケンスを変更し、生成した前記第１の圧
縮映像データと予測した前記第１の圧縮映像データとの
データ量に基づいて、圧縮後のデータ量の目標値を生成
し、圧縮後のデータ量が、生成した前記目標値になるよ
うに、遅延した前記非圧縮映像データを、前記所定の圧
縮方法により、変更した前記所定のピクチャータイプシ
ーケンスに圧縮する。Preferably, the uncompressed video data is delayed by a predetermined delay time, and the detected P picture is
After the compression, the predetermined picture type sequence is changed so as to become an I picture, and the data after compression is determined based on the data amount of the generated first compressed video data and the predicted first compressed video data. An amount target value is generated, and the uncompressed video data delayed by the predetermined compression method is changed to the predetermined picture type sequence so that the data amount after compression becomes the generated target value. Compress.

【００２２】[0022]

【発明の実施の形態】第１実施形態以下、本発明の第１の実施形態を説明する。ＭＰＥＧ方
式といった映像データの圧縮符号化方式により、高い周
波数成分が多い絵柄、あるいは、動きが多い絵柄といっ
た難度(difficulty)が高い映像データを圧縮符号化する
と、一般的に圧縮に伴う歪みが生じやすくなる。このた
め、難度が高い映像データは低い圧縮率で圧縮符号化す
る必要があり、難度が高いデータを圧縮符号化して得ら
れる圧縮映像データに対しては、難度が低い絵柄の映像
データの圧縮映像データに比べて、多くの目標データ量
を配分する必要がある。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS First Embodiment Hereinafter, a first embodiment of the present invention will be described. By compression encoding video data such as the MPEG method, when compression encoding video data with a high degree of difficulty, such as a pattern with many high frequency components or a pattern with a lot of motion, distortion due to compression is likely to occur. Become. For this reason, it is necessary to compress and encode video data having a high degree of difficulty at a low compression ratio. For compressed video data obtained by compressing and encoding data having a high degree of difficulty, a compressed image of video data having a pattern having a low level of difficulty is obtained. It is necessary to allocate a larger amount of target data than data.

【００２３】このように、映像データの難度に対して適
応的に目標データ量を配分するためには、従来技術とし
て示した２パスエンコード方式が有効である。しかしな
がら、２パスエンコード方式は、実時間的な圧縮符号化
に不向きである。第１の実施形態として示す簡易２パス
エンコード方式は、かかる２パスエンコード方式の問題
点を解決するためになされたものであり、非圧縮映像デ
ータを予備的に圧縮符号化して得られる圧縮映像データ
の難度データから非圧縮映像データの難度を算出し、予
備的な圧縮符号化により算出した難度に基づいて、ＦＩ
ＦＯメモリ等により所定の時間だけ遅延した非圧縮映像
データの圧縮率を適応的に制御することができる。As described above, the two-pass encoding method shown as a conventional technique is effective for adaptively allocating a target data amount to the difficulty of video data. However, the two-pass encoding method is not suitable for real-time compression encoding. The simplified two-pass encoding method shown as the first embodiment has been made to solve the problem of the two-pass encoding method, and the compressed video data obtained by preliminary compression-encoding the non-compressed video data The difficulty level of the uncompressed video data is calculated from the difficulty level data, and the FI level is calculated based on the difficulty level calculated by the preliminary compression encoding.
The compression rate of the uncompressed video data delayed by a predetermined time by the FO memory or the like can be adaptively controlled.

【００２４】図１は、本発明に係る映像データ圧縮装置
１の構成を示す図である。図１に示すように、映像デー
タ圧縮装置１は、圧縮符号化部１０およびホストコンピ
ュータ２０から構成され、圧縮符号化部１０は、エンコ
ーダ制御部１２、動き検出器(motion estimator)１４、
簡易２パス処理部１６、第２のエンコーダ(encoder) １
８から構成され、簡易２パス処理部１６は、ＦＩＦＯメ
モリ１６０および第１のエンコーダ１６２から構成され
る。映像データ圧縮装置１は、これらの構成部分によ
り、編集装置およびビデオテープレコーダ装置等の外部
機器（図示せず）から入力される非圧縮映像データＶＩ
Ｎに対して、上述した簡易２パスエンコードを実現す
る。FIG. 1 is a diagram showing a configuration of a video data compression device 1 according to the present invention. As shown in FIG. 1, the video data compression apparatus 1 includes a compression encoding unit 10 and a host computer 20. The compression encoding unit 10 includes an encoder control unit 12, a motion estimator 14,
Simple 2-pass processing unit 16, second encoder (encoder) 1
8, and the simple two-pass processing unit 16 includes a FIFO memory 160 and a first encoder 162. The video data compression device 1 uses these components to generate uncompressed video data VI input from external devices (not shown) such as an editing device and a video tape recorder device.
For N, the above-described simple two-pass encoding is realized.

【００２５】映像データ圧縮装置１において、ホストコ
ンピュータ２０は、映像データ圧縮装置１の各構成部分
の動作を制御する。また、ホストコンピュータ２０は、
簡易２パス処理部１６のエンコーダ１６２が非圧縮映像
データＶＩＮを予備的に圧縮符号化して生成した圧縮映
像データのデータ量、ＤＣＴ処理後の映像データの直流
成分（ＤＣ成分）の値および直流成分（ＡＣ成分）の電
力値を制御信号Ｃ１６を介して受け、受けたこれらの値
に基づいて圧縮映像データの絵柄の難度を算出する。さ
らに、ホストコンピュータ２０は、算出した難度に基づ
いて、エンコーダ１８が生成する圧縮映像データの目標
データ量Ｔ_jを制御信号Ｃ１８を介してピクチャーごと
に割り当て、エンコーダ１８の量子化回路１６６（図
３）に設定し、エンコーダ１８の圧縮率をピクチャー単
位に適応的に制御する。In the video data compression apparatus 1, a host computer 20 controls the operation of each component of the video data compression apparatus 1. Also, the host computer 20
The data amount of the compressed video data generated by the encoder 162 of the simple two-pass processing unit 16 preliminarily compression-encoding the non-compressed video data VIN, the value of the DC component (DC component) of the DCT-processed video data, and the DC component The power value of the (AC component) is received via the control signal C16, and the difficulty of the picture of the compressed video data is calculated based on the received values. Further, the host computer 20 based on the calculated difficulty, assigned to each picture of the target amount of data T _j of the compressed video data encoder 18 is generated via a control signal C18, the quantization circuit 166 of the encoder 18 (FIG. 3 ), And the compression rate of the encoder 18 is adaptively controlled on a picture basis.

【００２６】エンコーダ制御部１２は、非圧縮映像デー
タＶＩＮのピクチャーの有無をホストコンピュータ２０
に通知し、さらに、非圧縮映像データＶＩＮのピクチャ
ーごとに圧縮符号化のための前処理を行う。つまり、エ
ンコーダ制御部１２は、入力された非圧縮映像データを
符号化順に並べ替え、ピクチャー・フィールド変換を行
い、非圧縮映像データＶＩＮが映画の映像データである
場合に３：２プルダウン処理（映画の２４フレーム／秒
の映像データを、３０フレーム／秒の映像データに変換
し、冗長性を圧縮符号化前に取り除く処理）等を行い、
映像データＳ１２として簡易２パス処理部１６のＦＩＦ
Ｏメモリ１６０およびエンコーダ１６２に対して出力す
る。動き検出器１４は、非圧縮映像データの動きベクト
ルの検出を行し、エンコーダ制御部１２およびエンコー
ダ１６２，１８に対して出力する。The encoder controller 12 determines whether or not there is a picture of the uncompressed video data VIN by the host computer 20.
And performs a pre-process for compression encoding for each picture of the uncompressed video data VIN. That is, the encoder control unit 12 rearranges the input non-compressed video data in the order of encoding, performs picture / field conversion, and performs 3: 2 pull-down processing (movie processing) when the non-compressed video data VIN is video data of a movie. Of the 24 frames / sec video data into 30 frames / sec video data, and removes the redundancy before the compression encoding.
The FIF of the simple 2-pass processing unit 16 is used as the video data S12.
Output to the O memory 160 and the encoder 162. The motion detector 14 detects a motion vector of the uncompressed video data, and outputs the motion vector to the encoder control unit 12 and the encoders 162 and 18.

【００２７】簡易２パス処理部１６において、ＦＩＦＯ
メモリ１６０は、エンコーダ制御部１２から入力された
映像データＳ１２を、例えば、非圧縮映像データＶＩＮ
が、Ｌ（Ｌは整数）ピクチャー入力される時間だけ遅延
し、遅延映像データＳ１６としてエンコーダ１８に対し
て出力する。In the simple two-pass processing unit 16, the FIFO
The memory 160 converts the video data S12 input from the encoder control unit 12 into, for example, uncompressed video data VIN
Is delayed by the time of L (L is an integer) picture input, and is output to the encoder 18 as delayed video data S16.

【００２８】図２は、図１に示した簡易２パス処理部１
６のエンコーダ１６２の構成を示す図である。エンコー
ダ１６２は、例えば、図２に示すように、加算回路１６
４、ＤＣＴ回路１６６、量子化回路（Ｑ）１６８、可変
長符号化回路（ＶＬＣ）１７０、逆量子化回路（ＩＱ）
１７２、逆ＤＣＴ（ＩＤＣＴ）回路１７４、加算回路１
７６および動き補償回路１７８から構成される一般的な
映像データ用圧縮符号化器であって、入力される映像デ
ータＳ１２をＭＰＥＧ方式等により圧縮符号化し、圧縮
映像データのピクチャーごとのデータ量等をホストコン
ピュータ２０に対して出力する。FIG. 2 shows a simplified two-pass processing unit 1 shown in FIG.
6 is a diagram illustrating a configuration of a sixth encoder 162. FIG. The encoder 162 includes, for example, as shown in FIG.
4. DCT circuit 166, quantization circuit (Q) 168, variable length coding circuit (VLC) 170, inverse quantization circuit (IQ)
172, inverse DCT (IDCT) circuit 174, addition circuit 1
Is a general video data compression encoder composed of the video data S12 and the motion compensation circuit 178. The input video data S12 is compression-coded by the MPEG method or the like, and the amount of compressed video data for each picture is determined. Output to the host computer 20.

【００２９】加算回路１６４は、加算回路１７６の出力
データを映像データＳ１２から減算し、ＤＣＴ回路１６
６に対して出力する。ＤＣＴ回路１６６は、加算回路１
６４から入力される映像データを、例えば、１６画素×
１６画素のマクロブロック単位に離散コサイン変換（Ｄ
ＣＴ）処理し、時間領域のデータから周波数領域のデー
タに変換して量子化回路１６８に対して出力する。ま
た、ＤＣＴ回路１６６は、ＤＣＴ後の映像データのＤＣ
成分の値およびＡＣ成分の電力値をホストコンピュータ
２０に対して出力する。量子化回路１６８は、ＤＣＴ回
路１６６から入力された周波数領域のデータを、固定の
量子化値Ｑで量子化し、量子化データとして可変長符号
化回路１７０および逆量子化回路１７２に対して出力す
る。可変長符号化回路１７０は、量子化回路１６８から
入力された量子化データを可変長符号化し、可変長符号
化の結果として得られた圧縮映像データのデータ量を、
制御信号Ｃ１６を介してホストコンピュータ２０に対し
て出力する。逆量子化回路１７２は、可変長符号化回路
１６８から入力された量子化データを逆量子化し、逆量
子化データとして逆ＤＣＴ回路１７４に対して出力す
る。The addition circuit 164 subtracts the output data of the addition circuit 176 from the video data S12,
6 is output. The DCT circuit 166 includes the addition circuit 1
For example, the video data input from 64 is converted to 16 pixels ×
Discrete cosine transform (D
CT), converts the data in the time domain into the data in the frequency domain, and outputs the data to the quantization circuit 168. Further, the DCT circuit 166 controls the DCT of the video data after the DCT.
The value of the component and the power value of the AC component are output to the host computer 20. The quantization circuit 168 quantizes the frequency domain data input from the DCT circuit 166 with a fixed quantization value Q, and outputs the quantized data to the variable length coding circuit 170 and the inverse quantization circuit 172. . The variable length coding circuit 170 performs variable length coding on the quantized data input from the quantization circuit 168, and calculates the data amount of the compressed video data obtained as a result of the variable length coding.
Output to the host computer 20 via the control signal C16. The inverse quantization circuit 172 inversely quantizes the quantized data input from the variable length encoding circuit 168, and outputs the inversely quantized data to the inverse DCT circuit 174.

【００３０】逆ＤＣＴ回路１７４は、逆量子化回路１７
２から入力される逆量子化データに対して逆ＤＣＴ処理
を行い、加算回路１７６に対して出力する。加算回路１
７６は、動き補償回路１７８の出力データおよび逆ＤＣ
Ｔ回路１７４の出力データを加算し、加算回路１６４お
よび動き補償回路１７８に対して出力する。動き補償回
路１７８は、加算回路１７６の出力データに対して、動
き検出器１４から入力される動きベクトルに基づいて動
き補償処理を行い、加算回路１７６に対して出力する。The inverse DCT circuit 174 includes the inverse quantization circuit 17
Inverse DCT processing is performed on the inversely quantized data input from 2 and output to the adding circuit 176. Addition circuit 1
76 is the output data of the motion compensation circuit 178 and the inverse DC
The output data of the T circuit 174 is added and output to the addition circuit 164 and the motion compensation circuit 178. The motion compensation circuit 178 performs a motion compensation process on the output data of the addition circuit 176 based on the motion vector input from the motion detector 14, and outputs the result to the addition circuit 176.

【００３１】図３は、図１に示したエンコーダ１８の構
成を示す図である。図３に示すように、エンコーダ１８
は、図２に示したエンコーダ１６２に、量子化制御回路
１８０を加えた構成になっている。エンコーダ１８は、
これらの構成部分により、ホストコンピュータ２０から
設定される目標データ量Ｔ_jに基づいて、ＦＩＦＯメモ
リ１６０によりＬピクチャー分遅延された遅延映像デー
タＳ１６に対して動き補償処理、ＤＣＴ処理、量子化処
理および可変長符号化処理を施して、ＭＰＥＧ方式等の
圧縮映像データＶＯＵＴを生成し、外部機器（図示せ
ず）に出力する。FIG. 3 is a diagram showing a configuration of the encoder 18 shown in FIG. As shown in FIG.
Has a configuration in which a quantization control circuit 180 is added to the encoder 162 shown in FIG. The encoder 18
With these components, based on the target amount of data T _j set from the host computer 20, the motion compensation process to the L picture delayed by the delayed video data S16 by the FIFO memory 160, DCT processing, quantization processing and The variable-length encoding processing is performed to generate compressed video data VOUT in the MPEG format or the like, and output the same to an external device (not shown).

【００３２】エンコーダ１８において、量子化制御回路
１８０は、可変長量子化回路１７０が出力する圧縮映像
データＶＯＵＴのデータ量を順次、監視し、遅延映像デ
ータＳ１６の第ｊ番目のピクチャーから最終的に生成さ
れる圧縮映像データのデータ量が、ホストコンピュータ
２０から設定された目標データ量Ｔ_jに近づくように、
順次、量子化回路１６８に設定する量子化値Ｑ_jを調節
する。また、可変長量子化回路１７０は、圧縮映像デー
タＶＯＵＴを外部に出力する他に、遅延映像データＳ１
６を圧縮符号化して得られた圧縮映像データＶＯＵＴの
実際のデータ量Ｓ_jを制御信号Ｃ１８を介してホストコ
ンピュータ２０に対して出力する。In the encoder 18, the quantization control circuit 180 sequentially monitors the data amount of the compressed video data VOUT output from the variable length quantization circuit 170, and finally starts from the j-th picture of the delayed video data S 16. data amount of the compressed video data generated is, so as to approach the target amount of data T _j set from the host computer 20,
The quantization value Q _j to be set in the quantization circuit 168 is sequentially adjusted. The variable-length quantization circuit 170 outputs the compressed video data VOUT to the outside, and also outputs the delayed video data S1
6 is output to the host computer 20 via the control signal C18, the actual data amount _Sj of the compressed video data VOUT obtained by compression-encoding 6.

【００３３】以下、第１の実施形態における映像データ
圧縮装置１の簡易２パスエンコード動作を説明する。図
４（Ａ）〜（Ｃ）は、第１の実施形態における映像デー
タ圧縮装置１の簡易２パスエンコードの動作を示す図で
ある。エンコーダ制御部１２は、映像データ圧縮装置１
に入力された非圧縮映像データＶＩＮに対して、エンコ
ーダ制御部１２により符号化順にピクチャーを並べ替え
る等の前処理を行い、図４（Ａ）に示すように映像デー
タＳ１２としてＦＩＦＯメモリ１６０およびエンコーダ
１６２に対して出力する。なお、エンコーダ制御部１２
によるピクチャーの順番並べ替えにより、図４等に示す
ピクチャーの符号化の順番と伸長復号後の表示の順番と
は異なる。Hereinafter, a simple two-pass encoding operation of the video data compression device 1 according to the first embodiment will be described. FIGS. 4A to 4C are diagrams illustrating the operation of the simple two-pass encoding of the video data compression device 1 according to the first embodiment. The encoder control unit 12 controls the video data compression device 1
4A, the encoder control unit 12 performs preprocessing such as rearranging the pictures in the encoding order, and as shown in FIG. 4A, the FIFO memory 160 and the encoder 162. Note that the encoder control unit 12
, The order of picture encoding shown in FIG. 4 and the like differs from the order of display after decompression decoding.

【００３４】ＦＩＦＯメモリ１６０は、入力された映像
データＳ１２の各ピクチャーをＬピクチャー分だけ遅延
し、エンコーダ１８に対して出力する。エンコーダ１６
２は、入力された映像データＳ１２のピクチャーを予備
的に順次、圧縮符号化し、第ｊ（ｊは整数）番目のピク
チャーを圧縮符号化して得られた圧縮符号化データのデ
ータ量、ＤＣＴ処理後の映像データのＤＣ成分の値、お
よび、ＡＣ成分の電力値をホストコンピュータ２０に対
して出力する。The FIFO memory 160 delays each picture of the input video data S12 by L pictures and outputs it to the encoder 18. Encoder 16
Reference numeral 2 denotes a data amount of compression-encoded data obtained by compression-encoding a picture of the input video data S12 in a preliminary and sequential manner, and compression-encoding a j-th (j is an integer) picture; And outputs the DC component value and AC component power value of the video data to the host computer 20.

【００３５】例えば、エンコーダ１８に入力される遅延
映像データＳ１６は、ＦＩＦＯメモリ１６０によりＬピ
クチャーだけ遅延されているので、図４（Ｂ）に示すよ
うに、エンコーダ１８が、遅延映像データＳ１６の第ｊ
（ｊは整数）番目のピクチャー（図４（Ｂ）のピクチャ
ーａ）を圧縮符号化している際には、エンコーダ１６２
は、映像データＳ１２の第ｊ番目のピクチャーからＬピ
クチャー分先の第（ｊ＋Ｌ）番目のピクチャー（図４
（Ｂ）のピクチャーｂ）を圧縮符号化していることにな
る。従って、エンコーダ１８が遅延映像データＳ１６の
第ｊ番目のピクチャーの圧縮符号化を開始する際には、
エンコーダ１６２は映像データＳ１２の第ｊ番目〜第
（ｊ＋Ｌ−１）番目のピクチャー（図４（Ｂ）の範囲
ｃ）の圧縮符号化を完了しており、これらのピクチャー
の圧縮符号化後の実難度データＤ_j，Ｄ _j+1，Ｄ_j+2，
…，Ｄ_j+L-1は、ホストコンピュータ２０により既に算
出されている。For example, the delay input to the encoder 18
The video data S16 is stored in the L memory by the FIFO memory 160.
As shown in FIG. 4 (B),
As described above, the encoder 18 determines the j-th
(J is an integer) picture (picture of FIG. 4B)
-A), the encoder 162
Are L-pins from the j-th picture of the video data S12.
The (j + L) -th picture ahead of the kuture (FIG. 4
This means that picture b) of (B) is compression-encoded.
You. Therefore, the encoder 18 transmits the delayed video data S16.
When starting the compression encoding of the j-th picture,
The encoder 162 is configured to j-th to
(J + L-1) -th picture (range of FIG. 4B)
c) the compression encoding has been completed and these pictures
Difficulty data D after compression encoding_j, D _{j + 1}, D_{j + 2},
…, D_{j + L-1}Is already calculated by the host computer 20.
Has been issued.

【００３６】ホストコンピュータ２０は、下に示す式１
により、エンコーダ１８が遅延映像データＳ１６の第ｊ
番目のピクチャーを圧縮符号化して得られる圧縮映像デ
ータに割り当てる目標データ量Ｔ_jを算出し、算出した
目標データ量Ｔ_jを量子化制御回路１８０に設定する。The host computer 20 uses the following formula 1
As a result, the encoder 18 sets the j-th
A target data amount T _j to be allocated to the compressed video data obtained by compression-coding the third picture is calculated, and the calculated target data amount T _j is set in the quantization control circuit 180.

【００３７】[0037]

【数１】 (Equation 1)

【００３８】但し、式１において、Ｄ_jは映像データＳ
１２の第ｊ番目のピクチャーの実難度データであり、
Ｒ’_jは、映像データＳ１２，Ｓ１６の第ｊ番目〜第
（ｊ＋Ｌ−１）番目のピクチャーに割り当てることがで
きる目標データ量の平均であり、Ｒ’_jの初期値（Ｒ’
₁）は、圧縮映像データの各ピクチャーに平均して割り
当て可能な目標データ量であり、下に示す式２で表さ
れ、エンコーダ１８が圧縮映像データを１ピクチャー分
生成する度に、式３に示すように更新される。Where D _j is the video data S
12 is the actual difficulty data of the 12 th picture,
R ′ _j is the average of the target data amount that can be allocated to the j-th to (j + L−1) -th pictures of the video data S12 and S16, and the initial value of R ′ _j (R ′
₁ ) is a target data amount that can be allocated to each picture of the compressed video data on average, and is expressed by the following equation (2). Each time the encoder 18 generates one picture of the compressed video data, Updated as shown.

【００３９】[0039]

【数２】 (Equation 2)

【００４０】[0040]

【数３】 (Equation 3)

【００４１】なお、式３中の数値ビットレート(Bit rat
e)は、通信回線の伝送容量や、記録媒体の記録容量に基
づいて決められる１秒当たりのデータ量（ビット量）を
示し、ピクチャーレート(Picture rate)は、映像データ
に含まれる１秒当たりのピクチャーの数（３０枚／秒
（ＮＴＳＣ），２５枚／秒（ＰＡＬ））を示し、数値Ｆ
_j+Lは、ピクチャータイプに応じて定められるピクチャ
ー当たりの平均データ量を示す。エンコーダ１８のＤＣ
Ｔ回路１６６は、入力される遅延映像データＳ１６の第
ｊ番目のピクチャーをＤＣＴ処理し、量子化回路１６８
に対して出力する。量子化回路１６８は、ＤＣＴ回路１
６６から入力された第ｊ番目のピクチャーの周波数領域
のデータを、量子化制御回路１８０が目標データ量Ｔ_j
に基づいて調節する量子化値Ｑ_jにより量子化し、量子
化データとして可変長符号化回路１７０に対して出力す
る。可変長符号化回路１７０は、量子化回路１６８から
入力された第ｊ番目のピクチャーの量子化データを可変
長符号化して、ほぼ、目標データ量Ｔ_jに近いデータ量
の圧縮映像データＶＯＵＴを生成して出力する。Note that the numerical bit rate (Bit rat
e) is based on the transmission capacity of the communication line and the recording capacity of the recording medium.
Data amount per second (bit amount)
The picture rate (Picture rate) is
Number of pictures per second (30 pictures / sec.
(NTSC), 25 sheets / second (PAL))
_{j + L}Is a picture determined according to the picture type
Shows the average amount of data per group. DC of encoder 18
The T circuit 166 is configured to output the delayed video data S16
DCT processing is performed on the j-th picture, and a quantization circuit 168
Output to The quantization circuit 168 is a DCT circuit 1
Frequency domain of the j-th picture input from
Of the target data amount T by the quantization control circuit 180._j
Quantized value Q adjusted based on_jQuantized by
Output to the variable length coding circuit 170 as encoded data.
You. The variable length coding circuit 170
Variable quantized data of the input j-th picture
After long encoding, the target data amount T_jData volume close to
And outputs the compressed video data VOUT.

【００４２】同様に、図４（Ｂ）に示すように、エンコ
ーダ１８が、遅延映像データＳ１６の第（ｊ＋１）番目
のピクチャー（図４（Ｃ）のピクチャーａ’）を圧縮符
号化している際には、エンコーダ１６２は、映像データ
Ｓ１２の第（ｊ＋１）番目〜第（ｊ＋Ｌ）番目のピクチ
ャー（図４（Ｃ）の範囲ｃ’）の圧縮符号化を完了し、
これらのピクチャーの実難度データＤ_j+1，Ｄ_j+2，Ｄ
_j+3，・・・，Ｄ_j+Lは、ホストコンピュータ２０によ
り既に算出されている。Similarly, as shown in FIG. 4B, when the encoder 18 compresses and encodes the (j + 1) -th picture (the picture a 'in FIG. 4C) of the delayed video data S16. , The encoder 162 completes the compression encoding of the (j + 1) -th to (j + L) -th pictures (range c ′ in FIG. 4C) of the video data S12,
The actual difficulty data D _{j + 1} , D _{j + 2} , D of these pictures
_{j + 3} ,..., D _{j + L} have already been calculated by the host computer 20.

【００４３】ホストコンピュータ２０は、式１により、
エンコーダ１８が遅延映像データＳ１６の第（ｊ＋１）
番目のピクチャーを圧縮符号化して得られる圧縮映像デ
ータに割り当てる目標データ量Ｔ_j+1を算出し、エンコ
ーダ１８の量子化制御回路１８０に設定する。The host computer 20 uses the following equation (1).
The encoder 18 determines the (j + 1) th of the delayed video data S16.
A target data amount T _{j + 1} to be allocated to compressed video data obtained by compression-encoding the third picture is calculated and set in the quantization control circuit 180 of the encoder 18.

【００４４】エンコーダ１８は、ホストコンピュータ２
０から量子化制御回路１８０に設定された目量データ量
Ｔ_jに基づいて第（ｊ＋１）番目のピクチャーを圧縮符
号化し、目標データ量Ｔ_j+1に近いデータ量の圧縮映像
データＶＯＵＴを生成して出力する。さらに以下、同様
に、映像データ圧縮装置１は、遅延映像データＳ１６の
第ｋ番目のピクチャーを、量子化値Ｑ_k（ｋ＝ｊ＋２，
ｊ＋３，…）をピクチャーごとに変更して順次、圧縮符
号化し、圧縮映像データＶＯＵＴとして出力する。The encoder 18 is connected to the host computer 2
From 0, the (j + 1) -th picture is compression-encoded based on the _scale data amount _Tj set in the quantization control circuit 180, and compressed video data VOUT having a data size close to the target data size _{Tj + 1} is generated. And output. In the same manner, the video data compression device 1 similarly converts the k-th picture of the delayed video data S16 into a quantized value Q _k (k = j + 2,
j + 3,...) are changed for each picture, and are sequentially compression-encoded and output as compressed video data VOUT.

【００４５】以上説明したように、第１の実施形態に示
した映像データ圧縮装置１によれば、短時間で非圧縮映
像データＶＩＮの絵柄の難度を算出し、算出した難度に
応じた圧縮率で適応的に非圧縮映像データＶＩＮを圧縮
符号化することができる。つまり、第１の実施形態に示
した映像データ圧縮装置１によれば、２パスエンコード
方式と異なり、ほぼ実時間的に、非圧縮映像データＶＩ
Ｎの絵柄の難度に基づいて適応的に非圧縮映像データＶ
ＩＮを圧縮符号化をすることができ、実況放送といった
実時間性を要求される用途に応用可能である。なお、第
１の実施形態に示した他、本発明に係るデータ多重化装
置１は、エンコーダ１６２が圧縮符号化した圧縮映像デ
ータのデータ量を、そのまま難度データとして用い、ホ
ストコンピュータ２０の処理の簡略化を図る等、種々の
構成を採ることができる。As described above, according to the video data compression apparatus 1 shown in the first embodiment, the degree of difficulty of the pattern of the non-compressed video data VIN is calculated in a short time, and the compression ratio according to the calculated degree of difficulty is calculated. Thus, the non-compressed video data VIN can be adaptively compression-encoded. That is, according to the video data compression apparatus 1 shown in the first embodiment, unlike the two-pass encoding method, the non-compressed video data VI
N based on the degree of difficulty of the picture
IN can be compression-encoded, and can be applied to applications requiring real-time performance such as live broadcasting. In addition to the data multiplexing apparatus 1 according to the present invention, the data multiplexing apparatus 1 according to the present invention uses the data amount of the compressed video data compressed and encoded by the encoder 162 as difficulty data as it is, Various configurations can be adopted, such as simplification.

【００４６】第２実施形態第１の実施形態に示した簡易２パスエンコード方式によ
れば、実時間かつ、絵柄の難度に応じた適応的な非圧縮
映像データに対する圧縮符号化処理が可能である。しか
しながら、第１の実施形態に示した簡易２パスエンコー
ド方式を用いた場合、実時間性が厳しく要求される場合
には、ＦＩＦＯメモリ１６０の遅延時間を大きくするこ
とができず、真に適切な目標データ量Ｔ_jの算出が難し
く、圧縮映像データＶＯＵＴを伸長復号して得られる映
像の品質が低下してしまう可能性がある。 Second Embodiment According to the simple two-pass encoding method shown in the first embodiment, it is possible to perform compression encoding processing on uncompressed video data adaptively in real time according to the difficulty of a picture. . However, when the simple two-pass encoding method shown in the first embodiment is used, when strict real-time performance is required, the delay time of the FIFO memory 160 cannot be increased, and a truly appropriate calculation is difficult for the target data amount T _j, the quality of the image obtained compressed video data VOUT to expansion decoding is likely to decrease.

【００４７】第２の実施形態においては、第１の実施形
態に示した映像データ圧縮装置１（図１）を用い、ホス
トコンピュータ２０の処理内容を変更して、ＦＩＦＯメ
モリ１６０の遅延時間を長くしなくても適切な目標デー
タ量Ｔ_jの値を得ることができるように、非圧縮映像デ
ータをＬピクチャー分、予備的に圧縮符号化して得られ
た圧縮映像データの第ｊ番目のピクチャー〜第（ｊ＋Ｌ
−１）番目のピクチャーの実難度データＤ_j〜Ｄ_j+L-1
から、圧縮映像データの第（ｊ＋Ｌ）番目のピクチャー
〜第（ｊ＋Ｌ＋Ｂ）番目のピクチャー（Ｂは整数）の難
度データ（予測難度データ）Ｄ_j+L〜Ｄ_j+L+Bを算出
し、実際に得られた難度データＤ_j〜Ｄ_j+ _L-1（実難度
データ）および予測によって得られた難度データＤ’
_j+L〜Ｄ’_j+ _L+Bに基づいて、第１の実施形態に示した
簡易２パスエンコード方式よりも適切な目標データ量Ｔ
_jの値を得ることができる圧縮符号化方式（予測簡易２
パスエンコード方式）を説明する。In the second embodiment, the processing content of the host computer 20 is changed by using the video data compression apparatus 1 (FIG. 1) shown in the first embodiment, and the delay time of the FIFO memory 160 is increased. In order to obtain an appropriate value of the target data amount _Tj without performing the above processing, the j-th picture to the L-th picture of the uncompressed video data and the j-th picture of the compressed video data obtained by preliminary compression encoding are used. (J + L
-1) Actual difficulty data D _{j to} D _{j + L-1 of the first} picture
From the (j + L) -th picture to the (j + L + B) -th picture (B is an integer) of the compressed video data, the difficulty data (prediction difficulty data) D _{j + L to} D _{j + L + B} are calculated. Difficulty data D _{j to} D _{j +} _L-1 (actual difficulty data) and difficulty data D ′ obtained by prediction
_{Based on j + L to} D ′ _{j +} _{L + B} , the target data amount T is more appropriate than the simple two-pass encoding method shown in the first embodiment.
_The compression encoding method (simple prediction 2
The path encoding method will be described.

【００４８】まず、第２の実施形態で説明する予測簡易
２パスエンコード方式を概念的に説明する。予測簡易２
パスエンコード方式は、徐々に絵柄が難しくなってゆ
く、つまり、徐々に圧縮符号化時のＤＣＴ処理後の高い
周波数成分が多くなり、動きが速くなってゆく非圧縮映
像データの絵柄は、さらに難しくなってゆき、逆に、徐
々に絵柄が難しくなくなって（簡単になって）ゆく非圧
縮映像データの絵柄は、さらに簡単になってゆくであろ
うと予測可能であることを前提する。First, a simplified predictive two-pass encoding method described in the second embodiment will be conceptually described. Simple prediction 2
In the path encoding method, the picture becomes gradually more difficult, that is, the picture of the non-compressed video data, in which the high frequency components after the DCT processing in the compression encoding gradually increase and the movement becomes faster, becomes more difficult. On the contrary, it is assumed that the pattern of the uncompressed video data, in which the pattern gradually becomes difficult (simplifies), can be predicted to be further simplified.

【００４９】つまり、予測簡易２パスエンコード方式
は、ホストコンピュータ２０が、この前提に基づいて、
さらに絵柄が難しくなってゆくと予測される場合には、
さらに絵柄が難しいピクチャーに備えて、その時点で圧
縮符号化しているピクチャーに割り当てる目標データ量
を節約し、逆に、さらに絵柄が簡単になってゆくと予測
される場合には、その時点で圧縮符号化しているピクチ
ャーに割り当てる目標データ量を増やすようにエンコー
ダ１８に対する圧縮率の制御を行う。That is, in the simple predictive two-pass encoding method, the host computer 20 uses the
If the picture is expected to become more difficult,
In preparation for a picture with a more difficult picture, the target data amount to be allocated to the picture currently being compression-encoded is saved, and conversely, if the picture is expected to become simpler, the compression will be performed at that point. The compression rate of the encoder 18 is controlled so as to increase the target data amount allocated to the picture being coded.

【００５０】さらに、予測簡易２パスエンコード方式の
概念的な説明を続ける。映像データは、一般的に、時間
方向および空間方向について相関性が高く、映像データ
の圧縮符号化は、これらの相関性に着目し、冗長性を除
くことにより行われる。時間方向について相関性が高い
ということは、現時点の非圧縮映像データのピクチャー
の難度とそれ以降の非圧縮映像データのピクチャーの難
度とが近いということを意味する。また、難度の増減の
傾向も、現時点までの難度の増減の傾向がそれ以降も続
くことが多い。Further, a conceptual description of the simple predictive two-pass encoding method will be continued. Video data generally has high correlation in the time direction and the spatial direction, and compression coding of video data is performed by focusing on these correlations and removing redundancy. The high correlation in the time direction means that the difficulty level of the picture of the current uncompressed video data is close to the difficulty level of the picture of the subsequent uncompressed video data. In addition, the tendency of the increase and decrease of the difficulty level up to the present time often continues thereafter.

【００５１】具体例を挙げると、カメラが静止状態から
ゆっくりとカメラを水平方向に回し初め、最後に一定の
回転速度で回転しながら、静止している物体を撮影する
場合の非圧縮映像データの絵柄を考える。最初はカメラ
が停止状態であるため、静止映像が撮影され、絵柄の難
度は低くなる。次に、カメラを回し始めて１〜２秒後に
一定の回転速度になると仮定すると、カメラを回し始め
て１〜２秒間は絵柄の難度は高くなる傾向を示す。この
状態を、映像データ圧縮装置１側から見ると、数ＧＯＰ
分の圧縮映像データを生成する間、入力される非圧縮映
像データの絵柄の難度が高くなる傾向が続くことにな
る。As a specific example, the non-compressed video data of the case where the camera starts rotating slowly in the horizontal direction from the stationary state, and finally rotates at a constant rotational speed while photographing a stationary object. Think about the design. At first, since the camera is in a stopped state, a still image is captured, and the difficulty of the picture is reduced. Next, assuming that the rotation speed becomes constant after one to two seconds from starting to rotate the camera, the difficulty of the picture tends to increase from one to two seconds after starting to rotate the camera. When this state is viewed from the video data compression device 1 side, several GOPs
During the generation of the compressed video data, the pattern of the input non-compressed video data tends to be more difficult.

【００５２】従って、この具体例に示したような場合に
は、非圧縮映像データの絵柄の難度が増大傾向を示した
場合に、それ以降の絵柄の難度が増大傾向を示すと予測
するのは妥当である。以下に説明する予測簡易２パスエ
ンコード方式は、このような難度および難度の増減傾向
の時間的相関性を積極的に利用して、圧縮映像データの
各ピクチャーに対して、第１の実施形態に示した簡易２
パスエンコード方式においてよりも適切な目標データ量
の割り当てを行おうとするものである。Therefore, in the case shown in this specific example, when the difficulty of the pattern of the non-compressed video data shows a tendency to increase, it is predicted that the difficulty of the pattern after that shows a tendency to increase. Reasonable. The simple predictive two-pass encoding method described below positively utilizes the temporal correlation of the difficulty and the increasing / decreasing tendency of the difficulty to apply the first embodiment to each picture of the compressed video data. Simple 2 shown
It is intended to allocate a more appropriate target data amount than in the path encoding method.

【００５３】以下、第２の実施形態における映像データ
圧縮装置１の予測簡易２パスエンコードの動作を説明す
る。図５（Ａ）〜（Ｃ）は、映像データ圧縮装置１の動
作を示す図である。エンコーダ制御部１２は、第１の実
施形態においてと同様に、映像データ圧縮装置１に入力
された非圧縮映像データＶＩＮに対して、エンコーダ制
御部１２により符号化順にピクチャーを並べ替える等の
前処理を行い、図５（Ａ）に示すように映像データＳ１
２としてＦＩＦＯメモリ１６０およびエンコーダ１６２
に対して出力する。The operation of the predictive simple two-pass encoding of the video data compression device 1 according to the second embodiment will be described below. 5A to 5C are diagrams illustrating the operation of the video data compression device 1. As in the first embodiment, the encoder control unit 12 performs pre-processing such as rearranging pictures in the coding order by the encoder control unit 12 on the uncompressed video data VIN input to the video data compression device 1. Is performed, and as shown in FIG.
2 as FIFO memory 160 and encoder 162
Output to

【００５４】ＦＩＦＯメモリ１６０は、第１の実施形態
においてと同様に、入力された映像データＳ１２の各ピ
クチャーをＬピクチャー分だけ遅延し、エンコーダ１８
に対して出力する。エンコーダ１６２は、第１の実施形
態においてと同様に、入力された映像データＳ１２のピ
クチャーを予備的に順次、圧縮符号化し、第ｊ（ｊは整
数）番目のピクチャーを圧縮符号化して得られた圧縮符
号化データのデータ量、ＤＣＴ処理後の映像データのＤ
Ｃ成分の値およびＡＣ成分の電力値をホストコンピュー
タ２０に対して出力する。ホストコンピュータ２０は、
エンコーダ１６２から入力されたこれらの値に基づい
て、実難度データＤ_jを順次、算出する。As in the first embodiment, the FIFO memory 160 delays each picture of the input video data S12 by L pictures, and
Output to As in the first embodiment, the encoder 162 preliminary compresses and encodes the picture of the input video data S12 sequentially, and compresses and encodes the j-th (j is an integer) picture. Data amount of compression encoded data, D of video data after DCT processing
The value of the C component and the power value of the AC component are output to the host computer 20. The host computer 20
Based on these values input from the encoder 162, sequentially, to calculate the real difficulty data D _j.

【００５５】例えば、エンコーダ１８に入力される遅延
映像データＳ１６は、ＦＩＦＯメモリ１６０によりＬピ
クチャーだけ遅延されているので、図５（Ｂ）に示すよ
うに、エンコーダ１８が、遅延映像データＳ１６の第ｊ
番目のピクチャー（図５（Ｂ）のピクチャーａ）を圧縮
符号化している際には、エンコーダ１６２は、第１の実
施形態においてと同様に、映像データＳ１２の第ｊ番目
のピクチャーからＬピクチャー分先の第（ｊ＋Ｌ）番目
のピクチャー（図５（Ｂ）のピクチャーｂ）を圧縮符号
化していることになる。For example, since the delayed video data S16 input to the encoder 18 is delayed by L pictures by the FIFO memory 160, as shown in FIG. j
When the third picture (picture a in FIG. 5B) is compression-encoded, the encoder 162 performs L-pictures from the j-th picture of the video data S12 in the same manner as in the first embodiment. This means that the preceding (j + L) -th picture (picture b in FIG. 5B) has been compression-encoded.

【００５６】従って、エンコーダ１８が遅延映像データ
Ｓ１６の第ｊ番目のピクチャーの圧縮符号化を開始する
際には、エンコーダ１６２は映像データＳ１２の第（ｊ
−Ａ）番目〜第（ｊ＋Ｌ−１）番目のピクチャー（図５
（Ｂ）の範囲ｃ、但し、図５はＡ＝０の場合を示す）の
圧縮符号化を完了し、これらのピクチャーの圧縮符号化
後のデータ量、および、ＤＣＴ処理後の映像データのＤ
Ｃ成分の値およびＡＣ成分の電力値をホストコンピュー
タ２０に対して出力している。ホストコンピュータ２０
は、エンコーダ１６２から入力されたこれらの値に基づ
いて、難度データ（実難度データ、図５（Ｂ）の範囲
ｄ）Ｄ_j-A，Ｄ_j-A+1，…，Ｄ_j，Ｄ_j+1，Ｄ_j+2，
…，Ｄ_j+L-1の算出を既に終了している。なお、Ａは整
数であり、正負を問わない。Therefore, when the encoder 18 starts compression-encoding the j-th picture of the delayed video data S16, the encoder 162 sets the (j) -th picture of the video data S12.
-A) -th to (j + L-1) -th pictures (FIG. 5)
(B), where FIG. 5 shows the case where A = 0), completes the compression encoding, the data amount of these pictures after compression encoding, and the D of the video data after DCT processing.
The value of the C component and the power value of the AC component are output to the host computer 20. Host computer 20
_Are based on these values input from the encoder 162, based on the difficulty data (actual difficulty data, range d in FIG. 5B) D _jA , D _{j-A + 1} ,..., D _j , D _{j + 1} , D _{j + 2} ,
.., D _{j + L−1} has already been calculated. Note that A is an integer, and may be either positive or negative.

【００５７】ホストコンピュータ２０は、実難度データ
Ｄ_j-A，Ｄ_j-a+1，…，Ｄ_j，Ｄ_j+ ₁，Ｄ_j+2，…，Ｄ
_j+L-1に基づいて、映像データＳ１２の第（ｊ＋Ｌ）番
目〜第（ｊ＋Ｌ＋Ｂ）番目のピクチャーの圧縮符号化後
の難度データ（予測難度データ、図５（Ｂ）の範囲ｅ）
Ｄ’_j+L，Ｄ’_j+L+1，Ｄ’_j+L+2，…，Ｄ’_j+L+Bを
予測し、下に示す式４により、遅延映像データＳ１６の
第ｊ番目のピクチャーの圧縮符号化後の目標データ量Ｔ
_jを算出する。従って、遅延映像データＳ１６の第ｊ番
目のピクチャーの圧縮符号化後の目標データ量Ｔ_jを算
出するために、実難度データと予測難度データとを含め
て、図５（Ｂ）の範囲ｃの（Ａ＋Ｌ＋Ｂ＋１）ピクチャ
ー分の難度データを用いることになる。The host computer 20 stores actual difficulty data D _jA , D _{j-a + 1} ,..., D _j , D _{j +} ₁ , D _{j + 2} _,.
_{Based on j + L-1} , the difficulty data after the compression encoding of the (j + L) -th to (j + L + B) -th pictures of the video data S12 (predicted difficulty data, range e in FIG. 5B)
D ′ _{j + L} , D ′ _{j + L + 1} , D ′ _{j + L + 2} ,..., D ′ _{j + L + B,} and the j-th of the delayed video data S16 Target data amount T after compression encoding of the picture
Calculate _j . Therefore, in order to calculate the target amount of data T _j of the compressed encoding of the j-th picture of the delayed video data S16, including the real difficulty data of the predictive difficulty data, the range c shown in FIG. 5 (B) The difficulty data for (A + L + B + 1) pictures is used.

【００５８】[0058]

【数４】 (Equation 4)

【００５９】なお、式４の各記号は、式１の各記号に同
じである。エンコーダ１８は、第１の実施形態と同様
に、ホストコンピュータ２０により量子化制御回路１８
０に設定された目標データ量Ｔ_jに基づいて、目標デー
タ量Ｔ_jに近いデータ量の圧縮映像データＶＯＵＴを生
成して出力する。さらに、ホストコンピュータ２０は、
図５（Ｂ）に示した動作と同様に、遅延映像データＳ１
６の第（ｊ＋１）番目のピクチャー（図５（Ｃ）のピク
チャーａ’）に対しても、映像データＳ１２の第（ｊ＋
Ｌ＋１）番目のピクチャー（図５（Ｃ）のピクチャー
ｂ’）以前の図５（Ｃ）の範囲ｄ’の実難度データＤ
_j-A+1，Ｄ_j-A+2，…，Ｄ_j，Ｄ_j+1，Ｄ_j+2，…，Ｄ
_j+L、および、図５（Ｃ）の範囲ｅ’に示す予測難度デ
ータ、Ｄ’_j+L+1，Ｄ’_j+L+2，Ｄ’_j+L+3，…，Ｄ’
_j+L+B+1、つまり、図５（Ｃ）の範囲ｃ’に示す実難度
データと予測難度データとに基づいて、遅延映像データ
Ｓ１６の第（ｊ＋１）番目のピクチャーの圧縮符号化後
の目標データ量Ｔ_j+1を算出する。エンコーダ１８は、
ホストコンピュータ２０が算出した目量データ量Ｔ_j+1
に基づいて、遅延映像データＳ１６の第（ｊ＋１）番目
のピクチャーを圧縮符号化し、目標データ量Ｔ_j+1に近
いデータ量の圧縮符号化データＶＯＵＴを生成する。な
お、以上の映像データ圧縮装置１の予測簡易２パスエン
コード動作は、遅延映像データＳ１６の第（ｊ＋１）番
目のピクチャーに対しても同様である。Note that each symbol in Equation 4 is the same as each symbol in Equation 1.
The same. Encoder 18 is the same as in the first embodiment.
And the quantization control circuit 18 by the host computer 20.
Target data amount T set to 0_jBased on the goal date
Volume T_jProduces compressed video data VOUT with a data amount close to
And output. Further, the host computer 20
As in the operation shown in FIG. 5B, the delayed video data S1
6 (j + 1) -th picture (picture in FIG. 5C)
(Char + '), the (j +
L + 1) th picture (picture in FIG. 5C)
b ') The actual difficulty data D in the range d' in FIG.
_{j-A + 1}, D_{j-A + 2}, ..., D_j, D_{j + 1}, D_{j + 2}, ..., D
_{j + L}, And the prediction difficulty data shown in a range e ′ in FIG.
Data, D '_{j + L + 1}, D '_{j + L + 2}, D '_{j + L + 3}, ..., D '
_{j + L + B + 1}In other words, the actual difficulty shown in the range c 'in FIG.
Delay video data based on the
After compression encoding of the (j + 1) th picture in S16
Target data amount T_{j + 1}Is calculated. The encoder 18
Scale data amount T calculated by the host computer 20_{j + 1}
(J + 1) -th of the delayed video data S16 based on
Is compressed and coded, and the target data amount T_{j + 1}Close to
A large amount of compressed encoded data VOUT is generated. What
Note that the prediction simple 2-pass engine of the above video data compression apparatus 1
The code operation is the (j + 1) th of the delayed video data S16.
The same applies to eye pictures.

【００６０】以下、図６を参照して、第２の実施形態に
おける映像データ圧縮装置１の動作を整理して説明す
る。図６は、第２の実施形態における映像データ圧縮装
置１（図１）の動作を示すフローチャートである。図６
に示すように、ステップ１０２（Ｓ１０２）において、
ホストコンピュータ２０は、式１等に用いられる数値
ｊ，Ｒ’₁を、ｊ＝−（Ｌ−１），Ｒ’₁＝(Bit rate
×(L+B))/Picture rate として初期化する。Hereinafter, the operation of the video data compression apparatus 1 according to the second embodiment will be summarized and described with reference to FIG. FIG. 6 is a flowchart showing the operation of the video data compression device 1 (FIG. 1) in the second embodiment. FIG.
As shown in step 102, in step 102 (S102),
The host computer 20 converts the numerical values j and R ′ ₁ used in Expression 1 and the like into j = − (L−1), R ′ ₁ = (Bit rate
× (L + B)) / Picture rate

【００６１】ステップ１０４（Ｓ１０４）において、ホ
ストコンピュータ２０は、数値ｊが０より大きいか否か
を判断する。数値ｊが０より大きい場合にはＳ１０６の
処理に進み、小さい場合にはＳ１１０の処理に進む。ス
テップ１０６（Ｓ１０６）において、エンコーダ１６２
は、映像データＳ１２の第（ｊ＋Ｌ）番目のピクチャー
を圧縮符号化し、実難度データＤ_j+Lを生成する。In step 104 (S104), the host computer 20 determines whether or not the numerical value j is larger than 0. If the value j is larger than 0, the process proceeds to S106, and if it is smaller, the process proceeds to S110. In step 106 (S106), the encoder 162
Compresses and encodes the (j + L) -th picture of the video data S12 to generate actual difficulty data D _{j + L.}

【００６２】ステップ１０８（Ｓ１０８）において、ホ
ストコンピュータ２０は数値ｊをインクリメントする
（ｊ＝ｊ＋１）。ステップ１１０（Ｓ１１０）におい
て、ホストコンピュータ２０は、遅延映像データＳ１６
に第ｊ番目のピクチャーが存在するか否かを判断する。
第ｊ番目のピクチャーが存在する場合にはＳ１１２の処
理に進み、存在しない場合には圧縮符号化処理を終了す
る。In step 108 (S108), the host computer 20 increments the numerical value j (j = j + 1). In step 110 (S110), the host computer 20 transmits the delayed video data S16
It is determined whether the j-th picture exists.
If the j-th picture exists, the process proceeds to S112; otherwise, the compression encoding process ends.

【００６３】ステップ１１２（Ｓ１１２）において、ホ
ストコンピュータ２０は、数値ｊが数値Ａよりも大きい
か否かを判断する。数値ｊが数値Ａよりも大きい場合に
はＳ１１４の処理に進み、小さい場合にはＳ１１６の処
理に進む。ステップ１１４（Ｓ１１４）において、ホス
トコンピュータ２０は、実難度データＤ_j-A〜Ｄ_j+L-1
に基づいて、予測難度データＤ’_j+L〜Ｄ’_j+L+Bを算
出する。ステップ１１６（Ｓ１１６）において、ホスト
コンピュータ２０は実難度データＤ₁〜Ｄ_j+L-1から、
予測難度データＤ’_j+L〜Ｄ’_j+L+Bを算出する。In step 112 (S112), the host computer 20 determines whether or not the numerical value j is larger than the numerical value A. When the numerical value j is larger than the numerical value A, the process proceeds to S114, and when the numerical value j is smaller, the process proceeds to S116. In step 114 (S114), the host computer 20 _executes the actual difficulty data D _{jA to} D _{j + L-1.}
, The predicted difficulty level data D ′ _{j + L to} D ′ _{j + L + B} are calculated. At step 116 (S116), the host computer 20 is the real difficulty data _{_{D 1 ~D j + L-1}} ,
The prediction difficulty data D ′ _{j + L to} D ′ _{j + L + B} are calculated.

【００６４】ステップ１１８（Ｓ１１８）において、ホ
ストコンピュータ２０は、式４を用いて目標データ量Ｔ
_jを算出し、エンコーダ１８の量子化制御回路１８０に
設定する。さらに、エンコーダ１８は、量子化制御回路
１８０に設定された目標データ量Ｔ_jに基づいて遅延映
像データＳ１６の第ｊ番目のピクチャーを圧縮符号化
し、第ｊ番目のピクチャーから実際に得られた圧縮映像
データのデータ量Ｓ_jをホストコンピュータ２０に対し
て出力する。ステップ１２０（Ｓ１２０）において、ホ
ストコンピュータ２０は、エンコーダ１８からのデータ
量Ｓ_jを記憶し、さらに、映像データＳ１２の第（ｊ＋
Ｌ）番目のピクチャーの実難度データＤ_j+Lを出力す
る。In step 118 (S118), the host computer 20 calculates the target data amount T
_j is calculated and set in the quantization control circuit 180 of the encoder 18. Further, the encoder 18 compression-encodes the j-th picture of the delayed video data S16 based on the target data amount T _j set in the quantization control circuit 180, and compresses the compressed picture actually obtained from the j-th picture. The data amount _Sj of the video data is output to the host computer 20. In step 120 (S120), the host computer 20 stores the data amount _Sj from the encoder 18, and further stores the data amount _Sj of the video data S12.
L) Output the actual difficulty data D _{j + L} of the picture.

【００６５】ステップ１２２（Ｓ１２２）において、エ
ンコーダ１８は、遅延映像データＳ１６の第ｊ番目を圧
縮符号化して得られた圧縮映像データＶＯＵＴを外部に
出力する。ステップ１２４（Ｓ１２４）において、ホス
トコンピュータ２０は、ピクチャータイプに応じて、式
３中に用いられる数値Ｆ_j+Lを算出する。ステップ１２
６（Ｓ１２６）において、ホストコンピュータ２０は、
式３に示した演算（Ｒ’_j+1＝Ｒ’_j−Ｓ_j＋Ｆ_j+L）
を行う。In step 122 (S122), the encoder 18 outputs the compressed video data VOUT obtained by compression-coding the j-th delayed video data S16 to the outside. In step 124 (S124), the host computer 20 calculates the numerical value F _{j + L} used in Expression 3 according to the picture type. Step 12
6 (S126), the host computer 20
The operation shown in Equation 3 (R ′ _{j + 1} = R ′ _j −S _j + F _{j + L} )
I do.

【００６６】以上説明したように、第２の実施形態に示
した映像データ圧縮装置１による予測簡易２パスエンコ
ードによれば、短時間で非圧縮映像データＶＩＮの絵柄
の難度を算出し、算出した難度に基づいて予測した難度
をさらに用いて適応的に非圧縮映像データＶＩＮを圧縮
符号化することができ、簡易２パスエンコード方式に比
べて、より適切な目標データ量を圧縮映像データの各ピ
クチャーに割り当てることが可能である。従って、予測
簡易２パスエンコード方式による圧縮映像データを伸長
復号した場合、簡易２パスエンコード方式による圧縮映
像データを伸長復号した場合に比べて、より高品質な映
像を得ることができる。As described above, according to the prediction simple two-pass encoding by the video data compression apparatus 1 shown in the second embodiment, the difficulty of the pattern of the uncompressed video data VIN is calculated in a short time. The uncompressed video data VIN can be adaptively compression-encoded by further using the degree of difficulty predicted based on the degree of difficulty, and a more appropriate target data amount can be set for each picture of the compressed image data as compared with the simple two-pass encoding method. Can be assigned to Therefore, when the compressed video data is expanded and decoded by the predictive simple two-pass encoding method, a higher quality video can be obtained as compared with the case where the compressed video data is expanded and decoded by the simple two-pass encoding method.

【００６７】第３実施形態以下、本発明の第３の実施形態として、編集処理によ
り、複数の非圧縮映像データ（以下、非圧縮映像データ
をシーンとも記す）を連続的に接続して１つの非圧縮映
像データ（編集映像データ）とし、この複数のシーンか
らなる編集映像データを、第１の実施形態に示した映像
データ圧縮装置１（図１）を用いた簡易２パスエンコー
ド方式により圧縮符号化する方法を説明する。 Third Embodiment Hereinafter, as a third embodiment of the present invention, a plurality of uncompressed video data (hereinafter, also referred to as “scene”) are connected by editing processing to form one uncompressed video data. Uncompressed video data (edited video data) is used, and the edited video data composed of the plurality of scenes is compressed by a simple two-pass encoding method using the video data compression device 1 (FIG. 1) shown in the first embodiment. A method for converting the data will be described.

【００６８】図７（Ａ）〜（Ｃ）は、第２の実施形態に
おける予測簡易２パスエンコード方式、および、第３の
実施形態における改良予測簡易２パスエンコード方式に
よる、シーンチェンジの前後のピクチャーに対する圧縮
符号化を示す図である。第２の実施形態に示した予測簡
易２パスエンコード方式は、図７（Ａ）に示すように入
力される映像データに含まれるピクチャー間の時間的な
相関性を利用し、圧縮映像データのピクチャーそれぞれ
のデータ量を予測する。しかしながら、図７（Ｂ）に示
すタイミングでシーンチェンジ(scene change)が生じた
場合、シーンチェンジの前後では、ピクチャー間に相関
性がないので、図７（Ｃ）に示すように、シーンチェン
ジの前の難度データに基づいてシーンチェンジの後のピ
クチャーに対する目標データ量Ｔ_jを算出することとな
り、第２の実施形態に示した予測簡易２パスエンコード
方式の効果を得ることができないばかりか、却って、伸
長復号後の映像の品質が悪化してしまう可能性がある。FIGS. 7A to 7C show pictures before and after a scene change by the simple predictive two-pass encoding method according to the second embodiment and the improved simple predictive two-pass encoding method according to the third embodiment. FIG. 4 is a diagram illustrating compression encoding for. The simplified predictive two-pass encoding method shown in the second embodiment utilizes temporal correlation between pictures included in input video data as shown in FIG. Predict the amount of each data. However, if a scene change occurs at the timing shown in FIG. 7B, there is no correlation between pictures before and after the scene change, and therefore, as shown in FIG. It becomes possible to calculate the target amount of data T _j for pictures after the scene change based on the previous difficulty data, not only it is impossible to obtain the effect of the prediction simplified two pass encoding system shown in the second embodiment, rather However, there is a possibility that the quality of the video after the decompression decoding is deteriorated.

【００６９】つまり、具体例を挙げると、予測簡易２パ
スエンコード方式において、絵柄が簡単なシーンが入力
されている間にシーンチェンジが生じ、絵柄が難しいシ
ーンに代わった場合、ホストコンピュータ２０は、シー
ンチェンジ後も、入力される編集映像データの難度デー
タの値を小さく予測するにも関わらず、実際には、絵柄
が難しいピクチャーが入力され、後のシーンの各ピクチ
ャーに割り当てるデータ量が不足してしまう。このよう
に、割り当てるデータ量が不足した場合、シーンチェン
ジ部分の圧縮映像データに著しい符号化歪みが生じ、伸
長復号して得られる映像の品質が著しく低下してしま
う。That is, to give a specific example, in the predictive simple two-pass encoding method, when a scene change occurs while a scene with a simple pattern is input and the scene is replaced with a scene with a difficult pattern, the host computer 20 Even after the scene change, despite the fact that the value of the difficulty data of the input edited video data is predicted to be small, a picture with a difficult picture is actually input, and the amount of data allocated to each picture in the subsequent scene is insufficient. Would. As described above, when the data amount to be allocated is insufficient, remarkable coding distortion occurs in the compressed video data in the scene change portion, and the quality of the video obtained by decompression decoding is significantly reduced.

【００７０】第３の実施形態に示す予測簡易２パスエン
コード方式（改良予測簡易２パスエンコード方式）は、
かかる観点からなされたものであって、シーンチェンジ
の前後等において編集映像データの時間的な相関性が失
われた場合に、編集映像データの時間的な相関性が失わ
れた部分に生じる難度データの予測に基づくデータ量の
割り当てに起因する悪影響を除去し、さらに、シーンチ
ェンジ直後のピクチャーに割り当てる符号量を精度よく
予測し、効率的な圧縮符号化を行うことを目的とする。The simplified prediction two-pass encoding method (improved simplified prediction two-pass encoding method) shown in the third embodiment is as follows.
From such a viewpoint, when the temporal correlation of the edited video data is lost before and after a scene change, etc., difficulty data generated in a portion where the temporal correlation of the edited video data is lost It is an object of the present invention to eliminate an adverse effect caused by the data amount allocation based on the prediction of the above, further accurately predict the code amount to be allocated to the picture immediately after the scene change, and perform efficient compression encoding.

【００７１】この目的を達成するために、改良予測簡易
２パスエンコード方式は、第２の実施形態に示した映像
データ圧縮装置１（図１）を用いた予測簡易２パスエン
コード方式を改良し、シーンチェンジを検出し、圧縮映
像データのピクチャーに割り当てるデータ量の算出に用
いることができなくなったシーンチェンジ前の実難度デ
ータではなく、シーンチェンジ後に求めた実難度データ
を用いて、可能な限り正確に、その後の所定数のピクチ
ャーの難度を予測する。In order to achieve this object, the improved simplified predictive two-pass encoding method is to improve the simplified predictive two-pass encoding method using the video data compression device 1 (FIG. 1) shown in the second embodiment. Detects scene changes and uses the actual difficulty data obtained after the scene change instead of the actual difficulty data before the scene change, which can no longer be used to calculate the amount of data allocated to pictures of compressed video data. Then, the difficulty of a predetermined number of pictures is predicted.

【００７２】まず、図８および図９を参照して、改良予
測簡易２パスエンコード方式を概念的に説明する。図８
（Ａ）〜（Ｃ）は、エンコーダ制御部１２（図１）によ
る編集映像データのピクチャーの順序の入れ替え処理、
および、ホストコンピュータ２０によるピクチャーの種
類（ピクチャータイプ）の変更処理を示す図である。図
９は、編集映像データのシーンチェンジ部分付近の実難
度データの値の経時的な変化を例示する図である。な
お、図９において、Ｉピクチャー、Ｐピクチャーおよび
Ｂピクチャーは、編集映像データを圧縮符号化した後の
ピクチャータイプを示す。First, the improved prediction simple two-pass encoding method will be conceptually described with reference to FIGS. FIG.
(A) to (C) show a process of changing the order of pictures of edited video data by the encoder control unit 12 (FIG. 1).
FIG. 9 is a diagram illustrating a process of changing a picture type (picture type) by the host computer 20. FIG. 9 is a diagram exemplifying a change with time of the value of the actual difficulty data near the scene change portion of the edited video data. In FIG. 9, I picture, P picture, and B picture indicate the picture types after the edited video data is compression-encoded.

【００７３】編集映像データのシーンチェンジが圧縮符
号化後にＰピクチャーとなるピクチャー（以下、「圧縮
符号化後にＰピクチャーとなるピクチャー」等を、単に
「Ｐピクチャー」等とも記す）で生じると、エンコーダ
制御部１２（図１）が、図８（Ａ），（Ｂ）に示すよう
に編集映像データのピクチャーの順序を並び替えた映像
データＳ１２からエンコーダ１６２およびホストコンピ
ュータ２０が生成する実難度データＤ_jの値は、例え
ば、図９に示すように変化する。つまり、シーンチェン
ジの直後、編集映像データの先頭のＰピクチャーの実難
度データＤ_jは、このピクチャーから生成される圧縮映
像データのＰピクチャーが、前方のピクチャーを参照す
ることができないため増加し、Ｉピクチャーとほぼ、同
様の処理によって生成されることになる。従って、シー
ンの先頭のＰピクチャーの実難度データＤ_jの値は、例
えば、Ｉピクチャーの難度データＤ_jと同程度の値にな
る。When a scene change of edited video data occurs in a picture that becomes a P picture after compression encoding (hereinafter, a "picture that becomes a P picture after compression encoding" or the like is also simply referred to as a "P picture"). As shown in FIGS. 8A and 8B, the control unit 12 (FIG. 1) executes the actual difficulty data D generated by the encoder 162 and the host computer 20 from the video data S12 in which the order of the pictures of the edited video data is rearranged. The value of _j changes, for example, as shown in FIG. That is, immediately after the scene change, the real difficulty data D _j of the P picture of the head of the edited video data, P-picture of the compressed video data generated from the picture is increased because it is not possible to refer to the front of the picture, It is generated by the same processing as that of the I picture. Therefore, the value of the actual difficulty data D _j of the P picture at the head of the scene is, for example, about the same as the difficulty data D _j of the I picture.

【００７４】従って、ホストコンピュータ２０は、エン
コーダ１６２が生成する圧縮映像データのピクチャータ
イプシーケンスに基づいて、実難度データＤ_jの値の経
時的な変化を監視し、例えば、Ｐピクチャーの実難度デ
ータＤ_jの値が、直前のＰピクチャーの実難度データＤ
_jの１．５倍以上になった場合、直前のＩピクチャーの
実難度データＤ_jの０．７倍以上になった場合、あるい
は、第２の実施形態に示した予測簡易２パスエンコード
方式においてと同じ方法でホストコンピュータ２０が予
測した値に比べ、実際の実難度データの値が１．５倍以
上になった場合に、そのＰピクチャーに対応する編集映
像データのピクチャーでシーンチェンジが生じたと判断
することができる。[0074] Therefore, the host computer 20, based on the picture type sequence of compressed video data encoder 162 produces monitors the temporal change in the value of the real difficulty data D _j, for example, the real difficulty data of the P picture The value of D _j is the actual difficulty data D of the immediately preceding P picture.
_j , 1.5 times or more, the actual difficulty data D _j of the immediately preceding I picture, 0.7 times or more, or in the predictive simple two-pass encoding method shown in the second embodiment. If the value of the actual difficulty data is 1.5 times or more as compared with the value predicted by the host computer 20 in the same manner as above, it is determined that a scene change has occurred in the picture of the edited video data corresponding to the P picture. You can judge.

【００７５】しかしながら、編集映像データのシーンチ
ェンジが圧縮符号化後にＩピクチャーとなるピクチャー
で生じると、ホストコンピュータ２０が生成する実難度
データＤ_jの値はほとんど変化しないことがあり、逆
に、シーンチェンジ後の編集映像データの絵柄が単純な
場合等には、かえって、実難度データＤ_jの値が減少す
る可能性がある。また、シーンチェンジ前の編集映像デ
ータの絵柄が複雑で、シーンチェンジ後の編集映像デー
タの絵柄が平坦である場合、あるいは、シーンチェンジ
前後の編集映像データに非常に動きが大きい場合等に
は、Ｐピクチャーの実難度データＤ_jの値が顕著に増加
しない場合がある。しかしながら、事実上、シーンチェ
ンジの直後は後方のピクチャーのみしか参照できないの
で、シーンチェンジ直後のＢピクチャーの実難度データ
Ｄ_jの値は、Ｐピクチャーの実難度データＤ_jの値と同
程度にまで増大する。[0075] However, when a scene change of edited video data is generated in a picture which becomes the I picture after compression coding, the value of the real difficulty data D _j by the host computer 20 to generate is sometimes hardly changes. Conversely, the scene edited when the video data pattern is simple in the like after change, rather, there is a possibility that the value of the real difficulty data D _j is reduced. Also, when the pattern of the edited video data before the scene change is complicated and the pattern of the edited video data after the scene change is flat, or when the edited video data before and after the scene change has a very large movement, the value of the real difficulty data D _j of the P-picture may not increase significantly. However, practically, because immediately after the scene change can not be referred to only a rear picture, the value of the real difficulty data D _j of the B-picture immediately after the scene change, to the same extent as the value of the real difficulty data D _j of the P picture Increase.

【００７６】従って、ホストコンピュータ２０は、実難
度データＤ_jの値の経時的な変化を監視し、例えば、Ｂ
ピクチャーの実難度データＤ_jの値が、直前のＢピクチ
ャーの実難度データＤ_jの１．５倍以上になった場合、
あるいは、予測した値と比べ実際の実難度データＤ_jの
値が１．５倍以上になった場合に、そのＢピクチャーの
直前のＩピクチャーおよびＰピクチャーに対応する編集
映像データのピクチャーでシーンチェンジが生じたと判
断することができる。なお、Ｐピクチャーの実難度デー
タＤ_jの変化に基づいてシーンチェンジを検出する方
法、および、Ｂピクチャーの実難度データＤ_jの変化に
基づいてシーンチェンジを検出する方法を併用すること
により、ホストコンピュータ２０は、シーンチェンジの
検出を確実に行うことができる。[0076] Therefore, the host computer 20 monitors the temporal change in the value of the real difficulty data D _j, for example, B
When the value of the actual difficulty data D _j of the picture is 1.5 times or more the actual difficulty data D _j of the immediately preceding B picture,
Alternatively, if the actual value of the real difficulty data D _j compared with predicted values is equal to or greater than 1.5 times, scene change picture of edited video data corresponding to the I-picture and P-picture immediately before the B-picture Can be determined to have occurred. By using a method of detecting a scene change based on a change in the actual difficulty data D _j of the P picture and a method of detecting a scene change based on a change in the actual difficulty data D _j of the B picture in combination, The computer 20 can reliably detect a scene change.

【００７７】一方、シーンチェンジの発生により、編集
映像データのシーンチェンジ以前のピクチャーとシーン
チェンジ以降のピクチャーの相関性はなくなるので、第
２の実施形態に示した予測簡易２パスエンコード方式に
おけるシーンチェンジ以前の実難度データＤ_jを用い
た、シーンチェンジ以降のピクチャーに対する予測難度
データＤ’_jは意味を有さなくなる。しかしながら、編
集映像データのシーンチェンジ直後の数枚のピクチャー
は、それ以降のピクチャーと充分な相関性を有し、従っ
て、シーンチェンジ直後の数枚のピクチャーの実難度デ
ータＤ_jに基づいて、それ以降の所定枚数のピクチャー
の難度データＤ_jの値を予測することが可能である。On the other hand, when a scene change occurs, the correlation between the picture before the scene change of the edited video data and the picture after the scene change is lost. Predicted difficulty data D ′ _j for the picture after the scene change using the previous actual difficulty data D _j has no meaning. However, the number of sheets of pictures immediately after a scene change of the edited video data has a subsequent picture and a sufficient correlation, therefore, based on the real difficulty data D _j of the number of sheets of pictures immediately after a scene change, it it is possible to predict the value of the difficulty data D _j of the picture after a predetermined number.

【００７８】さらに、第２の実施形態に示した予測簡易
２パスエンコード方式においては、式４に示したように
目標データ量Ｔ_jを算出する。従って、目標データ量Ｔ
_jを算出するためには、下に示す式５において定義され
る総和値Ｓｕｍ_jを用いればよく、必ずしも個々の予測
難度データＤ’_jを求める必要はない。Further, in the simple predictive two-pass encoding method shown in the second embodiment, the target data amount _Tj is calculated as shown in Expression 4. Therefore, the target data amount T
_In order to calculate _j , it is sufficient to use the sum value Sum _j defined in Expression 5 shown below, and it is not always necessary to calculate individual prediction difficulty data D ′ _j .

【００７９】[0079]

【数５】 (Equation 5)

【００８０】式５において定義した総和値Ｓｕｍ_jを用
いると、式４は、下に示す式６に書き換えることができ
る。Using the sum Sum _j defined in Equation 5, Equation 4 can be rewritten as Equation 6 shown below.

【００８１】[0081]

【数６】 (Equation 6)

【００８２】つまり、ホストコンピュータ２０は、個々
の予測難度データＤ’_jではなく、総和値Ｓｕｍ_jを予
測することができさえすれば、目標データ量Ｔ_jを算出
することができる。[0082] That is, the host computer 20, rather than the individual predictive difficulty data D _'j, if only able to predict the sum value Sum _j, it is possible to calculate the target amount of data T _j.

【００８３】第３の実施形態における改良予測簡易２パ
スエンコード方式において、ホストコンピュータ２０
は、シーンチェンジ直後に生成した実難度データＤ_jに
基づいて総和値Ｓｕｍ_jを予測し、予測した総和値Ｓｕ
ｍ_jに基づいて、目標データ量Ｔ_jを精度よく算出す
る。続いて所定数の編集映像データのピクチャーが入力
される間、ホストコンピュータ２０は、その後に生成し
た実難度データＤ_jに基づいて、総和値Ｓｕｍ_jの値を
順次、補正する。さらに、ホストコンピュータ２０は、
シーンチェンジ以降、さらに所定数のピクチャーが入力
され、充分な数の実難度データＤ_jを生成した後には、
第２の実施形態に示した予測簡易２パスエンコード方式
においてと同じ方法により、目標データ量Ｔ_jを生成す
る。In the improved prediction simple two-pass encoding method according to the third embodiment, the host computer 20
Predicts the sum value Sum _j based on the actual difficulty data D _j generated immediately after the scene change, and calculates the predicted sum value Su.
Based on m _j , the target data amount T _j is accurately calculated. Subsequently, while a predetermined number of pictures of edited video data are input, the host computer 20 sequentially corrects the value of the sum Sum _j based on the actual difficulty data D _j generated thereafter. Further, the host computer 20
Scene change subsequent further input a predetermined number of picture, after generating the real difficulty data D _j of sufficient number,
The target data amount _Tj is generated by the same method as in the simple predictive two-pass encoding method shown in the second embodiment.

【００８４】次に、第３の実施形態における映像データ
圧縮装置１（図１）の動作を説明する。なお、説明の簡
略化のために、第３の実施形態においても、図７に示し
たように、映像データ圧縮装置１は、第２の実施形態に
おいてと同じピクチャータイプシーケンス（Ｎ＝１５，
Ｍ＝３；Ｎは１ＧＯＰに含まれるピクチャー数、ＭはＰ
ピクチャーの間のＢピクチャー数）に編集映像データを
圧縮符号化し、第２の実施形態においてと同様に、１５
個のピクチャーの実難度データＤ_jから、次の１５個の
ピクチャーの予測難度データＤ’_jを生成する場合を例
に説明する。Next, the operation of the video data compression device 1 (FIG. 1) in the third embodiment will be described. For simplicity of description, also in the third embodiment, as shown in FIG. 7, the video data compression device 1 uses the same picture type sequence (N = 15,
M = 3; N is the number of pictures included in one GOP, M is P
The edited video data is compression-encoded to (the number of B pictures between pictures), and is 15 bits as in the second embodiment.
An example will be described in which predicted difficulty data D ′ _j of the next 15 pictures are generated from actual difficulty data D _j of the pictures.

【００８５】エンコーダ制御部１２は、第１の実施形態
および第２の実施形態においてと同様の処理を行い、例
えば、図８（Ａ）に示したピクチャータイプシーケンス
で入力される非圧縮映像データのピクチャーの順番を、
図８（Ｂ）に示すように、エンコーダ１６２およびエン
コーダ１８における圧縮符号化に適した順番、つまり、
Ｂピクチャーが直後のＩピクチャーまたはＰピクチャー
の後ろになる順番に入れ替えて、映像データＳ１２とし
てエンコーダ１６２およびＦＩＦＯメモリ１６０に対し
て出力する。従って、例えば、図８（Ａ）に示したよう
に、第１のシーンのデータと第２のシーンのデータとの
間のシーンチェンジがＢピクチャーに圧縮符号化される
べきピクチャーであっても、エンコーダ１６２およびエ
ンコーダ１８に入力される後ろのシーンの最初のピクチ
ャータイプは必ずＰピクチャーまたはＩピクチャーにな
る。ＦＩＦＯメモリ１６０は、第１の実施形態および第
２の実施形態においてと同様に、例えば、入力される編
集映像データを１５ピクチャー分、遅延してエンコーダ
１８に対して出力する。The encoder control unit 12 performs the same processing as in the first and second embodiments, for example, for the non-compressed video data input in the picture type sequence shown in FIG. Change the picture order
As shown in FIG. 8B, an order suitable for compression encoding in the encoder 162 and the encoder 18, that is,
The B picture is rearranged in the order following the immediately following I picture or P picture, and is output to the encoder 162 and the FIFO memory 160 as video data S12. Therefore, for example, as shown in FIG. 8A, even if a scene change between the data of the first scene and the data of the second scene is a picture to be compression-encoded into a B picture, The first picture type of the subsequent scene input to the encoder 162 and the encoder 18 is always a P picture or an I picture. The FIFO memory 160 outputs the input edited video data to the encoder 18 with a delay of 15 pictures, for example, as in the first and second embodiments.

【００８６】エンコーダ１６２は、第１の実施形態およ
び第２の実施形態においてと同様に、シーンチェンジの
有無にかかわらず、映像データＳ１２をピクチャータイ
プシーケンスＩ，Ｂ，Ｂ，Ｐ，Ｂ，Ｂ，Ｐ，Ｂ，Ｂ，
Ｐ，Ｂ，Ｂ，Ｐ，Ｂ，Ｂ，Ｐ，Ｂ，Ｂで圧縮符号化し、
実難度データＤ_jを生成してホストコンピュータ２０に
対して出力する。エンコーダ１６２が生成する実難度デ
ータＤ_jの値の経時的な変化は、例えば、図９に示した
ようになり、一般的に、シーンチェンジが発生した直後
の後ろのシーンの最初のＰピクチャーの実難度データの
値は、他のＰピクチャーの実難度データの値と比べて大
きくなる。The encoder 162 converts the picture data S12 into the picture type sequence I, B, B, P, B, B, irrespective of the presence or absence of a scene change, as in the first and second embodiments. P, B, B,
P, B, B, P, B, B, P, B, B
And outputs to the host computer 20 to generate the real difficulty data D _j. Temporal change in the value of the real difficulty data D _j of the encoder 162 generates, for example, is as shown in FIG. 9, generally behind just after the scene change occurs scene of the first P-picture The value of the actual difficulty data is larger than the values of the actual difficulty data of the other P pictures.

【００８７】ホストコンピュータ２０は、エンコーダ１
６２から入力される実難度データの値の経時的な変化を
監視し、第３の実施形態において上述したように、実難
度データＤ_jの値が、直前のＰピクチャーの実難度デー
タＤ_j-1の、例えば１．５倍（実用的には１．４倍〜
１．８倍の間の値とすると好適）以上の値を示すＰピク
チャーを検出する等の方法によりＰピクチャーでシーン
チェンジが発生したことを判断する。シーンチェンジを
検出した場合、ホストコンピュータ２０はさらに、図８
（Ｃ）に示したように、後ろのシーンの最初のＰピクチ
ャーを前のシーンの最後のピクチャーを参照しないＩピ
クチャーに変更し、前のシーンの最後のＩピクチャーを
Ｐピクチャーに変更するように、エンコーダ１８を制御
して編集映像データのシーンチェンジの前後の部分を圧
縮符号化する際のピクチャータイプシーケンスを変更さ
せる。The host computer 20 includes the encoder 1
Monitoring the temporal change in the value of the real difficulty data input from 62, as described above in the third embodiment, the value of the real difficulty data D _j is the real difficulty data D immediately before the P-picture _{j- 1} , for example 1.5 times (practically 1.4 times ~
It is preferable to set a value between 1.8 times.) It is determined that a scene change has occurred in the P picture by a method such as detecting a P picture showing the above value. When a scene change is detected, the host computer 20 further transmits the information shown in FIG.
As shown in (C), the first P picture of the subsequent scene is changed to an I picture that does not refer to the last picture of the previous scene, and the last I picture of the previous scene is changed to a P picture. , And controls the encoder 18 to change the picture type sequence when the part before and after the scene change of the edited video data is compression-encoded.

【００８８】なお、シーンチェンジが生じてもＩピクチ
ャー自体のデータ量には大きな変化は生じるとは限らな
い。しかし、ホストコンピュータ２０は、第３の実施形
態において上述したように、Ｂピクチャーの実難度デー
タの値の経時的な変化を監視し、例えば、直前のＢピク
チャーの実難度データの１．５倍の値の実難度データを
有するＢピクチャーを検出する等の方法により、Ｉピク
チャーでシーンチェンジが生じたことを判断することが
できる。It should be noted that even if a scene change occurs, a large change does not always occur in the data amount of the I picture itself. However, as described above in the third embodiment, the host computer 20 monitors a temporal change in the value of the actual difficulty data of the B picture, and for example, 1.5 times the actual difficulty data of the immediately preceding B picture. It is possible to determine that a scene change has occurred in the I picture by a method such as detecting a B picture having the actual difficulty data of the value.

【００８９】図１０は、ホストコンピュータ２０が、編
集映像データにシーンチェンジが発生する場合に、実難
度データＤ₁〜Ｄ₁₅に基づいて予測難度データＤ’₁₆〜
Ｄ’ ₃₀を算出する方法、および、編集映像データにシー
ンチェンジが発生しない場合の予測難度データＤ’₁₆〜
Ｄ’₃₀を算出する方法を示す図である。ホストコンピュ
ータ２０は、編集映像データにシーンチェンジが発生し
ない場合には、エンコーダ１６２から得られたデータか
ら、図１０中に○印で示す実難度データＤ₁〜Ｄ₁₅を生
成し、生成した実難度データＤ₁〜Ｄ₁₅に基づいて、図
１０中に×印で示す予測難度データＤ’₁₆〜Ｄ’₃₀をピ
クチャーの種類（ピクチャータイプ）ごとに算出する。FIG. 10 shows that the host computer 20
Real difficulty when scene change occurs in the collected video data
Degree data D₁~ D_FifteenBased on the prediction difficulty data D '₁₆~
D ' ₃₀And how to calculate the
Prediction difficulty data D 'when no change occurs₁₆~
D '₃₀It is a figure showing the method of calculating. Host computer
Data 20 indicates that a scene change has occurred in the edited video data.
If not, the data obtained from encoder 162
The actual difficulty data D indicated by a circle in FIG.₁~ D_FifteenRaw
Actual difficulty data D₁~ D_FifteenBased on the figure
Predicted difficulty data D 'indicated by a cross in 10₁₆~ D '₃₀The
It is calculated for each type of picture (picture type).

【００９０】つまり、編集映像データにシーンチェンジ
が発生しない場合には、ホストコンピュータ２０は、Ｂ
ピクチャーの実難度データＤ₂，Ｄ₃，…，Ｄ₁₃，Ｄ₁₄
の値を、図１０中の点線Ａで直線近似して外挿し、Ｂピ
クチャーの予測難度データＤ’₁₆，Ｄ’₁₇，…，
Ｄ’₂₉，Ｄ’₃₀を生成し、Ｉピクチャーの実難度データ
Ｄ₄、および、必要に応じてこれ以前のＩピクチャーの
実難度データＤ_jの値を直線近似して外挿し、Ｉピクチ
ャーの予測難度データＤ’₁₈を生成し、Ｐピクチャーの
実難度データＤ₁，Ｄ₇，…，Ｄ₁₂、および、必要に応
じてこれ以前のＰピクチャーの実難度データＤ_jの値を
直線近似して外挿し、Ｐピクチャーの予測難度データ
Ｄ’₁₅，Ｄ’₂₁，…，Ｄ’₂₇を生成する。さらに、ホス
トコンピュータ２０は、これらの実難度データＤ_jおよ
び予測難度データＤ’_jを用いて、第２の実施形態に示
した予測簡易２パス方式により目標データ量Ｔ_jを算出
する。That is, when no scene change occurs in the edited video data, the host computer 20
Picture actual difficulty data D ₂ , D ₃ ,..., D ₁₃ , D ₁₄
The values, extrapolated linearly approximated by the dotted line A in FIG. 10, B-picture predictive difficulty data _{_{D '16, D' 17,}} ...,
D ′ ₂₉ and D ′ ₃₀ are generated, and the actual difficulty data D ₄ of the I picture and, if necessary, the values of the actual difficulty data D _j of the previous I picture are extrapolated by linear approximation to obtain the I picture. Predicted difficulty data D ′ ₁₈ is generated, and the actual difficulty data D ₁ , D ₇ ,..., D ₁₂ of the P picture and, if necessary, the actual difficulty data D _j of the previous P picture are linearly approximated. extrapolated Te, predictive difficulty data D _'15, D' ₂₁ P-picture, ..., to produce a D _'27. Further, the host computer 20 uses these real difficulty data D _j and the predicted difficulty data D _'j, the predicted simplified two pass method shown in the second embodiment calculates the target amount of data T _j.

【００９１】以下、ホストコンピュータ２０が、Ｐピク
チャーで編集映像データのシーンチェンジを検出した場
合の処理内容を、段階に分けて説明する。第１段階ホストコンピュータ２０が、Ｐピクチャーでシーンチェ
ンジが発生したことを検出した場合、図１０中に●で示
すＰピクチャーの実難度データＤ₁₅のみからでは、ピク
チャー間の動きの量等によって左右されるＢピクチャー
およびＰピクチャーの難度を予測することができない。
そこで、ホストコンピュータ２０は、予め実験等により
求められたＩピクチャー、ＰピクチャーおよびＢピクチ
ャーの実難度データの値の比率（ｉ：ｐ：ｂ）を用い
て、式５に定義した総和値Ｓｕｍ_jを求める。Hereinafter, the processing content when the host computer 20 detects a scene change of edited video data in a P picture will be described in stages. The first step host computer 20, when it is detected that scene change P picture is generated, from only the real difficulty data D ₁₅ of the P-picture indicated by ● in Fig. 10, left and right by the amount or the like of the motion between the picture It is impossible to predict the difficulty of the B picture and the P picture to be performed.
Therefore, the host computer 20 uses the ratio (i: p: b) of the values of the actual difficulty data of the I picture, the P picture, and the B picture obtained in advance by an experiment or the like to calculate the total sum Sum _j defined in Expression 5. Ask for.

【００９２】つまり、ホストコンピュータ２０は、第ｊ
＋１番目（図１０においてはｊ＝１）のピクチャーに対
する目標データ量を算出するために、例えば、下に示す
予め求めたＩピクチャー、ＰピクチャーおよびＢピクチ
ャーの実難度データの値の比率（ｉ：ｐ：ｂ）を用いた
式７に、シーンチェンジが生じたＰピクチャーの実難度
データＤ_j+15を代入して、第（ｊ＋１）番目のピクチャ
ーに対する目標データ量Ｔ_j+1の算出に用いる総和値Ｓ
ｕｍ_j+1を予測し、さらに、予測した総和値Ｓｕｍ_j+1
を式４に代入して、第（ｊ＋１）番目のピクチャーに対
する目標データ量Ｔ_j+1を算出する。That is, the host computer 20 executes the j-th
In order to calculate the target data amount for the + 1st (j = 1 in FIG. 10) picture, for example, the ratio of the previously obtained I, P, and B picture actual difficulty data values (i: Substituting the actual difficulty data D _{j + 15} of the P picture in which the scene change has occurred into Equation 7 using p: b), and using it for calculating the target data amount T _{j + 1} for the (j + 1) th picture Sum S
um _{j + 1} is predicted, and the predicted sum value Sum _{j + 1 is} further predicted.
Is substituted into Equation 4 to calculate a target data amount T _{j + 1} for the (j + 1) -th picture.

【００９３】[0093]

【数７】 (Equation 7)

【００９４】式７においては、シーンチェンジが発生し
たＰピクチャーの実難度データＤ_j+ ₁₅の値が、第３の実
施形態において上述したように、直後のＩピクチャーの
実難度データＤ_j+18と等しいことを前提とし、ホストコ
ンピュータ２０が、予め求めた比率（ｉ：ｐ：ｂ）、お
よび、１ＧＯＰに含まれるＩピクチャー、Ｐピクチャー
およびＢピクチャーの枚数を乗じた係数を、シーンチェ
ンジ後に最初に算出したＰピクチャーの実難度データＤ
_j+15に乗算し、さらに、所定の定数αを加算して総和値
Ｓｕｍ_j+1を算出することを意味している。In the equation 7, the value of the actual difficulty data D _{j +} ₁₅ of the P picture in which the scene change has occurred is, as described above in the third embodiment, the value of the actual difficulty data D _{j + 18} of the immediately succeeding I picture. On the premise that they are equal, the host computer 20 first calculates the ratio (i: p: b) determined in advance and the coefficient multiplied by the number of I-pictures, P-pictures, and B-pictures included in one GOP after the scene change. Actual difficulty data D of the calculated P picture
_This means that the sum Sum _{j + 1} is calculated by multiplying _{j + 15} and further adding a predetermined constant α.

【００９５】なお、式７においては、定数αは、実験等
により予め求められる所定の値をとり、図１０中の第
（ｊ＋１５）番目のＰピクチャーの直後、つまり、シー
ンチェンジ直後の第（ｊ＋１６）番目および第（ｊ＋１
７）番目のＢピクチャーが、前方予測または後方予測の
みにより生成されるために、他のＢピクチャーに比べて
データ量が多いことを見越したマージンとしての意味を
有する。In equation (7), the constant α takes a predetermined value obtained in advance by an experiment or the like, and is immediately after the (j + 15) th P picture in FIG. ) -Th and (j + 1) -th
7) Since the B-th picture is generated only by forward prediction or backward prediction, it has a meaning as a margin in anticipation that the data amount is larger than other B pictures.

【００９６】ホストコンピュータ２０が、式７により求
めた総和値Ｓｕｍ_jを用いて、第（ｊ＋１５）番目〜第
（ｊ＋３０）番目の難度データの直線予測を変更したと
仮定すると、予測難度データＤ’_j+15〜Ｄ’_j+30の値
は、シーンチェンジにより増加し、図１０中に点線Ｂで
示した値になる。ただし、目標データ量Ｔ_jの算出のた
めには総和値Ｓｕｍ_jの値のみを予測すればよく、ま
た、後述するように、定数αの値は、第（ｊ＋２）番目
のピクチャーに対する総和値Ｓｕｍ_j+1を算出する際に
補正されるので、ホストコンピュータ２０は、シーンチ
ェンジが発生しない場合と異なり、シーンチェンジが発
生した場合、難度データの予測をピクチャーの種類（ピ
クチャータイプ）別に敢えて行わない。Assuming that the host computer 20 changes the linear prediction of the (j + 15) -th to (j + 30) -th difficulty data using the sum value Sum _j obtained by the equation 7, the prediction difficulty data D ′ The values of _{j + 15 to} D' _{j + 30} increase due to the scene change and become the values indicated by the dotted line B in FIG. However, in order to calculate the target data amount T _j , only the value of the sum Sum _j needs to be predicted. As described later, the value of the constant α is the sum Sum Sum for the (j + 2) -th picture. Since the correction is made when calculating _{j + 1} , unlike the case where no scene change occurs, the host computer 20 does not dare to predict difficulty data for each picture type (picture type) when a scene change occurs. .

【００９７】第２段階ホストコンピュータ２０が、第（ｊ＋２）番目のピクチ
ャーに対する目標データ量Ｔ_j+2を算出する際には、第
（ｊ＋１６）番目のＢピクチャーの実難度データＤ_j+16
が算出されている。図１０に示した例においては、第
（ｊ＋１６）番目のＢピクチャーは、後ろのシーンに属
するが、図８（Ａ），（Ｂ）に示したように、エンコー
ダ制御部１２がピクチャーの順序を入れ替えているた
め、第（ｊ＋１６）番目のＢピクチャーが、前のシーン
に属している可能性があり、また、前方予測または後方
予測のみにより生成されているため、ホストコンピュー
タ２０は、第（ｊ＋１６）番目のＢピクチャーの実難度
データＤ_j+16を、第（ｊ＋２）番目のピクチャーに対す
る目標データ量Ｔ_j+2を算出する際の総和値Ｓｕｍ_j+2
の予測に用いることはできない。[0097] The second stage host computer 20, the (j + 2) th when calculating the target amount of data T _{j + 2} are for picture, the (j + 16) th B picture real difficulty data D _{j + 16}
Is calculated. In the example shown in FIG. 10, the (j + 16) -th B picture belongs to the subsequent scene, but as shown in FIGS. 8A and 8B, the encoder control unit 12 changes the order of the pictures. Since it has been replaced, the (j + 16) -th B picture may belong to the previous scene, and is generated only by forward prediction or backward prediction. ) The actual difficulty data D _{j + 16} of the B-picture and the sum Sum _{j + 2} for calculating the target data amount T _{j + 2} for the (j + 2) -th picture
Cannot be used to predict

【００９８】しかしながら、式７において、定数αとし
てマージンを考慮した２枚のＢピクチャーの内の最初の
１枚のＢピクチャーの実難度データＤ_j+16の値を用い
て、式７の定数αを補正することは可能である。そこ
で、ホストコンピュータ２０は、下に式８として示すよ
うに、式７の定数αを、実難度データＤ_j+16に基づいて
補正して定数α’を算出し、さらに精度が高い総和値Ｓ
ｕｍ_j+2を予測することができる。ホストコンピュータ
２０は、予測した総和値Ｓｕｍ_j+2を式４に代入して、
第（ｊ＋２）番目のピクチャーに対する目標データ量Ｔ
_j+2を算出する。However, in equation (7), using the value of the actual difficulty data D _{j + 16} of the first one of the two B pictures in consideration of the margin as the constant α, the constant α in equation (7) Can be corrected. Therefore, the host computer 20 calculates the constant α ′ by correcting the constant α in Expression 7 based on the actual difficulty data D _{j + 16} as shown in Expression 8 below, and further calculates the sum S
um _{j + 2} can be predicted. The host computer 20 substitutes the predicted sum value Sum _{j + 2} into Expression 4, and
Target data amount T for the (j + 2) th picture
Calculate _{j + 2} .

【００９９】[0099]

【数８】 (Equation 8)

【０１００】第３段階ホストコンピュータ２０が、第（ｊ＋３）番目のピクチ
ャーに対する目標データ量Ｔ_j+3を算出する際には、第
（ｊ＋１７）番目のＢピクチャーの実難度データＤ_j+17
が算出されている。従って、式７において、定数αとし
てマージンを考慮した２枚のＢピクチャーの両方、つま
り、図８（Ａ）〜（Ｃ）に示したピクチャータイプシー
ケンスにおいて、ＩピクチャーおよびＰピクチャーに挟
まれる１組のＢピクチャー全ての実難度データＤ_j+16，
Ｄ_j+16の値が判明したので、下に式９として示すよう
に、式７の定数αあるいは式８の定数α’は不要にな
る。 Third Stage When the host computer 20 calculates the target data amount T _{j + 3} for the (j + 3) -th picture, the actual difficulty data D _{j + 17 of} the (j + 17) -th B picture is calculated.
Is calculated. Therefore, in Equation 7, both sets of two B pictures in consideration of the margin as the constant α, that is, one set sandwiched between the I picture and the P picture in the picture type sequences shown in FIGS. Actual difficulty data D _{j + 16 for} all B pictures of
Since the value of D _{j + 16} has been found, the constant α in Expression 7 or the constant α ′ in Expression 8 is unnecessary as shown in Expression 9 below.

【０１０１】[0101]

【数９】 (Equation 9)

【０１０２】第４段階ホストコンピュータ２０が、第（ｊ＋４）番目のピクチ
ャーに対する目標データ量Ｔ_j+3を算出する際には、第
（ｊ＋１８）番目のＩピクチャーの実難度データＤ_j+18
が算出されている。この段階で、図１０に示した例にお
いては、シーンチェンジ以降の全ての種類（ピクチャー
タイプ）のピクチャーの実難度データＤ _iの値が判明す
る。そこで、式７〜式９において用いられた予め求めら
れた比率（ｉ：ｐ：ｂ）の値を、ホストコンピュータ２
０が実際に算出したＩピクチャーの実難度データ
Ｄ_j+18、Ｐピクチャーの実難度データＤ_j+15およびＰピ
クチャーの実難度データＤ_j+16（Ｄ_j+17）に置き換える
ことが可能になる。[0102]Fourth stage The host computer 20 receives the (j + 4) th picture
Target data amount T for_{j + 3}When calculating
Actual difficulty data D of the (j + 18) th I picture_{j + 18}
Is calculated. At this stage, the example shown in FIG.
For all types (pictures) after the scene change
Actual difficulty data D of type) picture _iThe value of
You. Therefore, the previously calculated values used in Expressions 7 to 9 are obtained.
The value of the ratio (i: p: b) obtained is
0 is the actual difficulty data of the I picture actually calculated
D_{j + 18}, P picture actual difficulty data D_{j + 15}And P
Kucha's actual difficulty data D_{j + 16}(D_{j + 17})
It becomes possible.

【０１０３】このように、ホストコンピュータ２０は、
予め求めた比率（ｉ：ｐ：ｂ）を、実際の比率
〔Ｄ_j+18：Ｄ_j+15：Ｄ_j+16（Ｄ_j+17）〕に置換した式９
を用いて、さらに精度よく総和値Ｓｕｍ_j+18を予測し、
式４に代入して第（ｊ＋４）番目のピクチャーに対する
目標データ量Ｔ_j+4を算出する。As described above, the host computer 20
Equation 9 in which the ratio (i: p: b) obtained in advance is replaced with the actual ratio [D _{j + 18} : D _{j + 15} : D _{j + 16} (D _{j + 17} )]
, The sum value Sum _{j + 18} is more accurately predicted,
The target data amount T _{j + 4} for the (j + 4) -th picture is calculated by substituting into Equation 4.

【０１０４】第５段階第４段階と同様に、第（ｊ＋５）番目以降の数枚（例え
ば６〜９枚）のピクチャーに対する目標データ量Ｔ_j+3
を算出し、予測難度データＤ’_iの算出に充分な数量の
実難度データＤ_iが得られた後は、ホストコンピュータ
２０は、シーンチェンジが発生しない場合と同様に、直
線近似により予測難度データＤ’_iを算出し、算出した
予測難度データＤ’_iを式４に代入して、目標データ量
Ｔ_iを算出する。 Fifth Step Similarly to the fourth step, the target data amount T _{j + 3} for several (for example, 6 to 9) pictures after the (j + 5) -th picture.
Is calculated, and the host computer 20 obtains the actual difficulty data D _i in a sufficient quantity for the calculation of the predicted difficulty data D ′ _i , and then, as in the case where the scene change does not occur, the predicted difficulty data D _i is obtained by linear approximation. D ′ _i is calculated, and the calculated prediction difficulty data D ′ _i is substituted into Equation 4 to calculate a target data amount T _i .

【０１０５】ホストコンピュータ２０が、第３の実施形
態において上述したように、Ｉピクチャーの実難度デー
タＤ_iの変化に基づいて、Ｉピクチャーでシーンチェン
ジが発生したと判断した場合、Ｐピクチャーでシーンチ
ェンジが発生したと判断した場合と同じ処理、つまり、
上述した第１段階〜第５段階の処理を行うことにより、
各ピクチャーに対する目標データ量Ｔ_iを算出すること
ができる。[0105] The host computer 20 is, as described above in the third embodiment, based on the change in the real difficulty data D _i of I-picture, if it is determined that a scene change has occurred in the I-picture, the scene in P picture The same process as determining that a change has occurred,
By performing the processing of the first to fifth stages described above,
The target data amount T _i for each picture can be calculated.

【０１０６】一方、ホストコンピュータ２０が、第３の
実施形態において上述したように、Ｂチャネルの実難度
データＤ_iの値の変化に基づいて、Ｉピクチャーでシー
ンチェンジが発生したと判断した場合、ホストコンピュ
ータ２０は、Ｐピクチャーでシーンチェンジが発生した
と判断した場合における第１段階または第２段階の処理
を行うことができない。従って、Ｂチャネルの実難度デ
ータＤ_iの値の変化に基づいてＩピクチャーでシーンチ
ェンジが発生したと判断した場合、ホストコンピュータ
２０は、Ｐピクチャーでシーンチェンジが発生したと判
断した場合における第２段階または第３段階の処理を行
い、各ピクチャーに対する目標データ量Ｔ_iを算出す
る。[0106] On the other hand, when the host computer 20, as described above in the third embodiment, based on the change in the value of the real difficulty data D _i of B channels, it is determined that a scene change has occurred in the I-picture, The host computer 20 cannot perform the first-stage or second-stage processing when determining that a scene change has occurred in the P picture. Therefore, if the real difficulty data D _i of a scene change in the I-picture based on the change in the value of the B-channel is determined to have occurred, the host computer 20, a second when it is determined that a scene change occurs in the P picture The processing of the third or third stage is performed to calculate the target data amount T _i for each picture.

【０１０７】以上説明した総和値Ｓｕｍ_iの予測および
目標データ量Ｔ_iの算出に係る処理の内容を、フローチ
ャートを参照して、さらに説明する。図１１および図１
２は、第３の実施形態における改良予測簡易２パスエン
コード方式における総和値Ｓｕｍ_iの予測および目標デ
ータ量Ｔ_iの算出に係る処理内容を示すフローチャート
図である。The processing for predicting the total sum Sum _i and calculating the target data amount T _i described above will be further described with reference to flowcharts. FIG. 11 and FIG.
FIG. 2 is a flowchart showing the processing contents related to the prediction of the sum Sum _i and the calculation of the target data amount T _i in the improved prediction simple two-pass encoding method in the third embodiment.

【０１０８】なお、図１１および図１２において、デー
タＳＣ＿Ｆｌａｇは、過去１５ピクチャー以内にシーン
チェンジが生じている場合にはシーンチェンジの位置を
示し、これ以外の場合には０に設定される。また、デー
タＩ＿Ｆｌａｇの値は、図８（Ａ）〜（Ｃ）に示したピ
クチャータイプシーケンスにおいて、Ｉピクチャーの直
後、３ピクチャーに対する処理が終了するまでは１とな
り、それ以外の場合には０になる。また、係数Ｉｔｈ
１，Ｉｔｈ２，Ｐｔｈ，Ｂｔｈは、シーンチェンジの検
出の際に、それぞれＩピクチャー、Ｐピクチャーおよび
Ｂピクチャーの値を判断するために用いる係数を示す。In FIGS. 11 and 12, the data SC_Flag indicates the position of a scene change when a scene change has occurred within the past 15 pictures, and is set to 0 in other cases. In addition, in the picture type sequences shown in FIGS. 8A to 8C, the value of the data I_Flag is 1 immediately after the I picture and until the processing for 3 pictures is completed, and is 0 otherwise. Become. Also, the coefficient Ith
1, Ith2, Pth, and Bth indicate coefficients used to determine the values of the I picture, P picture, and B picture when detecting a scene change.

【０１０９】図１１に示すように、ステップ１００（Ｓ
１００）において、ホストコンピュータ２０は、エンコ
ーダ１６２から所定のデータを得て、実難度データＤ_i
を生成する。ステップ１０２（Ｓ１０２）において、ホ
ストコンピュータ２０は、データＳＣ＿Ｆｌａｇの値が
０であるか否かを判断する。データＳＣ＿Ｆｌａｇの値
が０である場合にはＳ２００（図１２）の処理に進み、
０でない場合にはＳ１０４の処理に進む。As shown in FIG. 11, step 100 (S
100), the host computer 20 obtains predetermined data from the encoder 162, and obtains the actual difficulty data _Di.
Generate In step 102 (S102), the host computer 20 determines whether or not the value of the data SC_Flag is 0. If the value of the data SC_Flag is 0, the process proceeds to S200 (FIG. 12).
If it is not 0, the process proceeds to S104.

【０１１０】ステップ１０４（Ｓ１０４）において、ホ
ストコンピュータ２０は、第ｉ番目のピクチャーの種類
（ピクチャータイプ）を判断し、第ｉ番目のピクチャー
がＢピクチャー、Ｐピクチャー、Ｉピクチャーである場
合には、それぞれＳ１０６，Ｓ１２０，Ｓ１２８の処理
に進む。ステップ１０６（Ｓ１０６）において、ホスト
コンピュータ２０は、データＩ＿Ｆｌａｇの値が０であ
るか否かを判断する。データＩ＿Ｆｌａｇの値が０であ
る場合にはＳ１１０の処理に進み、０でない場合にはＳ
１０８の処理に進む。ステップ１０８（Ｓ１０８）にお
いて、ホストコンピュータ２０は、Ｂピクチャーの実難
度データＤ_iが予測難度データＤ’_i×Ｂｔｈより大き
いか否かを判断し、大きい場合にはＳ１１２の処理に進
み、小さい場合にはＳ１１０の処理に進む。In step 104 (S104), the host computer 20 determines the type (picture type) of the i-th picture, and if the i-th picture is a B picture, a P picture, or an I picture, The process proceeds to S106, S120, and S128, respectively. In step 106 (S106), the host computer 20 determines whether or not the value of the data I_Flag is 0. If the value of the data I_Flag is 0, the process proceeds to S110; otherwise, the process proceeds to S110.
Proceed to 108. At step 108 (S108), the host computer 20, the real difficulty data D _i of the B picture is determined whether predictive difficulty data D _'i × or Bth larger, the greater the flow proceeds to the processing of S112, if it is smaller The process proceeds to S110.

【０１１１】ステップ１１０（Ｓ１１０）において、ホ
ストコンピュータ２０は、シーンチェンジが発生しない
場合と同じ処理を行って、予測難度データＤ’_iを算出
する。ステップ１１２（Ｓ１１２）において、ホストコ
ンピュータ２０は、データＳＣ＿Ｆｌａｇの値を１にす
る。ステップ１１４（Ｓ１１４）において、ホストコン
ピュータ２０は、第ｉ番目のピクチャーが、シーンチェ
ンジ後の１枚目のＢピクチャーである場合には、式８に
より総和値Ｓｕｍ_iを算出し、シーンチェンジ後の２枚
目のＢピクチャーである場合には、式９により総和値Ｓ
ｕｍ_iを算出する。In step 110 (S110), the host computer 20 performs the same processing as when no scene change occurs, and calculates the predicted difficulty data D′ _i . In step 112 (S112), the host computer 20 sets the value of the data SC_Flag to 1. In step 114 (S114), if the i-th picture is the first B picture after the scene change, the host computer 20 calculates the total sum Sum _i by equation 8, and In the case of the second B picture, the sum S
um _i is calculated.

【０１１２】ステップ１１６（Ｓ１１６）において、ホ
ストコンピュータ２０は、予測した総和値Ｓｕｍ_iまた
は予測難度データＤ’_iを式４に代入して、第ｉ番目の
ピクチャーに対する目標データ量Ｔ_i（target bit) を
算出する。ステップ１１８（Ｓ１１８）において、ホス
トコンピュータ２０は、データｉをインクリメントす
る。In step 116 (S116), the host computer 20 substitutes the predicted sum value Sum _i or the predicted difficulty data D ′ _i into Equation 4 to obtain a target data amount T _i (target bit amount) for the i-th picture. ) Is calculated. In step 118 (S118), the host computer 20 increments the data i.

【０１１３】ステップ１２０（Ｓ２２０）において、ホ
ストコンピュータ２０は、Ｐピクチャーの実難度データ
Ｄ_iが予測難度データＤ’_i×Ｐｔｈより大きいか否か
を判断し、大きい場合にはＳ１２２の処理に進み、小さ
い場合にはＳ１１０の処理に進む。ステップ１２２（Ｓ
１２２）において、ホストコンピュータ２０は、データ
ＳＣ＿Ｆｌａｇにデータｉを代入する。ステップ１２４
（Ｓ１２４）において、ホストコンピュータ２０は、デ
ータＩ＿Ｆｌａｇの値を０にする。ステップ１２６（Ｓ
１２６）において、ホストコンピュータ２０は、式７を
用いて、総和値Ｓｕｍ_iを予測する。[0113] In step 120 (S220), the host computer 20, the real difficulty data D _i of P-picture is determined whether predictive difficulty data D _'i × or Pth larger, the greater the flow proceeds to the processing of S122 If it is smaller, the process proceeds to S110. Step 122 (S
At 122), the host computer 20 substitutes the data i for the data SC_Flag. Step 124
In (S124), the host computer 20 sets the value of the data I_Flag to 0. Step 126 (S
At 126), the host computer 20 predicts the total sum Sum _i using Expression 7.

【０１１４】ステップ１２８（Ｓ２２０）において、ホ
ストコンピュータ２０は、Ｉピクチャーの実難度データ
Ｄ_iが予測難度データＤ’_i×Ｉｔｈ１〜予測難度デー
タＤ’_i×Ｉｔｈ２の範囲外か否かを判断し、範囲外の
場合にはＳ１３０の処理に進み、範囲内の場合にはＳ１
１０の処理に進む。ステップ１３０（Ｓ１３０）におい
て、ホストコンピュータ２０は、データＳＣ＿Ｆｌａｇ
にデータｉを代入する。ステップ１３２（Ｓ１３２）に
おいて、ホストコンピュータ２０は、データＩ＿Ｆｌａ
ｇの値を１にして、Ｓ１２６の処理に進む。[0114] In step 128 (S220), the host computer 20, the real difficulty data D _i of I picture is determined whether outside predictive difficulty data D _'i × Ith1~ predictive difficulty data D' _i × Ith2 If the value is out of the range, the process proceeds to S130.
Proceed to step 10. In step 130 (S130), the host computer 20 transmits the data SC_Flag
Is substituted for data i. In step 132 (S132), the host computer 20 transmits the data I_Fla
The value of g is set to 1, and the process proceeds to S126.

【０１１５】図１２に示すように、ステップ２００（Ｓ
２００）において、ホストコンピュータ２０は、データ
ｉからデータＳＣ＿Ｆｌａｇを減算した値が１，２，３
〜９，９以上である場合にそれぞれ、Ｓ２０２，Ｓ２０
４，Ｓ２０６，Ｓ２１０の処理に進む。ステップ２０２
（Ｓ２０２）において、ホストコンピュータ２０は、式
８により総和値Ｓｕｍ_iを予測し、Ｓ１１６（図１１）
の処理に進む。ステップ２０４（Ｓ２０４）において、
ホストコンピュータ２０は、式９により総和値Ｓｕｍ_i
を予測し、Ｓ１１６（図１１）の処理に進む。As shown in FIG. 12, step 200 (S
200), the host computer 20 determines that the value obtained by subtracting the data SC_Flag from the data i is 1, 2, 3
S202 and S20, respectively, when the number is
The process proceeds to steps S4, S206, and S210. Step 202
In (S202), the host computer 20 predicts the total sum Sum _i by using the equation 8, and S116 (FIG. 11)
Proceed to processing. In step 204 (S204),
The host computer 20 calculates the sum Sum _i
And the process proceeds to S116 (FIG. 11).

【０１１６】ステップ２０６（Ｓ２０６）において、ホ
ストコンピュータ２０は、式９の於ける予め求めた比率
（ｉ：ｐ：ｂ）を、算出した実難度データに置換する。
ステップ２０８（Ｓ２０８）において、ホストコンピュ
ータ２０は、比率（ｉ：ｐ：ｂ）を、算出した実難度デ
ータに置換した式９を用いて、総和値Ｓｕｍ_iを予測す
る。In step 206 (S206), the host computer 20 replaces the ratio (i: p: b) obtained in advance in equation 9 with the calculated actual difficulty data.
In step 208 (S208), the host computer 20 predicts the total sum Sum _i using Expression 9 in which the ratio (i: p: b) is replaced with the calculated actual difficulty data.

【０１１７】ステップ２１０（Ｓ２１０）において、ホ
ストコンピュータ２０は、ピクチャー（ｉ−ＳＣ＿Ｆｌ
ａｇ）枚分の実難度データを用いて、直線近似を行い、
総和値Ｓｕｍ_i（予測難度データＤ’_i）を算出する。
ステップ２１２（Ｓ２１２）において、ホストコンピュ
ータ２０は、（ｉ−ＳＣ＿Ｆｌａｇ）＝１５であるか否
かを判断する。（ｉ−ＳＣ＿Ｆｌａｇ）＝１５である場
合にはＳ２１４の処理に進み、（ｉ−ＳＣ＿Ｆｌａｇ）
＝１５でない場合にはＳ１１０（図１１）の処理に進
む。In step 210 (S210), the host computer 20 sets the picture (i-SC_Fl
ag) A straight line approximation is performed using the actual difficulty data for
The sum Sum _i (predicted difficulty data D ′ _i ) is calculated.
In step 212 (S212), the host computer 20 determines whether (i-SC_Flag) = 15. If (i-SC_Flag) = 15, the process proceeds to S214, and (i-SC_Flag)
If not = 15, the process proceeds to S110 (FIG. 11).

【０１１８】ホストコンピュータ２０は、以上説明した
処理により生成した目標データ量Ｔ _jを、エンコーダ１
８の量子化制御回路１８０に設定する。エンコーダ１８
は、第１の実施形態および第２の実施形態においてと同
様に、ホストコンピュータ２０から設定された目標デー
タ量Ｔ_jに基づいて、図８（Ｃ）に示すように、後ろの
シーンの最初のＰピクチャーが、前のシーンの最後のピ
クチャーを参照しないように、Ｉピクチャーに変更し、
前のシーンの最後のＩピクチャーをＰピクチャーに変更
して圧縮符号化し、圧縮映像データＶＯＵＴとして出力
する。The host computer 20 has been described above.
Target data amount T generated by processing _jAnd encoder 1
8 is set in the quantization control circuit 180. Encoder 18
Is the same as in the first and second embodiments.
The target data set from the host computer 20
Volume T_jBased on the above, as shown in FIG.
The first P picture of the scene is the last P picture of the previous scene.
Change to I-picture so as not to refer to the culture,
Change last I picture of previous scene to P picture
Compression encoding, and output as compressed video data VOUT
I do.

【０１１９】以上、第３の実施形態に示した改良予測簡
易２パスエンコード方式によれば、シーンチェンジやカ
メラフラッシュ等を含む映像データにより多くのデータ
量を割り当てて圧縮符号化可能である上に、シーンチェ
ンジやカメラフラッシュの前後に発生する符号化歪みを
顕著に低減することができる。従って、第３の実施形態
に示した改良予測簡易２パスエンコード方式によって生
成した圧縮映像データを伸長復号して得られる映像の品
質を向上させることができる。As described above, according to the improved predictive simplified two-pass encoding method shown in the third embodiment, a larger amount of data can be allocated to video data including scene changes and camera flashes, and compression encoding can be performed. In addition, encoding distortion occurring before and after a scene change or a camera flash can be significantly reduced. Therefore, it is possible to improve the quality of the video obtained by decompressing and decoding the compressed video data generated by the improved simplified simple two-pass encoding method shown in the third embodiment.

【０１２０】なお、第３の実施形態においては、Ｎ＝１
５，Ｍ＝３のピクチャーシーケンスに対する処理に適合
する式７〜式９を例示したが、式７〜式９を適切に変更
する（式７〜式９中の係数４，１０をピクチャーシーケ
ンスに合わせて変更する）ことにより、他のピクチャー
シーケンスに対しても、改良予測簡易２パスエンコード
を適用することができる。In the third embodiment, N = 1
Equations (7) to (9), which are suitable for the processing for the picture sequence of 5, M = 3, have been exemplified. By doing so, it is possible to apply the improved prediction simple 2-pass encoding to other picture sequences.

【０１２１】第４実施形態以下、本発明の第４の実施形態として、第３の実施形態
に示した改良予測簡易２パスエンコード方式のシーンチ
ェンジ検出方法の変形例を説明する。まず、本発明の第
４の実施形態におけるシーンチェンジ検出方法の原理を
説明する。 Fourth Embodiment Hereinafter, as a fourth embodiment of the present invention, a modified example of the scene change detection method of the improved prediction simple 2-pass encoding method shown in the third embodiment will be described. First, the principle of the scene change detection method according to the fourth embodiment of the present invention will be described.

【０１２２】映像データ圧縮装置１（図１）が、シーン
チェンジ付近の編集映像データから、第２の実施形態お
よび第３の実施形態にそれぞれ示した予測簡易２パスエ
ンコード方式および改良予測簡易２パスエンコード方式
において、映像データのピクチャー間の時間的相関性を
用いて生成される予測難度データＤ_j’は、実難度デー
タＤ_j-1以前の映像データの難度の変化の傾向をよく反
映しており、その実難度データＤ_jとの誤差は、シーン
チェンジがないかぎり非常に少なくなる。例えば、図１
０に示した場合においては、予測難度データＤ₁₆’は、
１５個の実難度データＤ₁〜Ｄ₁₅に基づいて、これらの
１つ先のピクチャーの難度を予測した値であり、シーン
チェンジがない場合には、精度が非常に高いと期待でき
る。The video data compression apparatus 1 (FIG. 1) converts the edited video data in the vicinity of a scene change from the simplified predictive two-pass encoding method and the improved predictive simple two-pass method shown in the second and third embodiments, respectively. In the encoding method, the predicted difficulty data D _j ′ generated by using the temporal correlation between pictures of the video data reflects the tendency of the change in the difficulty of the video data before the actual difficulty data D _j−1. cage, error between the real difficulty data D _j becomes very small unless a scene change. For example, FIG.
In the case of 0, the prediction difficulty data D ₁₆ ′
This is a value obtained by predicting the difficulty of the next picture based on 15 pieces of actual difficulty data D _{1 to} D _15. If there is no scene change, it can be expected that the accuracy is extremely high.

【０１２３】図１３は、シーンチェンジがＰピクチャー
で生じた場合に、その前後における実難度データＤ
_j（○印）と予測難度データＤ’_j（×印）との関係
を、圧縮符号化の順に例示する図である。一方、図１３
に示すように、シーンチェンジがＰピクチャーで生じた
場合、シーンチェンジ直後のＰピクチャーの実難度デー
タＤ_jは、多くの場合、前方のピクチャーを参照した圧
縮符号化ができなくなるために、予測難度データＤ_j’
よりも大幅に大きな値となる。FIG. 13 shows actual difficulty data D before and after a scene change occurs in a P picture.
FIG. 9 is a diagram illustrating the relationship between _j (○) and prediction difficulty data D ′ _j (×) in the order of compression encoding. On the other hand, FIG.
As shown in FIG. 7, when a scene change occurs in a P picture, the actual difficulty data D _j of the P picture immediately after the scene change often becomes impossible to perform compression encoding with reference to the preceding picture. Data D _j '
The value is much larger than

【０１２４】逆に、シーンチェンジ部分のＰピクチャー
の実難度データＤ_jは、例えば、シーンチェンジ前の絵
柄に比べて、シーンチェンジ後の絵柄が平坦である場合
等には、予測難度データＤ_j’よりも大幅に小さな値と
なる場合もある。また、シーンチェンジ直後のＢピクチ
ャーの実難度データＤ_jの値は、後方のピクチャーのみ
を参照して圧縮符号化されるために、予測難度データＤ
_j’に比べて大幅に、例えばＰピクチャー並みに大きく
なる。Conversely, the actual difficulty data D _j of the P picture in the scene change portion is the predicted difficulty data D _j when the pattern after the scene change is flatter than the pattern before the scene change, for example. May be significantly smaller than '. Further, the value of the real difficulty data D _j of the B-picture immediately after the scene change, in order to be compressed and encoded with reference to only the rear of the picture, predictive difficulty data D
It is much larger than _j ′, for example, as large as a P picture.

【０１２５】図１４は、シーンチェンジがＩピクチャー
で生じた場合に、その前後における実難度データＤ
_j（○印）と予測難度データＤ’_j（×印）との関係
を、圧縮符号化の順に例示する図である。また、図１４
に示すように、シーンチェンジが、第ｊ（１６）番目の
Ｉピクチャーで生じた場合、シーンチェンジ前後のＩピ
クチャーには時間的相関関係がないので、シーンチェン
ジ直後のＩピクチャーの予測難度データＤ_j’と実難度
データＤ_jとの間に誤差が生じる。FIG. 14 shows actual difficulty data D before and after a scene change occurs in an I picture.
FIG. 9 is a diagram illustrating the relationship between _j (○) and prediction difficulty data D ′ _j (×) in the order of compression encoding. FIG.
As shown in the figure, when a scene change occurs in the j (16) th I picture, there is no temporal correlation between the I pictures before and after the scene change. error between the _j 'and the real difficulty data D _j occurs.

【０１２６】しかしながら、Ｉピクチャーは、元々、他
のピクチャーを参照せずに圧縮符号化されるので、Ｐピ
クチャーでシーンチェンジが生じた場合に比べて、予測
難度データＤ_j’と実難度データＤ_jとの差は少ない。
一方、シーンチェンジ直後のＢピクチャーの実難度デー
タＤ_jの値は、Ｐフレームでシーンチェンジが生じた場
合と同様に、予測難度データＤ_j’に比べて大幅に大き
くなる。However, since the I picture is originally compressed and encoded without referring to other pictures, the prediction difficulty data D _j ′ and the actual difficulty data D _j ′ are compared with the case where a scene change occurs in the P picture. The difference from _j is small.
On the other hand, the value of the actual difficulty data D _j of the B picture immediately after the scene change is much larger than the predicted difficulty data D _j ′, as in the case where a scene change occurs in the P frame.

【０１２７】このように、ＰピクチャーおよびＩピクチ
ャーの予測難度データＤ_j’と難度データＤ_jの値に大
きな誤差が生じない場合であっても、Ｂピクチャー自体
の予測難度データＤ_j’と難度データＤ_jの値に大きな
誤差が生じた場合には、その直前のＩピクチャーまたは
Ｐピクチャーでシーンチェンジが生じたと判断すること
ができる。[0127] difficulty Thus, the 'even if a large error in the value of the difficulty data D _j is not generated, predictive difficulty data D _j of the B-picture itself' predictive difficulty data D _j of the P picture and I picture If a large error in the value of the data D _j occurs, it can be determined that the I-picture or P-picture at the scene change immediately before occurred.

【０１２８】第４の実施形態に示すシーンチェンジ検出
方法は、以上説明した実難度データＤ_jと予測難度デー
タＤ_j’との関係を利用しており、第３の実施形態にそ
れぞれ示した改良簡易２パスエンコード方式において、
より正確にシーンチェンジの検出を可能とする。つま
り、第４の実施形態に示すシーンチェンジ検出方法は、
第３の実施形態に示した映像データ圧縮装置１を用いた
改良予測簡易２パスエンコード方式において、予測難度
データＤ_j’と実難度データＤ_jとの値を比較してシー
ンチェンジを正確に検出するようになっている。The scene change detection method shown in the fourth embodiment utilizes the relationship between the actual difficulty data D _j and the predicted difficulty data D _j ′ described above, and the improvement shown in the third embodiment, respectively. In the simple two-pass encoding method,
This enables more accurate scene change detection. That is, the scene change detection method shown in the fourth embodiment is
In the improved simplified simple two-pass encoding method using the video data compression device 1 shown in the third embodiment, a scene change is accurately detected by comparing the values of the prediction difficulty data D _j ′ and the actual difficulty data D _j. It is supposed to.

【０１２９】具体的には、第４の実施形態におけるシー
ンチェンジの検出は、Ｉピクチャーの実難度データＤ_jI
に対する予測難度データＤ_jI’の比の値（Ｄ_jI／
Ｄ_jI’）、および、Ｐピクチャーの実難度データＤ_jpに
対する予測難度データＤ_jp’の比の値（Ｄ_jp／Ｄ_jp’）
が、所定の閾値の範囲外にある場合〔Ｔｈ_I1＜（Ｄ_j／
Ｄ_j’）または（Ｄ_jP／Ｄ_jP’）＜Ｔｈ_I2，Ｔｈ_p1＜
（Ｄ_jP／Ｄ_jP’）または（Ｄ_j／Ｄ_j’）＜Ｔｈ_p2。た
だし、Ｔｈ_I1＞１＞Ｔｈ_I2＞０，Ｔｈ_p1＞１＞Ｔｈ_p2＞
０〕には、シーンチェンジの発生をそのピクチャーで検
出する。但し、通常、ＰピクチャーのＰピクチャーの実
難度データＤ_jpに対する予測難度データＤ_jp’の比の値
（Ｄ_jp／Ｄ_jp’）が、加減値Ｔｈ_P2以下になることは殆
どない。More specifically, the detection of a scene change in the fourth embodiment is based on the actual difficulty data D _{jI of the} I picture.
_Value of the prediction difficulty data D _jI ′ with respect to (D _jI /
D _jI ′) and the ratio of the predicted difficulty data D _jp ′ to the actual difficulty data D _jp of the P picture (D _jp / D _jp ′)
Is outside the range of the predetermined threshold [Th _I1 <(D _j /
D _j ') or (D _jP / D _jP ') <Th _I2 , Th _p1 <
(D _jP / D _jP ') or (D _j / D _j ') <Th _p2 . However, Th _I1 >1> Th _I2 > 0, Th _p1 >1> Th _p2 >
0], the occurrence of a scene change is detected in the picture. However, usually, the value of the ratio of the predicted difficulty data D _jp ′ to the actual difficulty data D _jp of the P picture to the actual difficulty data D _jp of the P picture (D _jp / D _jp ′) rarely becomes less than the adjustment value Th _P2 .

【０１３０】また、第４の実施形態におけるシーンチェ
ンジ検出方法は、ＩピクチャーおよびＰピクチャーの実
難度データＤ_jI，Ｄ_jPに対する予測難度データＤ_jI’，
Ｄ_jP’の比の値が、上記所定の閾値の範囲内である場合
であっても、Ｂピクチャーの実難度データＤ_jBに対する
予測難度データＤ_jB’の比の値（Ｄ_jB／Ｄ_jB’）が、所
定の範囲外にある場合に〔Ｔｈ_B＜（Ｄ_jB／Ｄ_jB’）。
但し、Ｔｈ_B＞１〕、シーンチェンジの発生を、そのＢ
ピクチャーの直前のＩピクチャーまたはＰピクチャーで
シーンチェンジが生じたと検出する。The scene change detection method according to the fourth embodiment _uses the prediction difficulty data D _jI ′, and the actual difficulty data D _jI , D _jP of the I picture and the P picture.
Even if the value of the ratio of D _jP ′ is within the range of the predetermined threshold value, the value of the ratio (D _jB / D _jB ′ ₎ of the predicted difficulty data D _jB ′ to the actual difficulty data D _jB of the B picture. ) Is out of the predetermined range, [Th _B <(D _jB / D _jB ′).
However, Th _B> 1], the occurrence of a scene change, the B
It is detected that a scene change has occurred in the I picture or P picture immediately before the picture.

【０１３１】次に、第４の実施形態における映像データ
圧縮装置１（図１）の動作を説明する。エンコーダ制御
部１２は、第１の実施形態〜第３の実施形態においてと
同様に、非圧縮映像データのピクチャーを、例えば、図
８（Ａ）に示した順番から図８（Ｂ）に示した順番に入
れ替える。ＦＩＦＯメモリ１６０は、第１の実施形態〜
第３の実施形態においてと同様に、例えば、入力される
編集映像データを１５ピクチャー分、遅延する。エンコ
ーダ１６２は、第１の実施形態〜第３の実施形態におい
てと同様に、シーンチェンジの有無にかかわらず、映像
データＳ１２を圧縮符号化し、実難度データＤ_jを生成
する。Next, the operation of the video data compression device 1 (FIG. 1) in the fourth embodiment will be described. As in the first to third embodiments, the encoder control unit 12 displays pictures of uncompressed video data in, for example, the order shown in FIG. Swap in order. FIFO memory 160 according to the first embodiment
As in the third embodiment, for example, input edited video data is delayed by 15 pictures. The encoder 162, as well as in the first embodiment to the third embodiment, regardless of the presence or absence of a scene change, compresses and encodes the video data S12, generates the real difficulty data D _j.

【０１３２】ホストコンピュータ２０は、エンコーダ１
６２から入力される実難度データＤ _jと予測難度データ
Ｄ_j’とを比較し、第４の実施形態において上述したよ
うに、ＰピクチャーおよびＩピクチャーの予測難度デー
タＤ_j’の実難度データＤ_jに対する比の値、および、
Ｂピクチャーの予測難度データＤ_j’の実難度データＤ
_jに対する比の値が、上記所定の範囲外となる位置でシ
ーンチェンジが発生したことを検出する。The host computer 20 has the encoder 1
Actual difficulty data D input from 62 _jAnd forecast difficulty data
D_j′, As described above in the fourth embodiment.
P-picture and I-picture prediction difficulty data
TA D_j’Actual difficulty data D_jThe value of the ratio to, and
B picture prediction difficulty data D_j’Actual difficulty data D
_jAt a position where the value of the ratio to
Detects that a traffic change has occurred.

【０１３３】シーンチェンジを検出した場合、ホストコ
ンピュータ２０はさらに、第３の実施形態においてと同
様に、後ろのシーンの最初のＰピクチャーを前のシーン
の最後のピクチャーを参照しないＩピクチャーに変更し
（図８（Ｃ））、前のシーンの最後のＩピクチャーをＰ
ピクチャーに変更するように、ピクチャータイプシーケ
ンスを変更させる。When a scene change is detected, the host computer 20 further changes the first P picture of the subsequent scene to an I picture that does not refer to the last picture of the previous scene, as in the third embodiment. (FIG. 8C), the last I picture of the previous scene is set to P
The picture type sequence is changed like changing to a picture.

【０１３４】ホストコンピュータ２０は、第３の実施形
態においてと同様に、編集映像データにシーンチェンジ
が発生しない場合には、エンコーダ１６２から得られた
データから実難度データＤ_jを生成し、予測難度データ
Ｄ’₁₆〜Ｄ’₃₀をピクチャータイプごとに算出する。ま
た、ホストコンピュータ２０は、シーンチェンジが発生
した場合には、シーンチェンジ前後でピクチャーの相関
性がなくなるので、第３の実施形態においと同様に、シ
ーンチェンジ直後の所定数枚のピクチャーの実難度デー
タＤ_jから、式６により、総和値Ｓｕｍ_j（式５）を算
出し、算出した総和値Ｓｕｍ_jに基づいて、目標データ
量Ｔ_jを算出する。エンコーダ１２は、圧縮符号化後の
データ量が、ホストコンピュータ２０が生成した目標デ
ータ量Ｔ_jが示す値に近くなるように遅延された非圧縮
映像データＳ１６を圧縮符号化し、圧縮映像データＶＯ
ＵＴとして出力する。[0134] The host computer 20, as well as in the third embodiment, when the edited video data scene change does not occur, generates real difficulty data D _j from the data obtained from the encoder 162, the prediction difficulty Data D' _{16 to} D' ₃₀ are calculated for each picture type. Further, when a scene change occurs, the host computer 20 loses the correlation between the pictures before and after the scene change. Therefore, as in the third embodiment, the actual difficulty level of a predetermined number of pictures immediately after the scene change is changed. From the data D _j , the total sum Sum _j (Equation 5) is calculated according to Equation 6, and the target data amount T _j is calculated based on the calculated total sum Sum _j . Encoder 12, the amount of compressed data encoded, compressed and encoded non-compressed video data S16 delayed to be closer to the value indicated by the target amount of data T _j by the host computer 20 is generated, compressed video data VO
Output as UT.

【０１３５】以下、フローチャートを参照して、第４の
実施形態に示した映像データ圧縮装置１のホストコンピ
ュータ２０によるシーンチェンジ検出処理の内容をさら
に説明する。図１５は、第４の実施形態における映像デ
ータ圧縮装置１（図１）のホストコンピュータ２０によ
るシーンチェンジ検出処理の内容を示すフローチャート
図である。Hereinafter, the details of the scene change detection processing by the host computer 20 of the video data compression apparatus 1 shown in the fourth embodiment will be described with reference to flowcharts. FIG. 15 is a flowchart showing the content of the scene change detection process by the host computer 20 of the video data compression device 1 (FIG. 1) in the fourth embodiment.

【０１３６】図１５に示すように、ステップ３００（Ｓ
３００）において、ホストコンピュータ２０は、第ｊ番
目の実難度データＤ_jを算出する。ステップ３０２（Ｓ
３０２）において、ホストコンピュータ２０は、第ｊ番
目のピクチャーがあるか否かを判断する。第ｊ番目のピ
クチャーがある場合には、Ｓ３０４の処理に進み、ない
場合には処理を終了する。ステップ３０４（Ｓ３０４）
において、ホストコンピュータ２０は、第ｊ番目のピク
チャーのピクチャータイプを判断する。第ｊ番目のピク
チャーのピクチャータイプがＢピクチャー、Ｉピクチャ
ーまたはＰピクチャーである場合、それぞれ、Ｓ３０
６，Ｓ３１６，Ｓ３２０の処理に進む。As shown in FIG. 15, step 300 (S
In 300), the host computer 20 calculates the j-th real difficulty data D _j. Step 302 (S
In 302), the host computer 20 determines whether or not there is a j-th picture. If there is a j-th picture, the process proceeds to S304; otherwise, the process ends. Step 304 (S304)
In, the host computer 20 determines the picture type of the j-th picture. If the picture type of the j-th picture is a B picture, an I picture, or a P picture,
6, the process proceeds to S316 and S320.

【０１３７】ステップ３０６（Ｓ３０６）において、ホ
ストコンピュータ２０は、数値Ｂ＿ｃｏｕｎｔをインク
リメントする。ステップ３０８（Ｓ３０８）において、
ホストコンピュータ２０は、数値Ｂ＿ｃｏｕｎｔの値が
１であるか否かを判断する。数値Ｂ＿ｃｏｕｎｔの値が
１である場合には、Ｓ３１２の処理に進み、数値Ｂ＿ｃ
ｏｕｎｔの値が１でない場合には、Ｓ３１０の処理に進
む。In step 306 (S306), the host computer 20 increments the numerical value B_count. In step 308 (S308),
The host computer 20 determines whether the value of the numerical value B_count is 1 or not. If the value of the numerical value B_count is 1, the process proceeds to S312, where the numerical value B_c
If the value of “out” is not “1”, the process proceeds to S310.

【０１３８】ステップ３１０（Ｓ３１０）において、ホ
ストコンピュータ２０は、シーンチェンジが発生しなか
ったと判断する。ステップ３１２（Ｓ３１２）におい
て、ホストコンピュータ２０は、Ｂピクチャーから生成
した予測難度データＤ_j’と実難度データＤ_jとの比の
値を算出し、Ｄ_j＞Ｔｈ_B×Ｄ_j’（Ｄ_jB／Ｄ_jB’＞Ｔ
ｈ_B）であるか否かを判断する。Ｄ _j＞Ｔｈ_B×Ｄ_j’
である場合、Ｓ３１０の処理に進み、Ｄ_j＞Ｔｈ_B×Ｄ
_j’でない場合、Ｓ３１４の処理に進む。ステップ３１
４（Ｓ３１４）において、ホストコンピュータ２０は、
直前のＩピクチャーまたはＰピクチャー〔第（ｊ−１）
番目のピクチャー〕でシーンチェンジが発生したと判定
する。In step 310 (S310), the
The strike computer 20 determines whether a scene change has occurred.
Judge that Step 312 (S312)
The host computer 20 generates the B picture
Predicted difficulty data D_j’And the actual difficulty data D_jOf the ratio of
Calculate the value, D_j> Th_B× D_j’(D_jB/ D_jB’> T
h_B) Is determined. D _j> Th_B× D_j’
In step S310, the process proceeds to step S310, where D_j> Th_B× D
_jIf not, the process proceeds to S314. Step 31
4 (S314), the host computer 20
The immediately preceding I picture or P picture [(j-1)
Judge that a scene change has occurred
I do.

【０１３９】ステップ３１６（Ｓ３１６）において、ホ
ストコンピュータ２０は、数値Ｂ＿ｃｏｕｎｔの値をゼ
ロクリアする。ステップ３１８（Ｓ３１８）において、
ホストコンピュータ２０は、Ｐピクチャーから生成した
予測難度データＤ_j’と実難度データＤ_jとの比の値を
算出し、Ｄ_j＞Ｔｈ_P1×Ｄ_j’またはＤ_j＜Ｔｈ_P2×Ｄ
_j’であるか否かを判断する。Ｄ_j＞Ｔｈ_P1×Ｄ_j’ま
たはＤ_j＜Ｔｈ_P2×Ｄ_j’である場合、Ｓ３２４の処理
に進み、Ｄ_j＞Ｔｈ_P1×Ｄ_j’またはＤ_j＜Ｔｈ_P2×Ｄ
_j’でない場合、Ｓ３１０の処理に進む。At step 316 (S316), the host computer 20 clears the value of the numerical value B_count to zero. In step 318 (S318),
The host computer 20 calculates a ratio value between the predicted difficulty data D _j ′ generated from the P picture and the actual difficulty data D _j, and calculates D _j > Th _P1 × D _j ′ or D _j <Th _P2 × D
_j 'is determined. If a _{_{_{D j> Th P1 × D j}}} ' or _{_{_{D j <Th P2 × D j}}} ', the flow proceeds to the processing of _{_{S324, D j> Th P1 ×}} D j ' or D _j <Th _P2 × D
If it is not _j ′, the process proceeds to S310.

【０１４０】ステップ３２０（Ｓ３２０）において、ホ
ストコンピュータ２０は、ホストコンピュータ２０は、
数値Ｂ＿ｃｏｕｎｔの値をゼロクリアする。ステップ３
２２（Ｓ３２２）において、ホストコンピュータ２０
は、Ｉピクチャーから生成した予測難度データＤ_j’と
実難度データＤ_jとの比の値を算出し、Ｄ_j＞Ｔｈ_I1×
Ｄ_j’またはＤ_j＜Ｔｈ_I2×Ｄ_j’であるか否かを判断
する。Ｄ_j＞Ｔｈ_I1×Ｄ_j’またはＤ_j＜Ｔｈ_I2×
Ｄ_j’である場合、Ｓ３２４の処理に進み、Ｄ_j＞Ｔｈ
_I1×Ｄ_j’またはＤ_j＜Ｔｈ_I2×Ｄ_j’でない場合、Ｓ
３１０の処理に進む。At step 320 (S320), the host computer 20
The value of the numerical value B_count is cleared to zero. Step 3
22 (S322), the host computer 20
Calculates the ratio value between the predicted difficulty data D _j ′ generated from the I picture and the actual difficulty data D _j, and calculates D _j > Th _I1 ×
It is determined whether D _j ′ or D _j <Th _I2 × D _j ′. D _j > Th _I1 × D _j ′ or D _j <Th _I2 ×
If D _j ′, the process proceeds to S324, where D _j > Th
_{If I1} × D _j ′ or D _j <Th _I2 × D _j ′, S
Proceed to 310.

【０１４１】ステップ３２４（Ｓ３２４）において、ホ
ストコンピュータ２０は、第ｊ番目のピクチャーでシー
ンチェンジが発生したとを判断する。ステップ３２６
（Ｓ３２６）において、ホストコンピュータ２０は、実
難度データＤ_jまでを用いて、次の予測難度データＤ
_j+1を算出する。ステップ３２８（Ｓ３２８）におい
て、ホストコンピュータ２０は、数値ｊをインクリメン
トする。In step 324 (S324), the host computer 20 determines that a scene change has occurred in the j-th picture. Step 326
In (S326), the host computer 20, using up real difficulty data D _j, next prediction difficulty data D
Calculate _{j + 1} . In step 328 (S328), the host computer 20 increments the numerical value j.

【０１４２】なお、第４の実施形態においては、予測難
度データＤ_j’の予測方法として、第３の実施形態に示
した直線近似を用いたが、予測難度データＤ_j’の予測
方法は、これに限らず、例えば、実難度データＤ_jの差
分値に基づいて、実難度データＤ_jの変化を予測するこ
とにより予測難度データＤ_j’を算出する方法を採って
もよい。また、第４の実施形態においては、シーンチェ
ンジを検出する際に、Ｂピクチャーの前のピクチャーが
ＩピクチャーであろうとＰピクチャーであろうと、同じ
Ｂピクチャーの予測難度データＤ_j’と実難度データＤ
_jとの比較の際に、同じ閾値Ｔｈ_Bを用いたが、前のピ
クチャーのピクチャータイプに応じて、閾値を変更して
もよい。[0142] In the fourth embodiment, the prediction difficulty data D _j 'as a method of predicting, but using a linear approximation shown in the third embodiment, predictive difficulty data D _j' prediction method is Alternatively, for example, based on the difference value of the real difficulty data D _j, it may be adopted a method for calculating a prediction difficulty data D _j 'by predicting a change in the real difficulty data D _j. Further, in the fourth embodiment, when detecting a scene change, whether the preceding picture of the B picture is an I picture or a P picture, the prediction difficulty data D _j ′ and the actual difficulty data of the same B picture are used. D
in the comparison with the _j, but with the same threshold value Th _B, depending on the picture type of the previous picture, the threshold value may be changed.

【０１４３】以上第４の実施形態において説明したシー
ンチェンジの検出方法によれば、第３の実施形態に示し
た実難度データＤ_jの経時的な変化の監視によっては、
検出しにくかったＩピクチャーでのシーンチェンジ、あ
るいは、シーンチェンジの前の絵柄が難しく、シーンチ
ェンジ後の絵柄が優しい場合のＰピクチャーでのシーン
チェンジを、確実に検出することができる。従って、第
３の実施形態に示したシーンチェンジの検出方法を採用
する場合に比べて、圧縮符号化後の映像データの品質を
向上させることができる。[0143] According to the method of detecting the scene change described in the fourth embodiment above, by monitoring the change over time of the real difficulty data D _j shown in the third embodiment,
A scene change in an I picture that is difficult to detect, or a scene change in a P picture when a picture before the scene change is difficult and the picture after the scene change is gentle, can be reliably detected. Therefore, the quality of video data after compression encoding can be improved as compared with the case where the scene change detection method described in the third embodiment is employed.

【０１４４】[0144]

【発明の効果】以上述べたように本発明に係る映像デー
タ圧縮装置およびその方法によれば、２パスエンコード
によらずに、複数のシーンを連続的に含む映像データを
所定のデータ量以下に圧縮符号化して圧縮映像データを
生成することができ、しかも、連続的な複数のシーンの
時間方向における境界（シーンチェンジ）部分を圧縮符
号化した圧縮映像データを伸長復号して得られる映像の
品質を保持することができる。As described above, according to the video data compression apparatus and method according to the present invention, video data continuously including a plurality of scenes can be reduced to a predetermined data amount or less without using two-pass encoding. Compressed video data can be generated by compression encoding, and the quality of video obtained by expanding and decoding compressed video data obtained by compressing and encoding boundaries (scene changes) in the time direction of a plurality of continuous scenes Can be held.

[Brief description of the drawings]

【図１】本発明に係る映像データ圧縮装置の構成を示す
図である。FIG. 1 is a diagram showing a configuration of a video data compression device according to the present invention.

【図２】図１に示した簡易２パス処理部のエンコーダの
構成を示す図である。FIG. 2 is a diagram illustrating a configuration of an encoder of a simple two-pass processing unit illustrated in FIG. 1;

【図３】図１に示したエンコーダの構成を示す図であ
る。FIG. 3 is a diagram illustrating a configuration of an encoder illustrated in FIG. 1;

【図４】（Ａ）〜（Ｃ）は、第１の実施形態における映
像データ圧縮装置の簡易２パスエンコードの動作を示す
図である。FIGS. 4A to 4C are diagrams illustrating an operation of a simple two-pass encoding of the video data compression device according to the first embodiment.

【図５】（Ａ）〜（Ｃ）は、映像データ圧縮装置の動作
を示す図である。FIGS. 5A to 5C are diagrams illustrating the operation of the video data compression device.

【図６】第２の実施形態における映像データ圧縮装置
（図１）の動作を示すフローチャートである。FIG. 6 is a flowchart showing the operation of the video data compression device (FIG. 1) in the second embodiment.

【図７】（Ａ）〜（Ｃ）は、第２の実施形態における予
測簡易２パスエンコード方式、および、第３の実施形態
における改良予測簡易２パスエンコード方式による、シ
ーンチェンジの前後のピクチャーに対する圧縮符号化を
示す図である。FIGS. 7A to 7C are diagrams for pictures before and after a scene change according to the simplified simplified two-pass encoding method according to the second embodiment and the improved simplified simplified two-pass encoding method according to the third embodiment; FIG. 3 is a diagram illustrating compression encoding.

【図８】（Ａ）〜（Ｃ）は、エンコーダ制御部（図１）
による編集映像データのピクチャーの順序の入れ替え処
理、および、ホストコンピュータによるピクチャータイ
プの変更処理を示す図である。8A to 8C are encoder control units (FIG. 1)
FIG. 8 is a diagram showing a process of changing the order of pictures in edited video data by a computer and a process of changing a picture type by a host computer.

【図９】編集映像データのシーンチェンジ部分付近の実
難度データの値の経時的な変化を例示する図である。FIG. 9 is a diagram exemplifying a temporal change in the value of actual difficulty data near a scene change portion of edited video data.

【図１０】ホストコンピュータ（図１）が、編集映像デ
ータにシーンチェンジが発生する場合に、実難度データ
Ｄ₁〜Ｄ₁₅に基づいて予測難度データＤ’₁₆〜Ｄ’₃₀を
算出する方法、および、編集映像データにシーンチェン
ジが発生しない場合の予測難度データＤ’₁₆〜Ｄ’₃₀を
算出する方法を示す図である。FIG. 10 shows a method in which a host computer (FIG. 1) calculates predicted difficulty data D ′ _{16 to} D ′ ₃₀ based on actual difficulty data D _{1 to} D ₁₅ when a scene change occurs in edited video data; FIG. 11 is a diagram illustrating a method of calculating predicted difficulty data D ′ _{16 to} D ′ ₃₀ when a scene change does not occur in edited video data.

【図１１】第３の実施形態における改良予測簡易２パス
エンコード方式における総和値Ｓｕｍ_iの予測および目
標データ量Ｔ_iの算出に係る処理内容を示す第１のフロ
ーチャート図である。11 is a first flowchart showing a process according to the calculation of the predicted and target amount of data T _i of the summation value Sum _i in the improved predictive simplified two pass encoding system in the third embodiment.

【図１２】第３の実施形態における改良予測簡易２パス
エンコード方式における総和値Ｓｕｍ_iの予測および目
標データ量Ｔ_iの算出に係る処理内容を示す第２のフロ
ーチャート図である。12 is a second flowchart showing the processing contents of the calculation of the predicted and target amount of data T _i of the summation value Sum _i in the improved predictive simplified two pass encoding system in the third embodiment.

【図１３】シーンチェンジがＰピクチャーで生じた場合
に、その前後における実難度データＤ_j（○印）と予測
難度データＤ’_j（×印）との関係を、圧縮符号化の順
に例示する図である。FIG. 13 illustrates a relationship between actual difficulty data D _j (Ｄ) and predicted difficulty data D ′ _j (×) before and after a scene change occurs in a P picture in the order of compression encoding. FIG.

【図１４】シーンチェンジがＩピクチャーで生じた場合
に、その前後における実難度データＤ_j（○印）と予測
難度データＤ’_j（×印）との関係を、圧縮符号化の順
に例示する図である。FIG. 14 illustrates the relationship between actual difficulty data D _j (Ｄ) and predicted difficulty data D ′ _j (×) before and after a scene change occurs in an I picture in the order of compression encoding. FIG.

【図１５】第４の実施形態における映像データ圧縮装置
（図１）のホストコンピュータによるシーンチェンジ検
出処理の内容を示すフローチャート図である。FIG. 15 is a flowchart showing the content of a scene change detection process by the host computer of the video data compression device (FIG. 1) in the fourth embodiment.

[Explanation of symbols]

１…映像データ圧縮装置、１０…圧縮符号化部、１４…
動き検出器、１６…簡易２パス処理部、１６０…ＦＩＦ
Ｏメモリ、１６２，１８…エンコーダ、２０…ホストコ
ンピュータ。DESCRIPTION OF SYMBOLS 1 ... Video data compression apparatus, 10 ... Compression coding part, 14 ...
Motion detector, 16: Simple 2-pass processing unit, 160: FIF
O memory, 162, 18 ... encoder, 20 ... host computer.

Claims

[Claims]

1. After a head of a plurality of uncompressed video data input continuously is compressed into a predetermined picture type sequence composed of a combination of an I picture, a P picture and a B picture by a predetermined compression method. ,
Means for changing the order of pictures so as to be I pictures or P pictures; and compressing the uncompressed video data whose order has been changed by the predetermined compression method to generate first compressed video data. A first compression unit; a delay unit that delays the uncompressed video data whose order has been changed by a predetermined delay time; and the first compression generated from the uncompressed video data corresponding to the predetermined delay time. A prediction unit for predicting a data amount of a predetermined amount of the uncompressed first compressed video data based on a data amount of the video data; a predicted data amount of the first compressed video data; Head detecting means for detecting a head of the non-compressed video data based on a data amount (actual data amount) of the first compressed video data; The beginning of the picture of the reduced image data,
Changing means for changing the predetermined picture type sequence so as not to have a relationship with a picture of another video data after compression; generated first compressed video data; and predicted first compression. A target value generating means for generating a target value of the data amount of the uncompressed video data after compression based on the data amount of the video data; A second compression means for compressing the uncompressed video data into the changed predetermined picture type sequence by the predetermined compression method.

2. The head detecting means according to claim 1, wherein the actual data amount of the I picture and the P picture is such that the predicted ratio of the first compressed video data to the I picture and the P picture is out of a predetermined range. 2. The video data compression apparatus according to claim 1, wherein when the data amount increases, a head of the uncompressed video data is detected at a position corresponding to the P picture whose data amount has increased.

3. The head detecting means according to claim 1, wherein the actual data amount of the B picture is the B data of the predicted first compressed video data.
When the data amount of the uncompressed video data is larger than the data amount of the picture by a predetermined ratio or more, a head of the uncompressed video data is detected at a position of a P picture and an I picture immediately before the B picture whose data amount is increased. 2. The video data compression device according to 1.

4. The method according to claim 1, wherein, in the predetermined picture type sequence, when the head of the uncompressed video data is compressed into a P picture, the head of the uncompressed video data is compressed into an I picture. 2. The video data compression apparatus according to claim 1, wherein said predetermined picture type sequence is changed.

5. The non-compressed video data, wherein when the predetermined picture type sequence is changed so that the head of the non-compressed video data is compressed into an I picture, the uncompressed video data becomes an I picture after neighboring compression. 5. The method of claim 4, wherein the predetermined picture type sequence is further modified such that a picture of data is compressed into a P picture.
2. The video data compression device according to claim 1.

6. After a head of a plurality of uncompressed video data which are continuously inputted is compressed into a predetermined picture type sequence composed of a combination of I picture, P picture and B picture by a predetermined compression method. ,
Generating a first compressed video data by compressing to become an I picture or a P picture; and generating a predetermined amount of the uncompressed video data based on a data amount of the first compressed video data corresponding to the predetermined delay time. The data amount of the first compressed video data, and the predicted data amount of the first compressed video data and the data amount (actual data amount) of the actually generated first compressed video data. A video data compression method for detecting a head of the uncompressed video data based on the video data.

7. A method according to claim 7, wherein said uncompressed video data is delayed by a predetermined delay time, and said predetermined picture type sequence is changed so that a detected P picture becomes an I picture after compression. The first compressed video data and the first
Generating a target value of the compressed data amount based on the data amount with the compressed video data of the compressed video data, and delaying the uncompressed video data so that the compressed data amount becomes the generated target value. 7. The video data compression method according to claim 6, wherein the video data is compressed into the changed predetermined picture type sequence by the predetermined compression method.

8. The position of the I picture and P picture in which the ratio of the actual I picture and P picture to the predicted data amount of the I picture and P picture is out of a predetermined range is used as the first picture. The video data compression method according to claim 7, wherein the detection is performed.

9. The position of an I picture and a P picture immediately before a B picture whose actual data amount has been increased by a predetermined ratio or more than the predicted data amount of a B picture is taken as the head of the first compressed video data. Claim 7 for detecting
2. The video data compression method according to 1.

10. In the predetermined picture type sequence, when the head of the uncompressed video data is compressed into a P picture, the head of the uncompressed video data is compressed into an I picture. 8. The video data compression method according to claim 7, wherein a predetermined picture type sequence is changed.

11. When the predetermined picture type sequence is changed so that the head of the uncompressed video data is compressed into an I picture, the picture of the uncompressed video data that becomes an I picture after nearby compression is: The video data compression method according to claim 10, wherein the predetermined picture type sequence is further changed so as to be compressed into a P picture.