CN100477789C

CN100477789C - A method of inter-frame image enhancement based on Joint Motion Photographic Experts Group 2000

Info

Publication number: CN100477789C
Application number: CN 03149199
Authority: CN
Inventors: 刘炜; 郑文涛
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2003-06-20
Filing date: 2003-06-20
Publication date: 2009-04-08
Anticipated expiration: 2023-06-20
Also published as: CN1568011A

Abstract

The invention discloses a method of intensifying video frequency image. It includes the following procedures: set up image reference pins that are used in intensifying the compensation for the video frequency image pins whose data is lost; divide the pins of loss of data into data blocks that are MxN pixel(both M and N are integer); form the motion estimation between reference pins and pins of loss of data, so it's easy to search the data block that is matching the data block whose data is lost in reference pins; make use of the above pins to intensify the lost data blocks in the pins of loss of data. The method can use the relevance between image pins to compensate the loss of image data caused by network transmission congestion, and upgrade the image quality.

Description

A method of inter-frame image enhancement based on Joint Motion Photographic Experts Group 2000

技术领域 technical field

本发明涉及一种视频图像帧间增强方法，特别是一种对基于运动联合图像专家组2000(Motion JPEG 2000)压缩的视频图像进行帧间增强的方法，该方法能够在接收端尽可能地恢复丢失的视频图像的细节。The present invention relates to a method for inter-frame enhancement of video images, in particular to a method for inter-frame enhancement of video images compressed based on Motion JPEG 2000, which can recover as much as possible at the receiving end Lost video image detail.

背景技术 Background technique

随着多媒体应用的普及、数字视频技术的发展及网络上图像传输的增多，例如可视电话的使用，图像的远程浏览与检索，全球多用户虚拟环境共享等，对图像处理技术的研究变得越来越重要。With the popularization of multimedia applications, the development of digital video technology and the increase of image transmission on the network, such as the use of videophone, remote browsing and retrieval of images, global multi-user virtual environment sharing, etc., the research on image processing technology has become more and more important. more and more important.

图像的处理包括图像压缩，图像增强和恢复，图像除噪以及图像的描述，分类和识别。对于图像采集、存储和传输过程来说，迫切需要快速发展图像压缩技术。一幅具有中等分辨率(640*480)彩色(24bit/象素)数字图像的数据量约为7.37Mbit，庞大的数据量给图像的存储和传输带来了很大的困难。因此，在传输前必须对图像进行编码。Image processing includes image compression, image enhancement and restoration, image denoising, and image description, classification, and recognition. For the process of image acquisition, storage and transmission, it is urgent to develop image compression technology rapidly. The data volume of a color (24bit/pixel) digital image with medium resolution (640*480) is about 7.37Mbit. The huge data volume brings great difficulties to the image storage and transmission. Therefore, images must be encoded before transmission.

传统的JPEG压缩技术已经无法满足人们对多媒体影像资料的要求，因此需要一种功能更强大、效率更卓越的图像压缩标准。在这种背景下JPEG 2000应运而生。The traditional JPEG compression technology has been unable to meet people's requirements for multimedia image data, so a more powerful and efficient image compression standard is needed. In this context, JPEG 2000 came into being.

运动联合图像专家组2000(Motion JPEG 2000)是一种对运动图像的帧内编码方法。它并不利用帧间相关性，而是使用JPEG 2000提供的压缩手段逐帧去除视频序列中每一帧的帧内冗余。Motion JPEG 2000 (Motion JPEG 2000) is an intra-frame coding method for moving images. It does not use inter-frame correlation, but uses the compression method provided by JPEG 2000 to remove the intra-frame redundancy of each frame in the video sequence frame by frame.

随着时间的推移，人们对图像压缩处理的要求越来越高。另外，基于互联网和无线信道的视频传输已经越来越普遍。在实际应用中，有一些重要的图像，如卫星遥感图像、医学图像等需要通过互联网来传送。而这些图像传输要求确保较高的图像质量。As time goes by, people have higher and higher requirements for image compression processing. In addition, video transmission based on the Internet and wireless channels has become more and more common. In practical applications, some important images, such as satellite remote sensing images and medical images, need to be transmitted through the Internet. These image transmissions require high image quality.

然而，由于受到信道宽度的限制，通过互联网或无线信道进行的视频传输通常受到网络拥塞的影响。当网络出现拥塞时，路由器有选择地截断一些表示详细信息的数据分组，这将导致信息被截断的帧的图像质量的降低。However, video transmission over the Internet or wireless channels is often affected by network congestion due to the limitation of channel width. When the network is congested, the router selectively truncates some data packets representing detailed information, which will result in a reduction in the image quality of the truncated frames.

在JPEG 2000中，空域信号在经过诸如彩色空间变化，去直流，归一化之类的前处理后，进行离散小波变换(DWT)，以去除数据之间的相关性。图1中示出了一维离散小波变换分解的示意图。一维DWT的基本滤波方式是通过高频和低频滤波器组，将原始信号分解到频域上两个互不混叠的子带上，并通过降采样来保持信号的数目不变。一次DWT后剩余的低频信号往往还具有相关性。可以在其上再做k(k是整数)级子带分解，直到信号之间的相关性可以忽略为止。In JPEG 2000, the spatial domain signal is subjected to discrete wavelet transform (DWT) after pre-processing such as color space change, DC removal, and normalization to remove the correlation between data. Fig. 1 shows a schematic diagram of one-dimensional discrete wavelet transform decomposition. The basic filtering method of one-dimensional DWT is to decompose the original signal into two non-aliasing sub-bands in the frequency domain through high-frequency and low-frequency filter banks, and keep the number of signals unchanged by down-sampling. The remaining low-frequency signals after a DWT are often still relevant. Further k (k is an integer) sub-band decomposition can be performed on it until the correlation between signals can be ignored.

二维DWT则是在一维的行DWT和降采样的基础上，再对列信号进行一次DWT，并进行降采样以保证变换后的数据量与变换前相同。经过一次二维DWT变换之后，原图像被划分为四个子带。其中唯一的一个LL子带(行低频和列低频)仍可以继续分解，经过3-5级后形成分级结构。这种分级结构使JPEG 2000支持多分辨率的码流结构。The two-dimensional DWT is based on the one-dimensional row DWT and downsampling, and then performs a DWT on the column signal, and performs downsampling to ensure that the amount of data after the transformation is the same as before the transformation. After a two-dimensional DWT transformation, the original image is divided into four subbands. The only LL sub-band (row low frequency and column low frequency) can still be decomposed, and a hierarchical structure is formed after 3-5 levels. This hierarchical structure enables JPEG 2000 to support multi-resolution stream structures.

DWT之后的小波系数需要经过量化，JPEG 2000的量化器的一个特殊之处在于引入了“Dead Zone”的概念。对于每个子带的量化器，“DeadZone”的宽度都是其它步长的2倍。这意味着如果在子带b内采用步长为Δ_b的量化器，则该子带的一个系数y(i，j)可以用

作为量化索引值，其中sign()表示一个变量的符号，

表示向下取整。这种量化方式的优点是可以根据信噪比要求，提供一种渐进的码率模式，即可以先用较小的步长对信号进行量化。然后根据用户的不同需求以较宽的步长(一般是原始步长的2的整数次幂倍)进行反量化。此时，只需丢弃量化索引值的一些最低有效位(LSB)即可。The wavelet coefficients after DWT need to be quantized. A special feature of the quantizer of JPEG 2000 is the introduction of the concept of "Dead Zone". For each subband quantizer, the "DeadZone" is twice as wide as the other steps. This means that if a quantizer with a step size _Δb is used in subband b, one coefficient y(i,j) of this subband can be given by

As a quantized index value, where sign() represents the sign of a variable,

Indicates rounding down. The advantage of this quantization method is that it can provide a progressive code rate mode according to the requirements of the signal-to-noise ratio, that is, the signal can be quantized with a smaller step size first. Then dequantize with a wider step size (generally an integer power of 2 of the original step size) according to different needs of users. At this point, it is only necessary to discard some least significant bits (LSBs) of the quantization index value.

然后需要对经过量化的小波系数进行优化截断点的嵌入式块编码(Embedded Block Coding with Optimal Truncation)(EBCOT)。EBCOT是一种基于位平面(bit plain)的编码方式。按照位面从高到低的顺序，提取处于相同位平面的数据进行编码。这样就提供了图像质量由粗到精的渐进式编码。同时，EBCOT还提供了对多截断点的支持，即，为了适应一定的压缩码率，可以在JPEG 2000码流的任何一个截断点截断该码流。相当于丢弃当前位平面中所有不显著的系数以及所有显著系数的较低有效位。Then it is necessary to perform Embedded Block Coding with Optimal Truncation (EBCOT) on the quantized wavelet coefficients. EBCOT is a bit-plane-based coding method. According to the order of bit planes from high to low, data in the same bit plane is extracted for encoding. This provides a progressive encoding of image quality from coarse to fine. At the same time, EBCOT also provides support for multiple truncation points, that is, in order to adapt to a certain compression rate, the code stream can be truncated at any truncation point of the JPEG 2000 code stream. Equivalent to discarding all insignificant coefficients and the less significant bits of all significant coefficients in the current bit-plane.

经过EBCOT的码流送入算术编码器，打包成Motion JPEG 2000(MJP2)码流，传送到解码端。解码端的解码器按照与上面所述相反的顺序，即，执行算术解码，逆EBCOT，逆量化，逆离散小波变换以及后处理，得到解码后的各帧图像。The code stream after EBCOT is sent to the arithmetic encoder, packaged into a Motion JPEG 2000 (MJP2) code stream, and sent to the decoder. The decoder at the decoding end performs arithmetic decoding, inverse EBCOT, inverse quantization, inverse discrete wavelet transform, and post-processing in the reverse order of the above, to obtain decoded images of each frame.

渐进可伸缩的解码模式对于视频的传输同样重要。在互联网或无线信道中，由于受信道带宽的限制，视频传输经常受到网络拥塞的影响，导致在线收看时出现画面停顿等现象。如果采用渐进模式传输MJP2码流，当遇到网络拥塞时，路由器可以主动截断每一帧的码流，以减少数据量而保证网络的畅通。此时，用户仍可以观看连续的画面，但有关图像的细节信息被丢失，这必然会导致图像质量的下降。Progressively scalable decoding schemes are equally important for video transmission. On the Internet or wireless channels, due to the limitation of channel bandwidth, video transmission is often affected by network congestion, resulting in picture pauses when watching online. If the MJP2 code stream is transmitted in progressive mode, when encountering network congestion, the router can actively cut off the code stream of each frame to reduce the amount of data and ensure the smooth flow of the network. At this time, the user can still watch continuous pictures, but the detailed information about the image is lost, which will inevitably lead to a decline in image quality.

因此，需要一种在解码端尽可能地恢复丢失的图像的细节信息的方法。Therefore, there is a need for a method for recovering the detailed information of the lost image as much as possible at the decoding end.

发明内容 Contents of the invention

本发明的目的是提供一种增强接收到的视频图像以提高图像质量的方法，该方法可以利用解码端接收到的运动图像各帧的帧间相关性在一定程度上恢复受到信息损失的帧丢失的细节，从而增强图像质量。The purpose of the present invention is to provide a method for enhancing the received video image to improve the image quality, which can utilize the inter-frame correlation of each frame of the moving image received by the decoding end to restore the frame loss caused by information loss to a certain extent details, thereby enhancing image quality.

为了实现本发明的目的，提供一种视频图像增强方法，包括步骤：建立用于对数据丢失的视频图像帧进行增强补偿的图像参考帧；将数据丢失帧划分成大小为M×N像素的数据块；利用逆离散小波变换将参考帧的频域信号转换成空域信号；In order to achieve the purpose of the present invention, a video image enhancement method is provided, comprising the steps of: establishing an image reference frame for enhancing and compensating a video image frame with data loss; dividing the data loss frame into data with a size of M×N pixels block; using the inverse discrete wavelet transform to convert the frequency domain signal of the reference frame into a spatial domain signal;

将所得到的所述空域信号在x轴，y轴以及x＝y的方向移动一个像素，并对所得到的三个移动方向的图像信号进行离散小波变换以得到参考帧经移位的频域信号；在经移位的参考帧与数据丢失帧之间做运动估算，以便在参考帧中搜索与数据丢失帧中出现数据丢失的数据块匹配的数据块；用在参考帧中搜索到的匹配数据块增强数据丢失帧中丢失的数据块。Moving the obtained spatial domain signal by one pixel in the direction of x-axis, y-axis and x=y, and performing discrete wavelet transform on the obtained image signals in three moving directions to obtain the shifted frequency domain of the reference frame signal; do motion estimation between the shifted reference frame and the data loss frame, so as to search for a data block in the reference frame that matches the data block in the data loss frame; use the match found in the reference frame Block Augmentation Data blocks lost in lost frames.

采用本发明的运动图像增强方法可以利用图像帧之间的相关性来补偿增强因网络传输拥塞等造成的图像数据丢失，改善图像质量。The moving image enhancement method of the present invention can use the correlation between image frames to compensate and enhance image data loss caused by network transmission congestion, etc., and improve image quality.

附图说明 Description of drawings

通过下面结合附图对本发明实施例的详细描述，将使本发明的上述和其它方面的目的、特征及优点变得更明显并更容易理解，需要说明的是这些实施例仅是为了说明本发明，而不是将本发明的技术方案限制于此，其中：Through the following detailed description of the embodiments of the present invention in conjunction with the accompanying drawings, the purpose, features and advantages of the above and other aspects of the present invention will become more obvious and easier to understand. It should be noted that these embodiments are only to illustrate the present invention , rather than limit the technical solution of the present invention to this, wherein:

图1是说明多级离散小波变换的示意图；Figure 1 is a schematic diagram illustrating multi-level discrete wavelet transform;

图2是说明根据本发明的实施例对视频图像进行增强处理的方法的流程图；2 is a flowchart illustrating a method for enhancing video images according to an embodiment of the present invention;

图3示出了采用根据本发明的图像增强方法对图像进行增强处理后的图像的峰值信噪比增益的示意图。Fig. 3 shows a schematic diagram of the peak signal-to-noise ratio gain of the image after the image is enhanced by the image enhancement method according to the present invention.

具体实施方式 Detailed ways

下面参考附图详细说明根据本发明利用图像的帧间相关性来增强图像的方法的实施例。An embodiment of a method for enhancing an image by utilizing inter-frame correlation of an image according to the present invention will be described in detail below with reference to the accompanying drawings.

首先，介绍JPEG 2000标准及离散小波变换。JPEG 2000作为JPEG的升级版，具有以下特点：First, the JPEG 2000 standard and discrete wavelet transform are introduced. As an upgraded version of JPEG, JPEG 2000 has the following characteristics:

1.高压缩率1. High compression ratio

JPEG 2000压缩性能比JPEG提高了大约20％，也就是说，在同样的压缩率下，JPEG 2000对图像的失真比JPEG小大约20％。同时，使用JPEG 2000的系统稳定性好，运行平稳，抗干扰性好，易于操作。The compression performance of JPEG 2000 is about 20% higher than that of JPEG, that is to say, under the same compression rate, the distortion of JPEG 2000 is about 20% smaller than that of JPEG. At the same time, the system using JPEG 2000 has good stability, smooth operation, good anti-interference and easy operation.

2.同时支持有损和无损压缩2. Support both lossy and lossless compression

JPEG只支持有损压缩，而JPEG 2000能支持无损压缩。在实际应用中，诸如卫星遥感图像、医学图像、文物照片等重要的图像都非常适合于采用JPEG 2000压缩。JPEG only supports lossy compression, while JPEG 2000 can support lossless compression. In practical applications, important images such as satellite remote sensing images, medical images, and photos of cultural relics are very suitable for JPEG 2000 compression.

3.实现了渐进传输(progressive transmission)3. Realized progressive transmission (progressive transmission)

这是JPEG 2000一个极其重要的特征，它可以先传输图像的轮廓，然后逐步传输数据，不断提高图像质量，让图像由朦胧到清晰显示，而不必像现在的JPEG那样，由上到下慢慢显示，这在网络传输中有重大意义。This is an extremely important feature of JPEG 2000. It can first transmit the outline of the image, and then gradually transmit the data to continuously improve the image quality, so that the image can be displayed from hazy to clear, instead of slowly moving from top to bottom like the current JPEG. It is shown that this has great significance in network transmission.

4.支持“感兴趣区域”(Region Of Interest(ROI))4. Support "Region Of Interest (ROI)"

用户可以任意指定图像上感兴趣区域的压缩质量，还可以选择指定的部分先解压缩，从而使重点突出。这种方法的优点在于它结合了接收方对压缩的主观需求，实现了交互式压缩。The user can arbitrarily specify the compression quality of the region of interest on the image, and can also choose the specified part to be decompressed first, so as to make the key point stand out. The advantage of this method is that it combines the receiver's subjective demand for compression and realizes interactive compression.

JPEG 2000与传统JPEG最大的不同在于它放弃了JPEG所采用的以离散余弦变换为主的区块编码方式，转而采用以小波变换为主的多分辨率编码方式。The biggest difference between JPEG 2000 and traditional JPEG is that it abandons the block coding method mainly based on discrete cosine transform adopted by JPEG, and instead adopts the multi-resolution coding method mainly based on wavelet transform.

小波变换是现代谱分析工具，它既能考察局部时域过程的频域特征，又能考察局部频域过程的时域特征，因此即使对于非平稳过程，处理起来也得心应手。它能将图像变换为一系列小波系数，这些系数可以被高效压缩和存储，此外，小波的粗略边缘可以更好地表现图像，因为它消除了DCT压缩普遍具有的方块效应。Wavelet transform is a modern spectrum analysis tool. It can not only examine the frequency domain characteristics of local time domain processes, but also examine the time domain characteristics of local frequency domain processes. Therefore, even for non-stationary processes, it is handy to deal with. It can transform the image into a series of wavelet coefficients, which can be efficiently compressed and stored. In addition, the rough edges of the wavelet can better represent the image, because it eliminates the blockiness common to DCT compression.

小波在空间和频率域上的局域性，是统计意义上的局域性。这里说的局域性，指的是一个变换系数实际牵涉到的图像空间范围是局部的。因而，要完全恢复图像中的某个局部，并不需要所有的编码都被精确地保留，只需要对应于它的一部分编码没有误差就可以了。所以，能实现感兴趣区域压缩。同时JPEG 2000支持整形小波，因而能够实现无损压缩。The locality of wavelet in space and frequency domain is locality in statistical sense. The locality mentioned here means that the image space range actually involved by a transformation coefficient is local. Therefore, in order to fully restore a certain part of the image, it is not necessary that all the codes are accurately preserved, only that a part of the codes corresponding to it has no error. Therefore, region-of-interest compression can be achieved. At the same time, JPEG 2000 supports shaping wavelet, so it can realize lossless compression.

小波建立在三个主要的基础理论之上，分别是阶层式编码(pyramidcoding)、滤波器组理论(filter bank theory)、以及子带编码(subbandcoding)，小波变换结合了此三项技术。小波变换能将各种交织在一起的不同频率组成的信号，分解成不相同频率的信号，因此能有效的应用于编码、解码、检测边缘、压缩数据，及将非线性问题线性化。良好的分析局部的时域与频域的信号，弥补傅利叶转换中的缺失。Wavelet is based on three main basic theories, which are pyramid coding, filter bank theory, and subband coding. Wavelet transform combines these three technologies. Wavelet transform can decompose signals composed of different frequencies that are intertwined together into signals of different frequencies, so it can be effectively applied to encoding, decoding, edge detection, data compression, and linearization of nonlinear problems. Good analysis of local time domain and frequency domain signals, making up for the lack of Fourier transform.

下面说明本发明的基于JPEG 2000码流增强视频图像的原理。The principle of enhancing the video image based on the JPEG 2000 code stream of the present invention is illustrated below.

Motion JPEG 2000是一种用于编码运动图像的新的国际标准。使用与JPEG 2000相同的帧内编码技术。码流被逐帧地预先打包。当出现网络拥塞时，路由器有选择地丢弃某些不重要的部分，并且只传输用于图像重构的重要部分。所谓不重要的数据包是指对人的视觉影响较小，图像分辨率较高或较清晰的数据包。高频信息的丢失将缓解网络传输的负担，但在一定程度上会降低图像的质量。Motion JPEG 2000 is a new international standard for encoding moving pictures. Uses the same intra-coding technique as JPEG 2000. The bitstream is prepackaged frame by frame. When network congestion occurs, routers selectively discard certain unimportant parts and transmit only important parts for image reconstruction. The so-called unimportant data packets refer to data packets that have less impact on human vision and have a higher or clearer image resolution. The loss of high-frequency information will ease the burden of network transmission, but it will reduce the quality of the image to a certain extent.

鉴于Motion JPEG 2000采用帧内编码技术，编码后的运动图像的各帧之间的帧间相关性依然存在，某一帧图像在传输过程中丢失的信息并不会影响到其它帧。因此，可以在解码端利用未受到图像数据损失的相关帧在一定程度上恢复丢失的数据，从而增强图像质量。Since Motion JPEG 2000 uses intra-frame coding technology, the inter-frame correlation between the frames of the encoded motion picture still exists, and the information lost during the transmission of a certain frame image will not affect other frames. Therefore, the lost data can be recovered to a certain extent by using the relevant frames that have not suffered the image data loss at the decoding end, thereby enhancing the image quality.

根据本发明的实施例，例如在通过互联网接收图像的过程中，假设当前帧出现了图像数据损失，而参考帧相对保持完好。首先在当前帧和参考帧之间做出运动估算。假设接收到的各帧数据经解码后得到帧序列Iⁿ。根据JPEG 2000的渐进传输原理，接收到的数据量越多，恢复的图像质量就越好。因此，如果在接收端检测到某一帧的数据量明显小于其它帧的数据量，则该帧中可能出现数据丢失。在这种情况下，需要对接收到的当前帧进行图像增强。According to an embodiment of the present invention, for example, in the process of receiving an image through the Internet, it is assumed that image data loss occurs in the current frame, while the reference frame remains relatively intact. Motion estimation is first made between the current frame and a reference frame. It is assumed that each frame of data received is decoded to obtain a frame sequence I ⁿ . According to the progressive transmission principle of JPEG 2000, the more data received, the better the image quality will be restored. Therefore, if it is detected at the receiving end that the amount of data in a certain frame is significantly smaller than the amount of data in other frames, data loss may occur in that frame. In this case, image enhancement needs to be performed on the received current frame.

下面结合图2说明对视频图像进行离散小波变换的增强过程。The enhancement process of performing discrete wavelet transform on video images will be described below with reference to FIG. 2 .

对图像增强的过程需要利用参考帧，这个参考帧应该是已经接收到的未受到数据损失的某一帧。为了进行图像增强，在接收端需要设置缓存器来存储用于对数据丢失的图像帧进行增强的参考帧，即，在缓存器中存储一帧未受到数据损失的图像信号。在接收过程中判断新接收的图像帧是否受到数据损失。如果新的一帧没有受到数据损失，可用新的一帧刷新缓存器。The process of image enhancement needs to use a reference frame, which should be a received frame without data loss. In order to perform image enhancement, a buffer needs to be set at the receiving end to store reference frames for enhancing image frames with data loss, that is, a frame of image signals without data loss is stored in the buffer. During the receiving process, it is judged whether the newly received image frame suffers from data loss. If the new frame does not suffer data loss, the buffer can be flushed with a new frame.

为了便于讨论，在下面的例子中假设接收的当前帧，例如，Iⁿ的图像数据受到损失。应该理解，图像数据被在网络传输过程中被截断的帧不限于当前帧。可利用，例如，缓存器存储的未受到数据损失的前一帧I^n-1帧作为参考帧。用xⁿ(i，j)表示出现数据丢失的第n帧的空域信号，经过多级的DWT变换后，子带b内的小波系数表示为y_b ⁿ(i，j)，b∈{kLL，kHL，kLH，kHH}，其中kHL(k是正整数)表示在第k层分解中对行信号进行高通滤波，对列信号进行低通滤波后间隔采样的结果。For ease of discussion, in the following examples it is assumed that the image data of the received current frame, eg ^In , is lost. It should be understood that the frame whose image data is truncated during network transmission is not limited to the current frame. For example, the previous frame In ^-1 frame stored in the buffer without data loss can be used as the reference frame. Use x ⁿ (i, j) to represent the spatial domain signal of the nth frame where data loss occurs. After multi-level DWT transformation, the wavelet coefficients in sub-band b are expressed as y _b ⁿ (i, j), b∈{kLL , kHL, kLH, kHH}, where kHL (k is a positive integer) represents the result of performing high-pass filtering on the row signal and performing low-pass filtering on the column signal in the decomposition of the k-th layer.

为了叙述方便起见，假设每帧图像都只经过一级离散小波变换(DWT)。由于传输的是运动图像，为了对受到数据损失的图像帧进行图像增强，首先需要找到参考帧与当前帧的匹配数据块。如图2所示，在LL子带内将第Iⁿ帧频域信号在划分为固定大小的M*N(M和N是正整数)个像素的数据块，逐个数据块地在I^n-1的LL子带和其经过运动偏移后的三个LL子带中进行运动搜索和匹配(具体方法将在之后详述)。如果得到匹配，则用第I^n-1帧的该数据块对第Iⁿ帧的数据进行增强。如果未得到匹配，则放弃参考帧的该数据块。For the convenience of description, it is assumed that each frame of image undergoes only one level of discrete wavelet transform (DWT). Since the transmission is a moving image, in order to perform image enhancement on an image frame subject to data loss, it is first necessary to find a matching data block between the reference frame and the current frame. As shown in Figure 2, in the LL subband, the I ^nth frame frequency domain signal is divided into data blocks of M*N (M and N are positive integers) pixels of fixed size, and the data blocks are divided into I ^n-1 data blocks one by one. The motion search and matching are performed in the LL sub-bands and the three LL sub-bands after motion offset (the specific method will be described in detail later). If a match is obtained, the data of the I nth frame is enhanced with the data block of the I ⁿ ^-1th frame. If no match is found, the data block of the reference frame is discarded.

由于图像的运动性，参考帧与受到损失的当前帧之间的相关数据块之间存在着一定的偏移。这种情况下，为了找到匹配的数据块进行图像增强，需要移动图像帧的像素。在本实施例中，搜索相关数据块的过程是通过移动参考帧的数据块进行的。Due to the motion of the image, there is a certain offset between the relevant data blocks between the reference frame and the lost current frame. In this case, in order to find a matching data block for image enhancement, it is necessary to shift the pixels of the image frame. In this embodiment, the process of searching for relevant data blocks is performed by moving the data blocks of the reference frame.

为了搜索对应的数据块，对第I^n-1帧的频域信号进行逆离散小波变换(IDWT)，以便得到第I^n-1帧的空域信号x^n-1(i，j)。接下来，将空域信号x^n-1(i，j)沿x轴，y轴以及x＝y方向各移动一个像素，得到三幅图像x^n-1(i-1，j)、x^n-1(i，j-1)、和x^n-1(i-1，j-1)。分别对这三幅图像进行DWT变换，得到三个经过移位的第I^n-1帧的频域信号。将转换后的信号表示为y_b(1) ^n-1(i，j)、y_b(2) ^n-1(i，j)、和y_b(3) ^n-1(i，j)，而 $y_{b (0)}^{n - 1} (i, j) = y_{b}^{n - 1} (i, j) .$ 之所以进行这样的移动是因为图像在DWT变换之后要进行降采样，如果一个数据块在两帧之间移动奇数个像素，则降采样之后在两帧之间保留下来的样本互为补集。在x、y和x＝y方向各移动一个像素，再加上图像自身就包括了数据块在平面的两个方向上所有运动的奇偶组合。In order to search for the corresponding data block, an inverse discrete wavelet transform (IDWT) is performed on the frequency domain signal of the I ^n-1th frame, so as to obtain the spatial domain signal x ^n-1 (i, j) of the I ^n-1th frame. Next, move the spatial domain signal x ^n-1 (i, j) by one pixel along the x-axis, y-axis and x=y direction to obtain three images x ^n-1 (i-1, j), x ^{n- 1} (i, j-1), and x ^n-1 (i-1, j-1). Perform DWT transformation on the three images respectively to obtain three shifted frequency domain signals of the In ^-1th frame. Denote the transformed signals as y _b(1) ^n-1 (i, j), y _b(2) ^n-1 (i, j), and y _b(3) ^n-1 (i, j), and ${the y}_{b (0)}^{no - 1} (i, j) = {the y}_{b}^{no - 1} (i, j) .$ The reason for such movement is that the image needs to be down-sampled after DWT transformation. If a data block moves an odd number of pixels between two frames, the samples retained between the two frames after down-sampling are complementary sets. Move one pixel each in the x, y and x=y directions, plus the image itself includes the odd and even combinations of all movements of the data block in both directions of the plane.

将接收到的当前帧的LL子带内的频域信号y_LL ⁿ(i，j)划分为固定大小的数据块，例如8*8。然后逐个在参考帧的四个LL子带y_LL(0) ^n-1(i，j)、y_LL(1) ^n-1(i，j)、y_LL(2) ^n-1(i，j)、y_LL(3) ^n-1(i，j)中搜索与当前帧的该每个数据块最相似的数据块。Divide the received frequency domain signal y _LL ⁿ (i, j) in the LL subband of the current frame into data blocks of fixed size, for example, 8*8. Then one by one in the four LL subbands y _LL(0) ^n-1 (i, j), y _LL(1) ^n-1 (i, j), y _LL(2) ^n-1 (i, j) of the reference frame j), y _LL(3) ^n-1 (i, j) to search for the data block most similar to each data block of the current frame.

搜索过程可采用三步法，判断两个数据块是否相似可以采用最大绝对距离(MAD)法则，即计算当前帧与移位的参考帧的对应子带的频域信号之间的差值的绝对值之和，即，计算 $\underset{(i, j) &Element; block}{Σ} | y_{LL}^{n} (i, j) - y_{LL (k)}^{n - 1} (i + di, j + dj) |,$ 找到使该取最小值时的(di，dj，k)。将该最小值与预先设定的阈值比较。如果该最小值大于设定的阈值，则表明两个数据块不匹配。如果该最小值小于或等于设定的阈值，则将该(di，dj，k)作为运动矢量记录下来。这种运动补偿方法被称为低带移位法(Low-Band-Shift)，其特点是可以在高频子带的信息丢失较多时仍能得到精确的运动矢量。The search process can use a three-step method, and the maximum absolute distance (MAD) rule can be used to determine whether two data blocks are similar, that is, the absolute value of the difference between the current frame and the frequency domain signal of the corresponding subband of the shifted reference frame is calculated. The sum of the values, i.e., computes $\underset{(i, j) &Element; block}{Σ} | {the y}_{LL}^{no} (i, j) - {the y}_{LL (k)}^{no - 1} (i + di, j + dj) |,$ Find (di, dj, k) that minimizes this. This minimum value is compared with a preset threshold. If the minimum value is greater than the set threshold, it indicates that the two data blocks do not match. If the minimum value is less than or equal to the set threshold, the (di, dj, k) is recorded as a motion vector. This motion compensation method is called low-band-shift method (Low-Band-Shift), which is characterized in that accurate motion vectors can still be obtained when high-frequency sub-band information is lost.

在参考帧中找到了与当前帧的某个小波系数y_LL ⁿ(i，j)对应的运动矢量(di，dj，k)后，其在参考帧中对应的小波系数则是y_LL(k) ^n-1(i+di，j+dj)。此后，可对当前帧中损失数据进行增强。作为例子，例如，如果在传输过程中y_LL ⁿ(i，j)丢失了当前帧的p个最低有效位，在接收端实际收到的则是

此时可以得到下面的公式(1)表示的未经损失的y_LL ⁿ(i，j)的分布范围：After the motion vector (di, dj, k) corresponding to a certain wavelet coefficient y _LL ⁿ (i, j) of the current frame is found in the reference frame, its corresponding wavelet coefficient in the reference frame is y _{LL(k )} ^n-1 (i+di, j+dj). Thereafter, the lost data in the current frame can be enhanced. As an example, for example, if y _LL ⁿ (i, j) loses the p least significant bits of the current frame during transmission, what is actually received at the receiver is

At this time, the distribution range of y _LL ⁿ (i, j) without loss expressed by the following formula (1) can be obtained:

$\{\begin{matrix} {\overset{\cdot &Center Dot;}{y the y}}_{LL LL}^{n no} ((i i,, j j)) \leq \leq {y the y}_{LL LL}^{n no} ((i i,, j j)) < < {\overset{\cdot &Center Dot;}{y the y}}_{LL LL}^{n no} ((i i,, j j)) + + 22^{p p} & ((sign sign (({\overset{\cdot &Center Dot;}{y the y}}_{LL LL}^{n no} ((i i,, j j)))) > > 00)) \\ {\overset{\cdot &Center Dot;}{y the y}}_{LL LL}^{n no} ((i i,, j j)) - - 22^{p p} < < {y the y}_{LL LL}^{n no} ((i i,, j j)) \leq \leq {\overset{\cdot \cdot}{y the y}}_{LL LL}^{n no} ((i i,, j j)) & ((sign sign (({\overset{\cdot \cdot}{y the y}}_{LL LL}^{n no} ((i i,, j j)))) < < 00)) \\ - - 22^{p p} < < {y the y}_{LL LL}^{n no} ((i i,, j j)) < < 22^{p p} & ((sign sign (({\overset{\cdot \cdot}{y the y}}_{LL LL}^{n no} ((i i,, j j)))) = = 00)) \end{matrix} - - - - - - ((11))$

用在上式所列范围内与y_LL(k ^n-1(i+di，j+dj)最接近的值作为对受到损失的当前帧的小波系数

的增强结果。其具体操作如下：Use the value closest to y _LL(k ^n-1 (i+di, j+dj) within the range listed in the above formula as the wavelet coefficient of the current frame subject to loss

enhanced results. Its specific operation is as follows:

如果参考帧中的小波系数y_LL(k ^n-1(i+di，j+dj)中除最后p个最低有效位外均与

相同，即：

则可以令

作为

的增强结果。也就是说，用y_LL(k ^n-1(i+di，j+dj)的最后p个最低有效位来补偿受到损失的当前帧的

If the wavelet coefficients y _LL(k ^n-1 (i+di, j+dj) in the reference frame are all the same as

same, ie:

then you can make

as

enhanced results. That is, use the last p least significant bits of y _LL(k ^n-1 (i+di, j+dj) to compensate for the loss of the current frame

如果参考帧中的小波系数y_LL(k ^n-1(i+di，j+dj)中除最后p个最低有效位外仍与有不同，则比较公式(1)中的不等式确定的区间的两个端点。在这两个端点中选择距y_LL(k ^n-1(i+di，j+dj)较近的一个点作为对当前帧的增强结果。If the wavelet coefficient y _LL(k ^n-1 (i+di, j+dj) in the reference frame is still consistent with If there is a difference, compare the two endpoints of the interval determined by the inequality in formula (1). Select a point closer to y _LL(k ^n-1 (i+di, j+dj) from these two endpoints as the enhancement result for the current frame.

在最坏的情况下，当前帧的y_LL ⁿ(i，j)丢失了其所有的有效位(例如，整个子带内的数据都被丢失)，这种情况下，则直接令当前帧中的 ${\overset{\cdot \cdot}{y}}_{LL}^{n} (i, j) = y_{LL (k}^{n - 1} (i + di, j + dj) .$ In the worst case, y _LL ⁿ (i, j) of the current frame loses all its valid bits (for example, the data in the entire subband is lost), in this case, directly make the current frame of ${\overset{&Center Dot; &Center Dot;}{the y}}_{LL}^{no} (i, j) = {the y}_{LL (k}^{no - 1} (i + di, j + dj) .$

根据离散小波变换的性质，对于图像的三个高频子带HL、LH和HH，不需要再到前一帧中做运动估算和查找对应的数据块。如果某个小波系数y_LL ⁿ(i，j)对应运动矢量(di，dj，k)，则对于所有的子带b，则有y_b ⁿ(i，j)对应的运动矢量都是(di，dj，k)。这种情况下下，可以用参考帧中的系数y_b(k) ^n-1(i+di，j+dj)作为参考量来增强丢失数据的当前帧的系数y_b ⁿ(i，j)。According to the nature of the discrete wavelet transform, for the three high-frequency sub-bands HL, LH and HH of the image, there is no need to go to the previous frame for motion estimation and search for corresponding data blocks. If a wavelet coefficient y _LL ⁿ (i, j) corresponds to the motion vector (di, dj, k), then for all subbands b, the motion vectors corresponding to y _b ⁿ (i, j) are (di , dj, k). In this case, the coefficient y _b(k) ^n-1 (i+di, j+dj) in the reference frame can be used as a reference to enhance the coefficient y _b ⁿ (i, j) of the current frame of the missing data .

经过对丢失数据的当前帧的小波系数进行增强处理之后，对增强后的小波系数进行逆离散小波变换和诸如直流复原，颜色空间逆变换等之类的后处理，就可以在接收端恢复出视频图像经过增强后的重建图像。After the wavelet coefficients of the current frame of the lost data are enhanced, the enhanced wavelet coefficients are subjected to inverse discrete wavelet transform and post-processing such as DC restoration, color space inverse transformation, etc., and the video can be restored at the receiving end. The reconstructed image after the image has been enhanced.

上面为了简单起见以一级DWT为例说明了利用图像的帧间相关性对视频图像进行补偿增强的方法。一般情况下，一帧图像在编码之前要经过3-5级的DWT变换。上述算法可以推广到m级的DWT变换。其过程如下：For the sake of simplicity, the first-level DWT is taken as an example to illustrate the method of compensating and enhancing the video image by using the inter-frame correlation of the image. Generally, a frame of image needs to undergo 3-5 levels of DWT transformation before encoding. The above algorithm can be extended to m-level DWT transformation. The process is as follows:

1.按照一级DWT变换的方法，对Iⁿ的LL_m，HL_m，LH_m，HH_m(m是整数)子带进行增强。1. According to the method of one-stage DWT transformation, the LL _m , HL _m , LH _m , HH _m (m is an integer) subbands of ^In are enhanced.

2.对增强后的四个子带进行逆DWT变换，以得到第m级DWT变换之前的低频子带LL_m-1。2. Inverse DWT transform is performed on the four enhanced sub-bands to obtain the low-frequency sub-band LL _m-1 before the m-th DWT transform.

3.LL_m-1是一个增强后的子带，在此基础上可对m-1级的其它三个子带HL_m-1，LH_m-1，HH_m-1进行补偿增强。其增强方法与前面说明的过程相似。另外，应指出的是，可以用上一级的运动矢量(di，dj，k)_m来加快对本级运动矢量的搜索。3. LL _m-1 is an enhanced sub-band, and on this basis, compensation and enhancement can be performed on the other three sub-bands HL _m-1 , LH _m-1 , and HH _m-1 of level m-1. Its enhancement method is similar to the process explained earlier. In addition, it should be pointed out that the motion vector (di, dj, k) _m of the previous stage can be used to speed up the search for the motion vector of the present stage.

4.由此如上所述的方式按照分辨率由低到高的顺序逐级对受到数据丢失的图像帧进行增强，直到整个处理过程结束。4. Thus, in the manner as described above, image frames subject to data loss are enhanced step by step in order of resolution from low to high, until the entire processing process ends.

图3示出了采用根据本发明的图像增强方法对图像进行增强处理后的图像的峰值信噪比增益。该实例对MPEG-2标准测试图像“运动物体和日历”进行了测试。所采用的参数如下：Fig. 3 shows the peak signal-to-noise ratio gain of the image after the image is enhanced by the image enhancement method according to the present invention. This example tests the MPEG-2 standard test image "moving object and calendar". The parameters used are as follows:

视频序列：运动物体和日历，选取300帧图像，352*288像素/帧，图像采用逐行扫描方式。Video sequence: moving objects and calendars, select 300 frames of images, 352*288 pixels/frame, and the images adopt progressive scanning.

小波基：采用JPEG 2000 Part 1提供的9/7实型滤波器组；运动补偿采用低频带移位(Low-Band-Shift)。Wavelet base: use the 9/7 real-type filter bank provided by JPEG 2000 Part 1; motion compensation uses low-band shift (Low-Band-Shift).

压缩率为2.0bpp。其中对奇数帧进行了截断处理，截断后的码流相当于1.0bpp。通过采用本发明的方法对截断的奇数帧进行增强处理后，可得到平均为3.12dB的峰值信噪比。图3中的横坐标表示帧序号，纵坐标表示峰值信噪比。图中的空白方框连成的曲线表示经过图像增强后的峰值信噪比，黑方块连成的曲线表示原始图像的峰值信噪比。从图3中可以明显看出，经过增强处理的图像比处理前的图像质量由了明显改善。The compression ratio is 2.0bpp. The odd frames are truncated, and the truncated code stream is equivalent to 1.0bpp. After enhancing the truncated odd frames by using the method of the present invention, an average peak signal-to-noise ratio of 3.12dB can be obtained. The abscissa in FIG. 3 represents the frame number, and the ordinate represents the peak signal-to-noise ratio. The curve formed by the blank squares in the figure represents the peak signal-to-noise ratio after image enhancement, and the curve formed by the black squares represents the peak signal-to-noise ratio of the original image. It can be clearly seen from Figure 3 that the quality of the enhanced image is significantly improved compared with the image before processing.

至此已结合特定实施例描述了本发明，应该指出，在此所作的描述只作为说明的目的，在不脱离下面权利要求的精神或范围的情况下可对本发明进行改变和变化。Having thus far described the invention in connection with specific embodiments, it should be noted that the description herein is for illustrative purposes only and that changes and variations may be made without departing from the spirit or scope of the following claims.

Claims

1. video image enhancing method comprises step:

Foundation is used for the video frame image that data are lost is strengthened the image reference frame of compensation;

The loss of data frame is divided into size is the data block of M * N pixel, wherein M and N are positive integer;

Utilize inverse discrete wavelet transform to convert the frequency-region signal of reference frame to the spatial domain signal;

At the x axle, the direction of y axle and x=y moves a pixel with resulting described spatial domain signal, and the picture signal of resulting three moving directions is carried out wavelet transform to obtain the frequency-region signal of reference frame through displacement;

In the estimation of between reference frame and the loss of data frame of displacement, taking exercises, so that in reference frame, occur the data block that the data block of loss of data is mated in search and the loss of data frame;

Be used in the matched data piece that searches in the reference frame and strengthen the data block of losing in the loss of data frame.

2. video image enhancing method according to claim 1 is characterized in that further being included in and sets up after the described image reference frame, with the step of described reference frame storing in memory.

3. video image enhancing method according to claim 1 and 2 is characterized in that described step of taking exercises estimation between reference frame and loss of data frame is that the frequency-region signal of reference frame and the frequency-region signal of loss of data frame are carried out motion search and coupling.

4. video image enhancing method according to claim 1 is characterized in that at the video image transmitting terminal carrying out intraframe coding through the moving image frame of one-level wavelet transform at least.

5. video image enhancing method according to claim 1, it is characterized in that further comprising the absolute value that calculates through the difference between the frequency-region signal of the frequency-region signal of the mobile image of the reference frame of wavelet transform and loss of data frame, and with the minimum value that obtains and the step of pre-set threshold comparison.

6. video image enhancing method according to claim 5 is characterized in that not matching if described minimum value, then shows two data blocks of reference frame and loss of data frame greater than preset threshold; If this minimum value is less than or equal to preset threshold, then writes down the motion vector of described reference frame displacement and strengthen the data block of described loss of data frame with wavelet coefficient corresponding in the described reference frame with this motion vector.

7. video image enhancing method according to claim 6, it is characterized in that if having only p least significant bit in the wavelet coefficient in the reference frame different with the least significant bit of the wavelet coefficient of loss of data frame, then strengthen the least significant bit of the wavelet coefficient of loss of data frame with the least significant bit in the wavelet coefficient of reference frame, otherwise in the dynamic range that allows with the wavelet coefficient of loss of data frame and the immediate numerical value of reference frame strengthen the wavelet coefficient of described loss of data frame.

8. according to claim 1,2,4, the described video image enhancing method of in 5 to 7 any one, it is characterized in that further being included in the wavelet coefficient after the enhancing of the described loss of data frame of obliterated data is carried out inverse discrete wavelet transform and direct current restores, the color space inverse transformation is to recover the step of video image through the reconstructed image after strengthening.