CN100477789C - A method of inter-frame image enhancement based on Joint Motion Photographic Experts Group 2000 - Google Patents
A method of inter-frame image enhancement based on Joint Motion Photographic Experts Group 2000 Download PDFInfo
- Publication number
- CN100477789C CN100477789C CN 03149199 CN03149199A CN100477789C CN 100477789 C CN100477789 C CN 100477789C CN 03149199 CN03149199 CN 03149199 CN 03149199 A CN03149199 A CN 03149199A CN 100477789 C CN100477789 C CN 100477789C
- Authority
- CN
- China
- Prior art keywords
- data
- frame
- loss
- image
- reference frame
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 52
- 230000033001 locomotion Effects 0.000 title claims abstract description 29
- 230000002708 enhancing effect Effects 0.000 claims description 19
- 230000009466 transformation Effects 0.000 claims description 12
- 239000013598 vector Substances 0.000 claims description 9
- 238000006073 displacement reaction Methods 0.000 claims 3
- 230000008878 coupling Effects 0.000 claims 1
- 238000010168 coupling process Methods 0.000 claims 1
- 238000005859 coupling reaction Methods 0.000 claims 1
- 238000005728 strengthening Methods 0.000 claims 1
- 230000005540 biological transmission Effects 0.000 abstract description 20
- 238000007906 compression Methods 0.000 description 21
- 230000006835 compression Effects 0.000 description 21
- 230000008569 process Effects 0.000 description 12
- 230000000750 progressive effect Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 6
- 238000012545 processing Methods 0.000 description 5
- 230000008901 benefit Effects 0.000 description 3
- 238000000354 decomposition reaction Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 238000013139 quantization Methods 0.000 description 3
- 238000012805 post-processing Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000003416 augmentation Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000013144 data compression Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000003708 edge detection Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 230000005654 stationary process Effects 0.000 description 1
Images
Landscapes
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
技术领域 technical field
本发明涉及一种视频图像帧间增强方法,特别是一种对基于运动联合图像专家组2000(Motion JPEG 2000)压缩的视频图像进行帧间增强的方法,该方法能够在接收端尽可能地恢复丢失的视频图像的细节。The present invention relates to a method for inter-frame enhancement of video images, in particular to a method for inter-frame enhancement of video images compressed based on Motion JPEG 2000, which can recover as much as possible at the receiving end Lost video image detail.
背景技术 Background technique
随着多媒体应用的普及、数字视频技术的发展及网络上图像传输的增多,例如可视电话的使用,图像的远程浏览与检索,全球多用户虚拟环境共享等,对图像处理技术的研究变得越来越重要。With the popularization of multimedia applications, the development of digital video technology and the increase of image transmission on the network, such as the use of videophone, remote browsing and retrieval of images, global multi-user virtual environment sharing, etc., the research on image processing technology has become more and more important. more and more important.
图像的处理包括图像压缩,图像增强和恢复,图像除噪以及图像的描述,分类和识别。对于图像采集、存储和传输过程来说,迫切需要快速发展图像压缩技术。一幅具有中等分辨率(640*480)彩色(24bit/象素)数字图像的数据量约为7.37Mbit,庞大的数据量给图像的存储和传输带来了很大的困难。因此,在传输前必须对图像进行编码。Image processing includes image compression, image enhancement and restoration, image denoising, and image description, classification, and recognition. For the process of image acquisition, storage and transmission, it is urgent to develop image compression technology rapidly. The data volume of a color (24bit/pixel) digital image with medium resolution (640*480) is about 7.37Mbit. The huge data volume brings great difficulties to the image storage and transmission. Therefore, images must be encoded before transmission.
传统的JPEG压缩技术已经无法满足人们对多媒体影像资料的要求,因此需要一种功能更强大、效率更卓越的图像压缩标准。在这种背景下JPEG 2000应运而生。The traditional JPEG compression technology has been unable to meet people's requirements for multimedia image data, so a more powerful and efficient image compression standard is needed. In this context, JPEG 2000 came into being.
运动联合图像专家组2000(Motion JPEG 2000)是一种对运动图像的帧内编码方法。它并不利用帧间相关性,而是使用JPEG 2000提供的压缩手段逐帧去除视频序列中每一帧的帧内冗余。Motion JPEG 2000 (Motion JPEG 2000) is an intra-frame coding method for moving images. It does not use inter-frame correlation, but uses the compression method provided by JPEG 2000 to remove the intra-frame redundancy of each frame in the video sequence frame by frame.
随着时间的推移,人们对图像压缩处理的要求越来越高。另外,基于互联网和无线信道的视频传输已经越来越普遍。在实际应用中,有一些重要的图像,如卫星遥感图像、医学图像等需要通过互联网来传送。而这些图像传输要求确保较高的图像质量。As time goes by, people have higher and higher requirements for image compression processing. In addition, video transmission based on the Internet and wireless channels has become more and more common. In practical applications, some important images, such as satellite remote sensing images and medical images, need to be transmitted through the Internet. These image transmissions require high image quality.
然而,由于受到信道宽度的限制,通过互联网或无线信道进行的视频传输通常受到网络拥塞的影响。当网络出现拥塞时,路由器有选择地截断一些表示详细信息的数据分组,这将导致信息被截断的帧的图像质量的降低。However, video transmission over the Internet or wireless channels is often affected by network congestion due to the limitation of channel width. When the network is congested, the router selectively truncates some data packets representing detailed information, which will result in a reduction in the image quality of the truncated frames.
在JPEG 2000中,空域信号在经过诸如彩色空间变化,去直流,归一化之类的前处理后,进行离散小波变换(DWT),以去除数据之间的相关性。图1中示出了一维离散小波变换分解的示意图。一维DWT的基本滤波方式是通过高频和低频滤波器组,将原始信号分解到频域上两个互不混叠的子带上,并通过降采样来保持信号的数目不变。一次DWT后剩余的低频信号往往还具有相关性。可以在其上再做k(k是整数)级子带分解,直到信号之间的相关性可以忽略为止。In JPEG 2000, the spatial domain signal is subjected to discrete wavelet transform (DWT) after pre-processing such as color space change, DC removal, and normalization to remove the correlation between data. Fig. 1 shows a schematic diagram of one-dimensional discrete wavelet transform decomposition. The basic filtering method of one-dimensional DWT is to decompose the original signal into two non-aliasing sub-bands in the frequency domain through high-frequency and low-frequency filter banks, and keep the number of signals unchanged by down-sampling. The remaining low-frequency signals after a DWT are often still relevant. Further k (k is an integer) sub-band decomposition can be performed on it until the correlation between signals can be ignored.
二维DWT则是在一维的行DWT和降采样的基础上,再对列信号进行一次DWT,并进行降采样以保证变换后的数据量与变换前相同。经过一次二维DWT变换之后,原图像被划分为四个子带。其中唯一的一个LL子带(行低频和列低频)仍可以继续分解,经过3-5级后形成分级结构。这种分级结构使JPEG 2000支持多分辨率的码流结构。The two-dimensional DWT is based on the one-dimensional row DWT and downsampling, and then performs a DWT on the column signal, and performs downsampling to ensure that the amount of data after the transformation is the same as before the transformation. After a two-dimensional DWT transformation, the original image is divided into four subbands. The only LL sub-band (row low frequency and column low frequency) can still be decomposed, and a hierarchical structure is formed after 3-5 levels. This hierarchical structure enables JPEG 2000 to support multi-resolution stream structures.
DWT之后的小波系数需要经过量化,JPEG 2000的量化器的一个特殊之处在于引入了“Dead Zone”的概念。对于每个子带的量化器,“DeadZone”的宽度都是其它步长的2倍。这意味着如果在子带b内采用步长为Δb的量化器,则该子带的一个系数y(i,j)可以用作为量化索引值,其中sign()表示一个变量的符号,表示向下取整。这种量化方式的优点是可以根据信噪比要求,提供一种渐进的码率模式,即可以先用较小的步长对信号进行量化。然后根据用户的不同需求以较宽的步长(一般是原始步长的2的整数次幂倍)进行反量化。此时,只需丢弃量化索引值的一些最低有效位(LSB)即可。The wavelet coefficients after DWT need to be quantized. A special feature of the quantizer of JPEG 2000 is the introduction of the concept of "Dead Zone". For each subband quantizer, the "DeadZone" is twice as wide as the other steps. This means that if a quantizer with a step size Δb is used in subband b, one coefficient y(i,j) of this subband can be given by As a quantized index value, where sign() represents the sign of a variable, Indicates rounding down. The advantage of this quantization method is that it can provide a progressive code rate mode according to the requirements of the signal-to-noise ratio, that is, the signal can be quantized with a smaller step size first. Then dequantize with a wider step size (generally an integer power of 2 of the original step size) according to different needs of users. At this point, it is only necessary to discard some least significant bits (LSBs) of the quantization index value.
然后需要对经过量化的小波系数进行优化截断点的嵌入式块编码(Embedded Block Coding with Optimal Truncation)(EBCOT)。EBCOT是一种基于位平面(bit plain)的编码方式。按照位面从高到低的顺序,提取处于相同位平面的数据进行编码。这样就提供了图像质量由粗到精的渐进式编码。同时,EBCOT还提供了对多截断点的支持,即,为了适应一定的压缩码率,可以在JPEG 2000码流的任何一个截断点截断该码流。相当于丢弃当前位平面中所有不显著的系数以及所有显著系数的较低有效位。Then it is necessary to perform Embedded Block Coding with Optimal Truncation (EBCOT) on the quantized wavelet coefficients. EBCOT is a bit-plane-based coding method. According to the order of bit planes from high to low, data in the same bit plane is extracted for encoding. This provides a progressive encoding of image quality from coarse to fine. At the same time, EBCOT also provides support for multiple truncation points, that is, in order to adapt to a certain compression rate, the code stream can be truncated at any truncation point of the JPEG 2000 code stream. Equivalent to discarding all insignificant coefficients and the less significant bits of all significant coefficients in the current bit-plane.
经过EBCOT的码流送入算术编码器,打包成Motion JPEG 2000(MJP2)码流,传送到解码端。解码端的解码器按照与上面所述相反的顺序,即,执行算术解码,逆EBCOT,逆量化,逆离散小波变换以及后处理,得到解码后的各帧图像。The code stream after EBCOT is sent to the arithmetic encoder, packaged into a Motion JPEG 2000 (MJP2) code stream, and sent to the decoder. The decoder at the decoding end performs arithmetic decoding, inverse EBCOT, inverse quantization, inverse discrete wavelet transform, and post-processing in the reverse order of the above, to obtain decoded images of each frame.
渐进可伸缩的解码模式对于视频的传输同样重要。在互联网或无线信道中,由于受信道带宽的限制,视频传输经常受到网络拥塞的影响,导致在线收看时出现画面停顿等现象。如果采用渐进模式传输MJP2码流,当遇到网络拥塞时,路由器可以主动截断每一帧的码流,以减少数据量而保证网络的畅通。此时,用户仍可以观看连续的画面,但有关图像的细节信息被丢失,这必然会导致图像质量的下降。Progressively scalable decoding schemes are equally important for video transmission. On the Internet or wireless channels, due to the limitation of channel bandwidth, video transmission is often affected by network congestion, resulting in picture pauses when watching online. If the MJP2 code stream is transmitted in progressive mode, when encountering network congestion, the router can actively cut off the code stream of each frame to reduce the amount of data and ensure the smooth flow of the network. At this time, the user can still watch continuous pictures, but the detailed information about the image is lost, which will inevitably lead to a decline in image quality.
因此,需要一种在解码端尽可能地恢复丢失的图像的细节信息的方法。Therefore, there is a need for a method for recovering the detailed information of the lost image as much as possible at the decoding end.
发明内容 Contents of the invention
本发明的目的是提供一种增强接收到的视频图像以提高图像质量的方法,该方法可以利用解码端接收到的运动图像各帧的帧间相关性在一定程度上恢复受到信息损失的帧丢失的细节,从而增强图像质量。The purpose of the present invention is to provide a method for enhancing the received video image to improve the image quality, which can utilize the inter-frame correlation of each frame of the moving image received by the decoding end to restore the frame loss caused by information loss to a certain extent details, thereby enhancing image quality.
为了实现本发明的目的,提供一种视频图像增强方法,包括步骤:建立用于对数据丢失的视频图像帧进行增强补偿的图像参考帧;将数据丢失帧划分成大小为M×N像素的数据块;利用逆离散小波变换将参考帧的频域信号转换成空域信号;In order to achieve the purpose of the present invention, a video image enhancement method is provided, comprising the steps of: establishing an image reference frame for enhancing and compensating a video image frame with data loss; dividing the data loss frame into data with a size of M×N pixels block; using the inverse discrete wavelet transform to convert the frequency domain signal of the reference frame into a spatial domain signal;
将所得到的所述空域信号在x轴,y轴以及x=y的方向移动一个像素,并对所得到的三个移动方向的图像信号进行离散小波变换以得到参考帧经移位的频域信号;在经移位的参考帧与数据丢失帧之间做运动估算,以便在参考帧中搜索与数据丢失帧中出现数据丢失的数据块匹配的数据块;用在参考帧中搜索到的匹配数据块增强数据丢失帧中丢失的数据块。Moving the obtained spatial domain signal by one pixel in the direction of x-axis, y-axis and x=y, and performing discrete wavelet transform on the obtained image signals in three moving directions to obtain the shifted frequency domain of the reference frame signal; do motion estimation between the shifted reference frame and the data loss frame, so as to search for a data block in the reference frame that matches the data block in the data loss frame; use the match found in the reference frame Block Augmentation Data blocks lost in lost frames.
采用本发明的运动图像增强方法可以利用图像帧之间的相关性来补偿增强因网络传输拥塞等造成的图像数据丢失,改善图像质量。The moving image enhancement method of the present invention can use the correlation between image frames to compensate and enhance image data loss caused by network transmission congestion, etc., and improve image quality.
附图说明 Description of drawings
通过下面结合附图对本发明实施例的详细描述,将使本发明的上述和其它方面的目的、特征及优点变得更明显并更容易理解,需要说明的是这些实施例仅是为了说明本发明,而不是将本发明的技术方案限制于此,其中:Through the following detailed description of the embodiments of the present invention in conjunction with the accompanying drawings, the purpose, features and advantages of the above and other aspects of the present invention will become more obvious and easier to understand. It should be noted that these embodiments are only to illustrate the present invention , rather than limit the technical solution of the present invention to this, wherein:
图1是说明多级离散小波变换的示意图;Figure 1 is a schematic diagram illustrating multi-level discrete wavelet transform;
图2是说明根据本发明的实施例对视频图像进行增强处理的方法的流程图;2 is a flowchart illustrating a method for enhancing video images according to an embodiment of the present invention;
图3示出了采用根据本发明的图像增强方法对图像进行增强处理后的图像的峰值信噪比增益的示意图。Fig. 3 shows a schematic diagram of the peak signal-to-noise ratio gain of the image after the image is enhanced by the image enhancement method according to the present invention.
具体实施方式 Detailed ways
下面参考附图详细说明根据本发明利用图像的帧间相关性来增强图像的方法的实施例。An embodiment of a method for enhancing an image by utilizing inter-frame correlation of an image according to the present invention will be described in detail below with reference to the accompanying drawings.
首先,介绍JPEG 2000标准及离散小波变换。JPEG 2000作为JPEG的升级版,具有以下特点:First, the JPEG 2000 standard and discrete wavelet transform are introduced. As an upgraded version of JPEG, JPEG 2000 has the following characteristics:
1.高压缩率1. High compression ratio
JPEG 2000压缩性能比JPEG提高了大约20%,也就是说,在同样的压缩率下,JPEG 2000对图像的失真比JPEG小大约20%。同时,使用JPEG 2000的系统稳定性好,运行平稳,抗干扰性好,易于操作。The compression performance of JPEG 2000 is about 20% higher than that of JPEG, that is to say, under the same compression rate, the distortion of JPEG 2000 is about 20% smaller than that of JPEG. At the same time, the system using JPEG 2000 has good stability, smooth operation, good anti-interference and easy operation.
2.同时支持有损和无损压缩2. Support both lossy and lossless compression
JPEG只支持有损压缩,而JPEG 2000能支持无损压缩。在实际应用中,诸如卫星遥感图像、医学图像、文物照片等重要的图像都非常适合于采用JPEG 2000压缩。JPEG only supports lossy compression, while JPEG 2000 can support lossless compression. In practical applications, important images such as satellite remote sensing images, medical images, and photos of cultural relics are very suitable for JPEG 2000 compression.
3.实现了渐进传输(progressive transmission)3. Realized progressive transmission (progressive transmission)
这是JPEG 2000一个极其重要的特征,它可以先传输图像的轮廓,然后逐步传输数据,不断提高图像质量,让图像由朦胧到清晰显示,而不必像现在的JPEG那样,由上到下慢慢显示,这在网络传输中有重大意义。This is an extremely important feature of JPEG 2000. It can first transmit the outline of the image, and then gradually transmit the data to continuously improve the image quality, so that the image can be displayed from hazy to clear, instead of slowly moving from top to bottom like the current JPEG. It is shown that this has great significance in network transmission.
4.支持“感兴趣区域”(Region Of Interest(ROI))4. Support "Region Of Interest (ROI)"
用户可以任意指定图像上感兴趣区域的压缩质量,还可以选择指定的部分先解压缩,从而使重点突出。这种方法的优点在于它结合了接收方对压缩的主观需求,实现了交互式压缩。The user can arbitrarily specify the compression quality of the region of interest on the image, and can also choose the specified part to be decompressed first, so as to make the key point stand out. The advantage of this method is that it combines the receiver's subjective demand for compression and realizes interactive compression.
JPEG 2000与传统JPEG最大的不同在于它放弃了JPEG所采用的以离散余弦变换为主的区块编码方式,转而采用以小波变换为主的多分辨率编码方式。The biggest difference between JPEG 2000 and traditional JPEG is that it abandons the block coding method mainly based on discrete cosine transform adopted by JPEG, and instead adopts the multi-resolution coding method mainly based on wavelet transform.
小波变换是现代谱分析工具,它既能考察局部时域过程的频域特征,又能考察局部频域过程的时域特征,因此即使对于非平稳过程,处理起来也得心应手。它能将图像变换为一系列小波系数,这些系数可以被高效压缩和存储,此外,小波的粗略边缘可以更好地表现图像,因为它消除了DCT压缩普遍具有的方块效应。Wavelet transform is a modern spectrum analysis tool. It can not only examine the frequency domain characteristics of local time domain processes, but also examine the time domain characteristics of local frequency domain processes. Therefore, even for non-stationary processes, it is handy to deal with. It can transform the image into a series of wavelet coefficients, which can be efficiently compressed and stored. In addition, the rough edges of the wavelet can better represent the image, because it eliminates the blockiness common to DCT compression.
小波在空间和频率域上的局域性,是统计意义上的局域性。这里说的局域性,指的是一个变换系数实际牵涉到的图像空间范围是局部的。因而,要完全恢复图像中的某个局部,并不需要所有的编码都被精确地保留,只需要对应于它的一部分编码没有误差就可以了。所以,能实现感兴趣区域压缩。同时JPEG 2000支持整形小波,因而能够实现无损压缩。The locality of wavelet in space and frequency domain is locality in statistical sense. The locality mentioned here means that the image space range actually involved by a transformation coefficient is local. Therefore, in order to fully restore a certain part of the image, it is not necessary that all the codes are accurately preserved, only that a part of the codes corresponding to it has no error. Therefore, region-of-interest compression can be achieved. At the same time, JPEG 2000 supports shaping wavelet, so it can realize lossless compression.
小波建立在三个主要的基础理论之上,分别是阶层式编码(pyramidcoding)、滤波器组理论(filter bank theory)、以及子带编码(subbandcoding),小波变换结合了此三项技术。小波变换能将各种交织在一起的不同频率组成的信号,分解成不相同频率的信号,因此能有效的应用于编码、解码、检测边缘、压缩数据,及将非线性问题线性化。良好的分析局部的时域与频域的信号,弥补傅利叶转换中的缺失。Wavelet is based on three main basic theories, which are pyramid coding, filter bank theory, and subband coding. Wavelet transform combines these three technologies. Wavelet transform can decompose signals composed of different frequencies that are intertwined together into signals of different frequencies, so it can be effectively applied to encoding, decoding, edge detection, data compression, and linearization of nonlinear problems. Good analysis of local time domain and frequency domain signals, making up for the lack of Fourier transform.
下面说明本发明的基于JPEG 2000码流增强视频图像的原理。The principle of enhancing the video image based on the JPEG 2000 code stream of the present invention is illustrated below.
Motion JPEG 2000是一种用于编码运动图像的新的国际标准。使用与JPEG 2000相同的帧内编码技术。码流被逐帧地预先打包。当出现网络拥塞时,路由器有选择地丢弃某些不重要的部分,并且只传输用于图像重构的重要部分。所谓不重要的数据包是指对人的视觉影响较小,图像分辨率较高或较清晰的数据包。高频信息的丢失将缓解网络传输的负担,但在一定程度上会降低图像的质量。Motion JPEG 2000 is a new international standard for encoding moving pictures. Uses the same intra-coding technique as JPEG 2000. The bitstream is prepackaged frame by frame. When network congestion occurs, routers selectively discard certain unimportant parts and transmit only important parts for image reconstruction. The so-called unimportant data packets refer to data packets that have less impact on human vision and have a higher or clearer image resolution. The loss of high-frequency information will ease the burden of network transmission, but it will reduce the quality of the image to a certain extent.
鉴于Motion JPEG 2000采用帧内编码技术,编码后的运动图像的各帧之间的帧间相关性依然存在,某一帧图像在传输过程中丢失的信息并不会影响到其它帧。因此,可以在解码端利用未受到图像数据损失的相关帧在一定程度上恢复丢失的数据,从而增强图像质量。Since Motion JPEG 2000 uses intra-frame coding technology, the inter-frame correlation between the frames of the encoded motion picture still exists, and the information lost during the transmission of a certain frame image will not affect other frames. Therefore, the lost data can be recovered to a certain extent by using the relevant frames that have not suffered the image data loss at the decoding end, thereby enhancing the image quality.
根据本发明的实施例,例如在通过互联网接收图像的过程中,假设当前帧出现了图像数据损失,而参考帧相对保持完好。首先在当前帧和参考帧之间做出运动估算。假设接收到的各帧数据经解码后得到帧序列In。根据JPEG 2000的渐进传输原理,接收到的数据量越多,恢复的图像质量就越好。因此,如果在接收端检测到某一帧的数据量明显小于其它帧的数据量,则该帧中可能出现数据丢失。在这种情况下,需要对接收到的当前帧进行图像增强。According to an embodiment of the present invention, for example, in the process of receiving an image through the Internet, it is assumed that image data loss occurs in the current frame, while the reference frame remains relatively intact. Motion estimation is first made between the current frame and a reference frame. It is assumed that each frame of data received is decoded to obtain a frame sequence I n . According to the progressive transmission principle of JPEG 2000, the more data received, the better the image quality will be restored. Therefore, if it is detected at the receiving end that the amount of data in a certain frame is significantly smaller than the amount of data in other frames, data loss may occur in that frame. In this case, image enhancement needs to be performed on the received current frame.
下面结合图2说明对视频图像进行离散小波变换的增强过程。The enhancement process of performing discrete wavelet transform on video images will be described below with reference to FIG. 2 .
对图像增强的过程需要利用参考帧,这个参考帧应该是已经接收到的未受到数据损失的某一帧。为了进行图像增强,在接收端需要设置缓存器来存储用于对数据丢失的图像帧进行增强的参考帧,即,在缓存器中存储一帧未受到数据损失的图像信号。在接收过程中判断新接收的图像帧是否受到数据损失。如果新的一帧没有受到数据损失,可用新的一帧刷新缓存器。The process of image enhancement needs to use a reference frame, which should be a received frame without data loss. In order to perform image enhancement, a buffer needs to be set at the receiving end to store reference frames for enhancing image frames with data loss, that is, a frame of image signals without data loss is stored in the buffer. During the receiving process, it is judged whether the newly received image frame suffers from data loss. If the new frame does not suffer data loss, the buffer can be flushed with a new frame.
为了便于讨论,在下面的例子中假设接收的当前帧,例如,In的图像数据受到损失。应该理解,图像数据被在网络传输过程中被截断的帧不限于当前帧。可利用,例如,缓存器存储的未受到数据损失的前一帧In-1帧作为参考帧。用xn(i,j)表示出现数据丢失的第n帧的空域信号,经过多级的DWT变换后,子带b内的小波系数表示为yb n(i,j),b∈{kLL,kHL,kLH,kHH},其中kHL(k是正整数)表示在第k层分解中对行信号进行高通滤波,对列信号进行低通滤波后间隔采样的结果。For ease of discussion, in the following examples it is assumed that the image data of the received current frame, eg In , is lost. It should be understood that the frame whose image data is truncated during network transmission is not limited to the current frame. For example, the previous frame In -1 frame stored in the buffer without data loss can be used as the reference frame. Use x n (i, j) to represent the spatial domain signal of the nth frame where data loss occurs. After multi-level DWT transformation, the wavelet coefficients in sub-band b are expressed as y b n (i, j), b∈{kLL , kHL, kLH, kHH}, where kHL (k is a positive integer) represents the result of performing high-pass filtering on the row signal and performing low-pass filtering on the column signal in the decomposition of the k-th layer.
为了叙述方便起见,假设每帧图像都只经过一级离散小波变换(DWT)。由于传输的是运动图像,为了对受到数据损失的图像帧进行图像增强,首先需要找到参考帧与当前帧的匹配数据块。如图2所示,在LL子带内将第In帧频域信号在划分为固定大小的M*N(M和N是正整数)个像素的数据块,逐个数据块地在In-1的LL子带和其经过运动偏移后的三个LL子带中进行运动搜索和匹配(具体方法将在之后详述)。如果得到匹配,则用第In-1帧的该数据块对第In帧的数据进行增强。如果未得到匹配,则放弃参考帧的该数据块。For the convenience of description, it is assumed that each frame of image undergoes only one level of discrete wavelet transform (DWT). Since the transmission is a moving image, in order to perform image enhancement on an image frame subject to data loss, it is first necessary to find a matching data block between the reference frame and the current frame. As shown in Figure 2, in the LL subband, the I nth frame frequency domain signal is divided into data blocks of M*N (M and N are positive integers) pixels of fixed size, and the data blocks are divided into I n-1 data blocks one by one. The motion search and matching are performed in the LL sub-bands and the three LL sub-bands after motion offset (the specific method will be described in detail later). If a match is obtained, the data of the I nth frame is enhanced with the data block of the I n -1th frame. If no match is found, the data block of the reference frame is discarded.
由于图像的运动性,参考帧与受到损失的当前帧之间的相关数据块之间存在着一定的偏移。这种情况下,为了找到匹配的数据块进行图像增强,需要移动图像帧的像素。在本实施例中,搜索相关数据块的过程是通过移动参考帧的数据块进行的。Due to the motion of the image, there is a certain offset between the relevant data blocks between the reference frame and the lost current frame. In this case, in order to find a matching data block for image enhancement, it is necessary to shift the pixels of the image frame. In this embodiment, the process of searching for relevant data blocks is performed by moving the data blocks of the reference frame.
为了搜索对应的数据块,对第In-1帧的频域信号进行逆离散小波变换(IDWT),以便得到第In-1帧的空域信号xn-1(i,j)。接下来,将空域信号xn-1(i,j)沿x轴,y轴以及x=y方向各移动一个像素,得到三幅图像xn-1(i-1,j)、xn-1(i,j-1)、和xn-1(i-1,j-1)。分别对这三幅图像进行DWT变换,得到三个经过移位的第In-1帧的频域信号。将转换后的信号表示为yb(1) n-1(i,j)、yb(2) n-1(i,j)、和yb(3) n-1(i,j),而
将接收到的当前帧的LL子带内的频域信号yLL n(i,j)划分为固定大小的数据块,例如8*8。然后逐个在参考帧的四个LL子带yLL(0) n-1(i,j)、yLL(1) n-1(i,j)、yLL(2) n-1(i,j)、yLL(3) n-1(i,j)中搜索与当前帧的该每个数据块最相似的数据块。Divide the received frequency domain signal y LL n (i, j) in the LL subband of the current frame into data blocks of fixed size, for example, 8*8. Then one by one in the four LL subbands y LL(0) n-1 (i, j), y LL(1) n-1 (i, j), y LL(2) n-1 (i, j) of the reference frame j), y LL(3) n-1 (i, j) to search for the data block most similar to each data block of the current frame.
搜索过程可采用三步法,判断两个数据块是否相似可以采用最大绝对距离(MAD)法则,即计算当前帧与移位的参考帧的对应子带的频域信号之间的差值的绝对值之和,即,计算
在参考帧中找到了与当前帧的某个小波系数yLL n(i,j)对应的运动矢量(di,dj,k)后,其在参考帧中对应的小波系数则是yLL(k) n-1(i+di,j+dj)。此后,可对当前帧中损失数据进行增强。作为例子,例如,如果在传输过程中yLL n(i,j)丢失了当前帧的p个最低有效位,在接收端实际收到的则是此时可以得到下面的公式(1)表示的未经损失的yLL n(i,j)的分布范围:After the motion vector (di, dj, k) corresponding to a certain wavelet coefficient y LL n (i, j) of the current frame is found in the reference frame, its corresponding wavelet coefficient in the reference frame is y LL(k ) n-1 (i+di, j+dj). Thereafter, the lost data in the current frame can be enhanced. As an example, for example, if y LL n (i, j) loses the p least significant bits of the current frame during transmission, what is actually received at the receiver is At this time, the distribution range of y LL n (i, j) without loss expressed by the following formula (1) can be obtained:
用在上式所列范围内与yLL(k n-1(i+di,j+dj)最接近的值作为对受到损失的当前帧的小波系数的增强结果。其具体操作如下:Use the value closest to y LL(k n-1 (i+di, j+dj) within the range listed in the above formula as the wavelet coefficient of the current frame subject to loss enhanced results. Its specific operation is as follows:
如果参考帧中的小波系数yLL(k n-1(i+di,j+dj)中除最后p个最低有效位外均与相同,即:则可以令作为的增强结果。也就是说,用yLL(k n-1(i+di,j+dj)的最后p个最低有效位来补偿受到损失的当前帧的 If the wavelet coefficients y LL(k n-1 (i+di, j+dj) in the reference frame are all the same as same, ie: then you can make as enhanced results. That is, use the last p least significant bits of y LL(k n-1 (i+di, j+dj) to compensate for the loss of the current frame
如果参考帧中的小波系数yLL(k n-1(i+di,j+dj)中除最后p个最低有效位外仍与有不同,则比较公式(1)中的不等式确定的区间的两个端点。在这两个端点中选择距yLL(k n-1(i+di,j+dj)较近的一个点作为对当前帧的增强结果。If the wavelet coefficient y LL(k n-1 (i+di, j+dj) in the reference frame is still consistent with If there is a difference, compare the two endpoints of the interval determined by the inequality in formula (1). Select a point closer to y LL(k n-1 (i+di, j+dj) from these two endpoints as the enhancement result for the current frame.
在最坏的情况下,当前帧的yLL n(i,j)丢失了其所有的有效位(例如,整个子带内的数据都被丢失),这种情况下,则直接令当前帧中的
根据离散小波变换的性质,对于图像的三个高频子带HL、LH和HH,不需要再到前一帧中做运动估算和查找对应的数据块。如果某个小波系数yLL n(i,j)对应运动矢量(di,dj,k),则对于所有的子带b,则有yb n(i,j)对应的运动矢量都是(di,dj,k)。这种情况下下,可以用参考帧中的系数yb(k) n-1(i+di,j+dj)作为参考量来增强丢失数据的当前帧的系数yb n(i,j)。According to the nature of the discrete wavelet transform, for the three high-frequency sub-bands HL, LH and HH of the image, there is no need to go to the previous frame for motion estimation and search for corresponding data blocks. If a wavelet coefficient y LL n (i, j) corresponds to the motion vector (di, dj, k), then for all subbands b, the motion vectors corresponding to y b n (i, j) are (di , dj, k). In this case, the coefficient y b(k) n-1 (i+di, j+dj) in the reference frame can be used as a reference to enhance the coefficient y b n (i, j) of the current frame of the missing data .
经过对丢失数据的当前帧的小波系数进行增强处理之后,对增强后的小波系数进行逆离散小波变换和诸如直流复原,颜色空间逆变换等之类的后处理,就可以在接收端恢复出视频图像经过增强后的重建图像。After the wavelet coefficients of the current frame of the lost data are enhanced, the enhanced wavelet coefficients are subjected to inverse discrete wavelet transform and post-processing such as DC restoration, color space inverse transformation, etc., and the video can be restored at the receiving end. The reconstructed image after the image has been enhanced.
上面为了简单起见以一级DWT为例说明了利用图像的帧间相关性对视频图像进行补偿增强的方法。一般情况下,一帧图像在编码之前要经过3-5级的DWT变换。上述算法可以推广到m级的DWT变换。其过程如下:For the sake of simplicity, the first-level DWT is taken as an example to illustrate the method of compensating and enhancing the video image by using the inter-frame correlation of the image. Generally, a frame of image needs to undergo 3-5 levels of DWT transformation before encoding. The above algorithm can be extended to m-level DWT transformation. The process is as follows:
1.按照一级DWT变换的方法,对In的LLm,HLm,LHm,HHm(m是整数)子带进行增强。1. According to the method of one-stage DWT transformation, the LL m , HL m , LH m , HH m (m is an integer) subbands of In are enhanced.
2.对增强后的四个子带进行逆DWT变换,以得到第m级DWT变换之前的低频子带LLm-1。2. Inverse DWT transform is performed on the four enhanced sub-bands to obtain the low-frequency sub-band LL m-1 before the m-th DWT transform.
3.LLm-1是一个增强后的子带,在此基础上可对m-1级的其它三个子带HLm-1,LHm-1,HHm-1进行补偿增强。其增强方法与前面说明的过程相似。另外,应指出的是,可以用上一级的运动矢量(di,dj,k)m来加快对本级运动矢量的搜索。3. LL m-1 is an enhanced sub-band, and on this basis, compensation and enhancement can be performed on the other three sub-bands HL m-1 , LH m-1 , and HH m-1 of level m-1. Its enhancement method is similar to the process explained earlier. In addition, it should be pointed out that the motion vector (di, dj, k) m of the previous stage can be used to speed up the search for the motion vector of the present stage.
4.由此如上所述的方式按照分辨率由低到高的顺序逐级对受到数据丢失的图像帧进行增强,直到整个处理过程结束。4. Thus, in the manner as described above, image frames subject to data loss are enhanced step by step in order of resolution from low to high, until the entire processing process ends.
图3示出了采用根据本发明的图像增强方法对图像进行增强处理后的图像的峰值信噪比增益。该实例对MPEG-2标准测试图像“运动物体和日历”进行了测试。所采用的参数如下:Fig. 3 shows the peak signal-to-noise ratio gain of the image after the image is enhanced by the image enhancement method according to the present invention. This example tests the MPEG-2 standard test image "moving object and calendar". The parameters used are as follows:
视频序列:运动物体和日历,选取300帧图像,352*288像素/帧,图像采用逐行扫描方式。Video sequence: moving objects and calendars, select 300 frames of images, 352*288 pixels/frame, and the images adopt progressive scanning.
小波基:采用JPEG 2000 Part 1提供的9/7实型滤波器组;运动补偿采用低频带移位(Low-Band-Shift)。Wavelet base: use the 9/7 real-type filter bank provided by JPEG 2000
压缩率为2.0bpp。其中对奇数帧进行了截断处理,截断后的码流相当于1.0bpp。通过采用本发明的方法对截断的奇数帧进行增强处理后,可得到平均为3.12dB的峰值信噪比。图3中的横坐标表示帧序号,纵坐标表示峰值信噪比。图中的空白方框连成的曲线表示经过图像增强后的峰值信噪比,黑方块连成的曲线表示原始图像的峰值信噪比。从图3中可以明显看出,经过增强处理的图像比处理前的图像质量由了明显改善。The compression ratio is 2.0bpp. The odd frames are truncated, and the truncated code stream is equivalent to 1.0bpp. After enhancing the truncated odd frames by using the method of the present invention, an average peak signal-to-noise ratio of 3.12dB can be obtained. The abscissa in FIG. 3 represents the frame number, and the ordinate represents the peak signal-to-noise ratio. The curve formed by the blank squares in the figure represents the peak signal-to-noise ratio after image enhancement, and the curve formed by the black squares represents the peak signal-to-noise ratio of the original image. It can be clearly seen from Figure 3 that the quality of the enhanced image is significantly improved compared with the image before processing.
至此已结合特定实施例描述了本发明,应该指出,在此所作的描述只作为说明的目的,在不脱离下面权利要求的精神或范围的情况下可对本发明进行改变和变化。Having thus far described the invention in connection with specific embodiments, it should be noted that the description herein is for illustrative purposes only and that changes and variations may be made without departing from the spirit or scope of the following claims.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 03149199 CN100477789C (en) | 2003-06-20 | 2003-06-20 | A method of inter-frame image enhancement based on Joint Motion Photographic Experts Group 2000 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 03149199 CN100477789C (en) | 2003-06-20 | 2003-06-20 | A method of inter-frame image enhancement based on Joint Motion Photographic Experts Group 2000 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1568011A CN1568011A (en) | 2005-01-19 |
CN100477789C true CN100477789C (en) | 2009-04-08 |
Family
ID=34472486
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 03149199 Expired - Fee Related CN100477789C (en) | 2003-06-20 | 2003-06-20 | A method of inter-frame image enhancement based on Joint Motion Photographic Experts Group 2000 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100477789C (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8072950B2 (en) | 2006-07-05 | 2011-12-06 | Samsung Electronics Co., Ltd. | Collaborative mobile ad hoc network infrastructure |
US8675075B2 (en) | 2010-07-30 | 2014-03-18 | Thomson Licensing | Method and apparatus for measuring video quality |
CN108596855A (en) * | 2018-04-28 | 2018-09-28 | 国信优易数据有限公司 | A kind of video image quality Enhancement Method, device and video picture quality enhancement method |
WO2023133888A1 (en) * | 2022-01-17 | 2023-07-20 | 深圳市大疆创新科技有限公司 | Image processing method and apparatus, remote control device, system, and storage medium |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4989088A (en) * | 1988-12-15 | 1991-01-29 | Kokusai Denshin Denwa Co., Ltd. | Error correction system used in picture transmission |
-
2003
- 2003-06-20 CN CN 03149199 patent/CN100477789C/en not_active Expired - Fee Related
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4989088A (en) * | 1988-12-15 | 1991-01-29 | Kokusai Denshin Denwa Co., Ltd. | Error correction system used in picture transmission |
Non-Patent Citations (2)
Title |
---|
基于小波变换的块匹配运动估计方法. 龚涛,丁润涛.天津大学学报,第36卷第3期. 2003 |
基于小波变换的块匹配运动估计方法. 龚涛,丁润涛.天津大学学报,第36卷第3期. 2003 * |
Also Published As
Publication number | Publication date |
---|---|
CN1568011A (en) | 2005-01-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4254017B2 (en) | Image coding apparatus and method | |
JP5014989B2 (en) | Frame compression method, video coding method, frame restoration method, video decoding method, video encoder, video decoder, and recording medium using base layer | |
Reddy et al. | A lossless image compression using traditional and lifting based wavelets | |
CN101175208A (en) | Image encoding method and device, and image decoding method and device | |
KR20000059799A (en) | Device and method for motion compensation coding using wavelet coding | |
CN108810534B (en) | Image compression method based on direction boosting wavelet and improved SPIHT under the Internet of Things | |
KR20050028019A (en) | Wavelet based coding using motion compensated filtering based on both single and multiple reference frames | |
CN102006483B (en) | Video coding and decoding method and device | |
KR100678909B1 (en) | Method and apparatus for MFC-based video coding to reduce discrepancies between encoders and decoders | |
CN100477789C (en) | A method of inter-frame image enhancement based on Joint Motion Photographic Experts Group 2000 | |
KR101246115B1 (en) | Method and apparatus of image encoding/decoding based on 3D wavelet transform | |
Wu et al. | Comparisons of threshold ezw and spiht wavelets based image compression methods | |
CN1689045A (en) | L-frames with both filtered and unfilterd regions for motion comensated temporal filtering in wavelet based coding | |
Sunkara | Image compression using hand designed and Lifting Based Wavelet Transforms | |
KR100361804B1 (en) | Apparatus and method for compressing and depressing movable images using Wavelet theory | |
Marzougui et al. | Preliminary study of block matching algorithms for wavelet-based t+ 2D video coding | |
Morris et al. | Intra-frame wavelet video coding | |
Ranjeeta et al. | Image compression: an overview | |
Zhou | An Improved SPIHT Algorithm for Lossy Image Coding | |
Sankaragomathi et al. | Use of wavelets in fractal compression algorithm for enhanced performance | |
Vrindavanam et al. | Wavelet and JPEG based image compression: an experimental analysis | |
Al-Asmari | Compression Algorithm Using 3-D Discrete Wavelet Decomposition | |
Chen et al. | LBT based low complexity image compression method | |
Al-Asmari et al. | Low bit rate video compression algorithm using 3-D Decomposition | |
Al-Jawad et al. | An efficient realtime video compression algorithm with high feature preserving capability |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20090408 Termination date: 20120620 |