CN108769696A - A kind of DVC-HEVC video transcoding methods based on Fisher discriminates - Google Patents
A kind of DVC-HEVC video transcoding methods based on Fisher discriminates Download PDFInfo
- Publication number
- CN108769696A CN108769696A CN201810573870.6A CN201810573870A CN108769696A CN 108769696 A CN108769696 A CN 108769696A CN 201810573870 A CN201810573870 A CN 201810573870A CN 108769696 A CN108769696 A CN 108769696A
- Authority
- CN
- China
- Prior art keywords
- frame
- hevc
- dvc
- encoding
- formula
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 54
- 230000008569 process Effects 0.000 claims abstract description 19
- 239000006185 dispersion Substances 0.000 claims abstract description 12
- 238000005457 optimization Methods 0.000 claims abstract description 4
- 239000013598 vector Substances 0.000 claims description 28
- 230000033001 locomotion Effects 0.000 claims description 27
- 238000004422 calculation algorithm Methods 0.000 claims description 21
- 238000012360 testing method Methods 0.000 claims description 15
- 238000012549 training Methods 0.000 claims description 15
- 238000004458 analytical method Methods 0.000 claims description 13
- 238000005192 partition Methods 0.000 claims description 12
- 238000012795 verification Methods 0.000 claims description 10
- 230000006870 function Effects 0.000 claims description 6
- 238000000638 solvent extraction Methods 0.000 claims description 2
- 230000001154 acute effect Effects 0.000 claims 1
- 238000005303 weighing Methods 0.000 claims 1
- 238000010561 standard procedure Methods 0.000 abstract 1
- 238000013145 classification model Methods 0.000 description 7
- 238000004891 communication Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 230000001133 acceleration Effects 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 238000010835 comparative analysis Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/40—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
本发明提供了一种基于预测残差的DVC‑HEVC转码方法及其转码器,主要涉及转码器中HEVC编码方式的划分。在编码方式划分中,利用DVC解码端生成的WZ帧的边信息与当前HEVC编码帧(即WZ帧的重建帧)二者相减获得的预测残差,通过此预测残差来快速确定CU的划分模式,从而跳过HEVC编码模块中复杂的逐层率失真优化过程。本发明根据当前预测残差的离散度与阈值之间的关系进行条件判决,确定是否继续进行CU划分,快速确定CU分块模式,从而有效的降低了编码端的计算复杂度。实验结果表明,本发明在编码效率和峰值信噪比(PSNR)损失都很小的情况下,和HM16.5标准方法相比,大大降低了编码时间。
The present invention provides a DVC-HEVC transcoding method and a transcoder based on a prediction residual, and mainly relates to the division of HEVC encoding methods in the transcoder. In the encoding method division, the prediction residual obtained by subtracting the side information of the WZ frame generated by the DVC decoder from the current HEVC coded frame (that is, the reconstructed frame of the WZ frame) is used to quickly determine the CU Split mode, thereby skipping the complex layer-by-layer rate-distortion optimization process in the HEVC encoding module. The present invention performs conditional judgment according to the relationship between the dispersion of the current prediction residual and the threshold, determines whether to continue CU division, and quickly determines the CU block mode, thereby effectively reducing the computational complexity of the encoding end. Experimental results show that the present invention greatly reduces the encoding time compared with the HM16.5 standard method under the condition that the encoding efficiency and the peak signal-to-noise ratio (PSNR) loss are small.
Description
技术领域technical field
本发明涉及图像通信领域中的视频转码技术问题,尤其是涉及一种基于Fisher判别式的分布式视频编码(DVC)到HEVC标准之间的视频转码技术。The present invention relates to the technical problem of video transcoding in the field of image communication, in particular to a video transcoding technology between Fisher discriminant-based Distributed Video Coding (DVC) and HEVC standard.
背景技术Background technique
随着数字多媒体技术的飞速发展,高清视频的普及以及超高清格式(4K×2K或8K×4K 分辨率)的出现,新型的视频通信得到了广泛的应用,如无线监控视频、移动视频通话、视频会议、移动远程现场指挥等,视频数据量的增长速度已经远远超过了网络和存储技术的发展速度,目前视频领域广泛应用的H.264/AVC标准已很难满足进一步的视频应用需求。With the rapid development of digital multimedia technology, the popularization of high-definition video and the emergence of ultra-high-definition format (4K×2K or 8K×4K resolution), new video communication has been widely used, such as wireless surveillance video, mobile video call, Video conferencing, mobile remote on-site command, etc., the growth rate of video data volume has far exceeded the development speed of network and storage technology. At present, the widely used H.264/AVC standard in the video field can hardly meet the further video application requirements.
2010年MPEG和VCEG组织成立了视频编码联合协作小组(JCT-VC),成功研发出了新一代视频编码标准——高效率视频编码(High Efficiency Video Coding,HEVC)。HEVC在编码端采用预测加变换的方式,相对H.264提高一倍左右的压缩效率,同时带来了编码器复杂度的急剧增加。分布式视频编码(Distributed Video Coding,DVC)技术通过将挖掘视频相关性的过程从编码端转移至解码端,实现了编码端低复杂度、低功耗的视频通信,得到了国内外学者越来越广泛的重视。然而其解码端复杂度较高,且编码性能与传统视频编码标准还是存在一定差距。因此,上述两种编码方案都无法很好的满足移动终端设备之间的低复杂度、低功耗视频通信需求。基于DVC-HEVC转码的方案可以有效地结合两者的优点,为上述存在的问题提供一种有效的解决思路。In 2010, MPEG and VCEG established the Joint Video Coding Collaboration Team (JCT-VC), and successfully developed a new generation of video coding standard - High Efficiency Video Coding (HEVC). HEVC adopts the method of prediction and transformation at the encoding end, which improves the compression efficiency by about one time compared with H.264, and at the same time brings a sharp increase in the complexity of the encoder. Distributed Video Coding (DVC) technology transfers the process of mining video correlation from the encoding end to the decoding end, realizing low-complexity and low-power video communication at the encoding end, and has been increasingly recognized by scholars at home and abroad. more extensive attention. However, the complexity of the decoding end is relatively high, and there is still a certain gap between the encoding performance and the traditional video encoding standards. Therefore, neither of the above two encoding schemes can well meet the requirements of low-complexity and low-power video communication between mobile terminal devices. The solution based on DVC-HEVC transcoding can effectively combine the advantages of both, and provide an effective solution to the above existing problems.
目前学术界对于DVC向传统视频标准转码开展了大量工作,并且多数已经取得了较为良好的进展。Alberto Corrales-García等根据H.264编码时宏块的分块尺寸大小与DVC解码端边信息的残差分布具有较高相似性这一特点,实现了模式选择算法的加速。J.L.Martinez等利用 DVC解码时产生的运动矢量减少H.264编码时运动估计过程中的计算复杂度。荣松等基于云计算的优势,提出了一种基于云转码的低复杂度端到端的视频通信系统。上述方法都能在一定程度上加速DVC-H.264转码过程,但是关于DVC-HEVC转码的研究仍处于初步阶段。At present, the academia has carried out a lot of work on transcoding DVC to traditional video standards, and most of them have made relatively good progress. Alberto Corrales-García et al. realized the acceleration of the mode selection algorithm based on the fact that the block size of the macroblock during H.264 encoding has a high similarity with the residual distribution of the side information of the DVC decoding end. J.L.Martinez et al. use the motion vector generated during DVC decoding to reduce the computational complexity in the motion estimation process during H.264 encoding. Based on the advantages of cloud computing, Rong Song et al. proposed a low-complexity end-to-end video communication system based on cloud transcoding. The above methods can accelerate the DVC-H.264 transcoding process to a certain extent, but the research on DVC-HEVC transcoding is still in its preliminary stage.
发明内容Contents of the invention
本发明的目的是加快DVC-HEVC转码器中HEVC编码过程。本发明利用DVC解码过程中产生的特征信息与CU划分模式之间的关系,建立CU快速划分模型,提出了一种基于Fisher 判别式的DVC-HEVC视频转码方法。相比传统的级联转码方案,本发明的方法在编码效率和峰值信噪比损失都很小的情况下,能较大幅度地降低转码器中HEVC编码的计算复杂度。The purpose of the present invention is to speed up the HEVC encoding process in a DVC-HEVC transcoder. The present invention utilizes the relationship between feature information generated during the DVC decoding process and the CU partition mode to establish a CU fast partition model, and proposes a DVC-HEVC video transcoding method based on Fisher discriminant formula. Compared with the traditional cascaded transcoding scheme, the method of the present invention can greatly reduce the computational complexity of HEVC encoding in the transcoder with little loss in coding efficiency and peak signal-to-noise ratio.
本发明的基本思想是利用DVC解码端生成的边信息作为当前HEVC编码帧(即WZ帧的重建帧)的预测帧,二者的残差即为预测残差,结合该帧的纹理复杂度及平均运动矢量,通过机器学习的方法得到三种特征与CU划分模式之间的关系,建立了一种基于Fisher判别分析法的分类器模型,跳过HEVC编码模块中复杂的逐层率失真优化过程,从而快速确定CU划分模式,达到降低计算复杂度的目的。The basic idea of the present invention is to use the side information generated by the DVC decoder as the predicted frame of the current HEVC encoded frame (that is, the reconstructed frame of the WZ frame), and the residual of the two is the predicted residual, combined with the texture complexity of the frame and The average motion vector, the relationship between the three features and the CU partition mode is obtained through machine learning, and a classifier model based on Fisher discriminant analysis is established, skipping the complex layer-by-layer rate-distortion optimization process in the HEVC encoding module , so as to quickly determine the CU partition mode and achieve the purpose of reducing the computational complexity.
一般而言,图像序列的纹理复杂度及运动剧烈程度与HEVC编码中CU的划分模式有很大的相关性。在HEVC编码过程中,纹理较复杂区域及运动较明显区域的编码单元划分较小,纹理较简单区域及运动较平缓区域的编码单元划分较大。此外,由于视频序列中相邻帧的图像之间存在非常高的相关性,在相邻已解码帧的基础上通过运动估计和运动补偿可得到当前待解码帧的预测图像,预测残差即为预测图像与当前待编码帧的差值。对于视频图像中静止区域或运动平缓区域,其预测残差系数较小且波动范围很小,而对于运动剧烈区域,其预测残差系数则往往波动较大。通常情况下,预测残差值的大小和分布与HEVC编码中CU的划分模式也有较大的相关性。预测残差小的图像区域CU分块较大,预测残差较大且分布不均匀的CU分块较小。因此,可以将DVC的解码过程中可获得的预测残差与纹理复杂度、运动矢量三种特征综合起来,作为CU划分模式的依据。Generally speaking, the texture complexity and motion intensity of the image sequence have a great correlation with the CU partition mode in HEVC coding. In the HEVC encoding process, the division of coding units in areas with more complex textures and areas with more obvious motion is smaller, while the division of coding units in areas with simpler textures and smoother motions is larger. In addition, due to the very high correlation between the images of adjacent frames in the video sequence, the predicted image of the current frame to be decoded can be obtained through motion estimation and motion compensation on the basis of adjacent decoded frames, and the prediction residual is The difference between the predicted image and the current frame to be encoded. For the static area or the gentle motion area in the video image, the prediction residual coefficient is small and the fluctuation range is small, while for the severe motion area, the prediction residual coefficient tends to fluctuate greatly. In general, the size and distribution of prediction residual values are also highly correlated with the CU partition mode in HEVC coding. The CU blocks in the image area with small prediction residuals are larger, and the CU blocks with large prediction residuals and uneven distribution are smaller. Therefore, the prediction residual obtained during the DVC decoding process can be combined with the three characteristics of texture complexity and motion vector as the basis for the CU partition mode.
预测残差图像的分布情况可以用残差数据的差异度(即离散度)来衡量,本发明选择数据的标准差作为离散度的度量。此外,本发明选择数据的绝对误差和作为图像纹理复杂度的度量,采用数据的方差作为运动矢量信息的度量。The distribution of the predicted residual image can be measured by the degree of difference (that is, the degree of dispersion) of the residual data, and the present invention selects the standard deviation of the data as the measure of the degree of dispersion. In addition, the present invention selects the absolute error sum of the data as the measure of the image texture complexity, and uses the variance of the data as the measure of the motion vector information.
Fisher判别分析法是模式识别中的经典算法,其基本思想是将高维的模式样本投影到最佳鉴别矢量空间,投影后的模式样本在新的子空间中有最大的类间距离和最小的类内距离,达到抽取分类信息和压缩特征维数的作用。本发明中将残差信息离散度、平均运动矢量和纹理复杂度作为判别法的三维输入模式样本,采用机器学习的方法对其进行分类训练得到最佳分类模型函数,HEVC编码时只需输入三维特征信息即可得到CU划分模式的标签,确定该 CU是否继续往下划分。Fisher discriminant analysis is a classic algorithm in pattern recognition. Its basic idea is to project high-dimensional pattern samples into the best discriminant vector space, and the projected pattern samples have the largest inter-class distance and the smallest class distance in the new subspace. The intra-class distance achieves the function of extracting classification information and compressing the feature dimension. In the present invention, the residual information dispersion, average motion vector and texture complexity are used as the three-dimensional input pattern samples of the discriminant method, and the method of machine learning is used to classify and train it to obtain the best classification model function. When HEVC encoding, only three-dimensional The feature information can be used to obtain the label of the CU partition mode, and determine whether the CU continues to be partitioned.
在DVC-HEVC转码器设计中,提高转码实时性的关键步骤是如何高效利用DVC解码过程中产生的相关信息加速HEVC编码过程。本发明利用DVC解码过程中可获得的残差信息离散度、平均运动矢量以及纹理复杂度作为CU划分模型的参数,设置时间周期T,在一个T内通过对前k帧WZ帧的参数采用Fisher判别分析法得到最佳分类器模型,周期内剩余WZ 帧则利用该分类模型进行快速CU分块。为保证分类模型的准确率,设置了一种在线学习策略,可及时更新训练集的阈值和权值向量。通过本发明的方法,可以跳过HEVC编码模块中计算复杂度较高的逐层率失真优化过程,从而达到降低HEVC编码复杂度的目的。In the design of DVC-HEVC transcoder, the key step to improve the real-time transcoding is how to efficiently use the relevant information generated in the DVC decoding process to accelerate the HEVC encoding process. The present invention utilizes the residual information dispersion degree, average motion vector and texture complexity available in the DVC decoding process as the parameters of the CU partition model, sets the time period T, and adopts Fisher to the parameters of the previous k frames and WZ frames within one T. The discriminant analysis method obtains the best classifier model, and the remaining WZ frames in the cycle use the classification model for fast CU partitioning. In order to ensure the accuracy of the classification model, an online learning strategy is set up, which can update the threshold and weight vector of the training set in time. Through the method of the present invention, the layer-by-layer rate-distortion optimization process with high computational complexity in the HEVC encoding module can be skipped, thereby achieving the purpose of reducing HEVC encoding complexity.
具体主要包括以下过程步骤:Specifically, it mainly includes the following process steps:
(1)设置时间周期T,将DVC码流解码后的重建帧序列划分为T1、T2、T3……,周期内视频帧分为训练帧、验证帧以及测试帧,其中验证帧固定选取为一帧;(1) Set the time period T, divide the reconstructed frame sequence after decoding the DVC code stream into T 1 , T 2 , T 3 ..., the video frame in the period is divided into training frame, verification frame and test frame, wherein the verification frame is fixed Select as a frame;
(2)对训练帧进行HEVC编码,同时提取其特征,获取残差信息离散度、平均运动矢量以及纹理复杂度,分别建立64×64、32×32、16×16训练集,得到各个尺寸的CU划分模型;(2) Perform HEVC encoding on the training frame, extract its features at the same time, obtain the residual information dispersion, average motion vector and texture complexity, respectively establish 64×64, 32×32, 16×16 training sets, and obtain the CU partition model;
(3)验证帧仍进行HEVC编码,同时利用(2)中模型进行快速CU模式划分,计算初始准确率ACCinit;(3) Verify that the frame is still HEVC encoded, and use the model in (2) to perform fast CU mode division, and calculate the initial accuracy rate ACC init ;
(4)测试帧序列利用(2)中分类模型进行快速CU模式划分;(4) The test frame sequence uses the classification model in (2) to perform fast CU mode division;
(5)T2、T3……的步骤同T1类似,同时计算加权平均准确率ACCweight,对模型的分类向量和阈值进行更新。(5) The steps of T 2 , T 3 ... are similar to T 1 , and the weighted average accuracy rate ACC weight is calculated at the same time, and the classification vector and threshold of the model are updated.
本发明改进的是整个DVC-HEVC转码器中HEVC视频编码中计算复杂度最高的地方。在整个HEVC视频编码过程中,CTU的递归划分占计算复杂度的90%以上,本发明最关键的步骤是根据残差信息离散度、平均运动矢量以及纹理复杂度三种特征值,建立最佳分类器模型,使得测试帧快速进行CU划分模式的选择。因此,在计算复杂度方面,本发明方法着眼于DVC-HEVC转码器中HEVC视频编码中计算复杂度改进的最关键之处。What the present invention improves is the part with the highest computational complexity in HEVC video coding in the entire DVC-HEVC transcoder. In the entire HEVC video encoding process, the recursive division of CTU accounts for more than 90% of the computational complexity. The most critical step of the present invention is to establish the optimal The classifier model enables the test frame to quickly select the CU partition mode. Therefore, in terms of computational complexity, the method of the present invention focuses on the most critical point of improving computational complexity in HEVC video coding in a DVC-HEVC transcoder.
附图说明Description of drawings
图1为本发明基于Fisher判别分析法的DVC-HEVC视频转码方法系统框图;Fig. 1 is a system block diagram of the DVC-HEVC video transcoding method based on the Fisher discriminant analysis method of the present invention;
图2为本发明中DVC解码重建帧在编码前进行周期划分以及周期内帧划分示意图;Fig. 2 is a schematic diagram of periodic division and intra-period frame division of DVC decoded and reconstructed frames before encoding in the present invention;
图3为本发明基于Fisher判别分析法的DVC-HEVC视频转码方法的流程图;Fig. 3 is the flowchart of the DVC-HEVC video transcoding method based on the Fisher discriminant analysis method of the present invention;
图4~7为本发明方法与传统级联算法的率失真曲线图,其中,图4为BasketballDrill的率失真曲线;图5为BQMall的率失真曲线;图6为Johnny的率失真曲线;图7为FourPeople 的率失真曲线。Figures 4 to 7 are the rate-distortion curves of the method of the present invention and the traditional cascade algorithm, wherein Figure 4 is the rate-distortion curve of BasketballDrill; Figure 5 is the rate-distortion curve of BQMall; Figure 6 is the rate-distortion curve of Johnny; Figure 7 is the rate-distortion curve of FourPeople.
具体实施方式Detailed ways
下面结合附图及实施例对本发明作进一步的详细说明,有必要指出的是,以下的实施例只用于对本发明做进一步的说明,不能理解为对本发明保护范围的限制,所属领域技术熟悉人员根据上述发明内容,对本发明做出一些非本质的改进和调整进行具体实施,应仍属于本发明的保护范围。Below in conjunction with accompanying drawing and embodiment the present invention is described in further detail, it is necessary to point out that following embodiment is only used for further description of the present invention, can not be interpreted as the restriction to protection scope of the present invention, those skilled in the art According to the content of the invention above, making some non-essential improvements and adjustments to the present invention for specific implementation shall still belong to the protection scope of the present invention.
图3中,基于Fisher判别分析法的DVC-HEVC视频转码方法,包括以下步骤:In Fig. 3, the DVC-HEVC video transcoding method based on Fisher's discriminant analysis method comprises the following steps:
(1)对DVC码流进行解码,得到重建帧。设置时间周期T,将重建的视频序列划分为T1、T2、T3……;(1) Decode the DVC code stream to obtain the reconstructed frame. Set the time period T, divide the reconstructed video sequence into T 1 , T 2 , T 3 ......;
(2)在T1内对非关键帧进行帧分组,分为训练帧、验证帧以及测试帧,其中验证帧固定选取为一帧;(2) Carry out frame grouping to non - key frame in T1, be divided into training frame, verification frame and test frame, wherein the verification frame is fixedly selected as a frame;
(3)对训练帧进行HEVC编码,同时提取其特征,获取残差信息离散度、平均运动矢量以及纹理复杂度,分别建立64×64、32×32、16×16训练集,得到最佳分类向量w(1)64、w(1)32、 w(1)16及其阈值Th(1)64、Th(1)32、Th(1)16;(3) Perform HEVC encoding on the training frame, extract its features at the same time, obtain the residual information dispersion, average motion vector and texture complexity, respectively establish 64×64, 32×32, 16×16 training sets, and obtain the best classification Vectors w(1) 64 , w(1) 32 , w(1) 16 and their thresholds Th(1) 64 , Th(1) 32 , Th(1) 16 ;
(4)验证帧仍进行HEVC编码,同时利用(3)中模型进行快速CU模式划分,计算初始准确率ACCinit;(4) Verify that the frame is still HEVC encoded, and use the model in (3) to perform fast CU mode division, and calculate the initial accuracy rate ACC init ;
(5)将测试帧图像划分成LCU(64×64),进行特征提取,建立测试集,作为分类模型的输入。首先与最佳分类向量w(1)64相乘,其结果与阈值Th(1)64进行比较,若大于阈值,则进入步骤(6),否则,进入步骤(9);(5) Divide the test frame image into LCUs (64×64), perform feature extraction, and establish a test set as the input of the classification model. First multiplied with the optimal classification vector w (1) 64 , the result is compared with the threshold Th (1) 64 , if greater than the threshold, then enter step (6), otherwise, enter step (9);
(6)将LCU继续划分成32×32的块,提取其特征,与最佳分类向量w(1)32相乘后与阈值Th(1)32进行比较,若大于阈值,则进入步骤(7),否则,进入步骤(9);(6) Continue to divide the LCU into 32×32 blocks, extract its features, multiply it with the best classification vector w(1) 32 and compare it with the threshold Th(1) 32 , if it is greater than the threshold, then enter step (7 ), otherwise, go to step (9);
(7)继续划分成16×16的块,提取其特征,与最佳分类向量w(1)16相乘后与阈值Th(1)16进行比较,若大于阈值,则进入步骤(8),否则,进入步骤(9);(7) Continue to be divided into 16×16 blocks, extract its features, multiply it with the best classification vector w(1) 16 and then compare it with the threshold Th(1) 16 , if it is greater than the threshold, then enter step (8), Otherwise, go to step (9);
(8)继续划分成8×8的CU块;(8) Continue to divide into 8×8 CU blocks;
(9)将当前CU尺寸确定为最终CU分块模式,CU划分完成并进入PU模式的选择,继续下一步编码过程。(9) Determine the current CU size as the final CU block mode, complete the CU division and enter the selection of the PU mode, and continue the next step of the encoding process.
(10)在T2内对非关键帧进行帧分组,同样分为训练帧、验证帧以及测试帧,其中训练帧帧数可较T1适当减少;(10) Carry out frame grouping to non-key frame in T 2 , be divided into training frame, verification frame and test frame equally, wherein training frame frame number can be reduced appropriately compared with T 1 ;
(11)对训练帧进行HEVC编码,同时提取其特征,获取残差信息离散度、运动矢量方差以及纹理复杂度均值,建立64×64、32×32、16×16训练集,得到新的最佳分类向量w(2)64、w(2)32、w(2)16及其阈值Th(2)64、Th(2)32、Th(2)16。此外,利用T1中的模型计算训练帧中每一帧的准确率,得到一个加权平均准确率ACCweight;(11) Perform HEVC encoding on the training frame, extract its features at the same time, obtain the dispersion of residual information, the variance of the motion vector and the mean value of the texture complexity, establish 64×64, 32×32, 16×16 training sets, and obtain the new most The optimal classification vectors w(2) 64 , w(2) 32 , w(2) 16 and their thresholds Th(2) 64 , Th(2) 32 , Th(2) 16 . In addition, using the model in T1 to calculate the accuracy rate of each frame in the training frame, a weighted average accuracy rate ACC weight is obtained;
(12)验证帧仍进行HEVC编码,同时利用新训练的模型进行快速CU模式划分,计算其准确率ACCupdate;(12) Verify that the frame is still HEVC encoded, and at the same time use the newly trained model to perform fast CU mode division, and calculate its accuracy rate ACC update ;
(13)若ACCupdate大于ACCweight,则对后续的测试帧采用新训练的分类模型向量及阈值,否则仍采用上一周期内的分类模型向量及阈值;(13) If the ACC update is greater than the ACC weight , the newly trained classification model vector and threshold are used for subsequent test frames, otherwise the classification model vector and threshold in the previous cycle are still used;
(14)测试帧采用同T1周期类似的划分步骤;(14) The test frame adopts a division step similar to the T1 period;
(15)T3、T4……的步骤同T2一致。(15) The steps of T 3 , T 4 ... are the same as those of T 2 .
具体地,所述步骤(1)中,结合转码流程可知,当前HEVC编码帧是经过DVC解码后WZ帧的重建帧。由于在DVC编码时对关键帧帧采用的HEVC编码,因此,在转码时不需要再对关键帧进行编码。Specifically, in the step (1), combined with the transcoding process, it can be known that the current HEVC coded frame is a reconstructed frame of the WZ frame after DVC decoding. Since HEVC encoding is adopted for key frames during DVC encoding, there is no need to encode key frames during transcoding.
所述步骤(3)中,本文利用当前CU编码单元的各个像素值与该编码单元像素均值的绝对误差和来表示该块的纹理复杂度,具体如式(1)所示。其中,W为当前CU块的宽度,P(x,y) 表示坐标位置(x,y)处的像素值。此外,为了降低纹理计算复杂度,本文对尺寸大小为64×64 及32×32的CU编码单元进行水平和垂直方向的1/2下采样处理。In the step (3), the sum of absolute errors between each pixel value of the current CU coding unit and the pixel mean value of the coding unit is used to represent the texture complexity of the block, specifically as shown in formula (1). Wherein, W is the width of the current CU block, and P(x, y) represents the pixel value at the coordinate position (x, y). In addition, in order to reduce the complexity of texture calculation, this paper performs horizontal and vertical 1/2 downsampling processing on CU coding units with sizes of 64×64 and 32×32.
本文利用当前CU编码单元中所有8×8大小块的运动矢量的平均值来衡量图像块的运动剧烈程度,如式(2)所示。其中,N代表当前CU编码单元所包含8×8不重叠块的个数,MVi(x)与MVi(y)分别代表第i个8×8块运动矢量的水平和垂直方向上的运动偏移大小In this paper, the average value of the motion vectors of all 8×8 blocks in the current CU coding unit is used to measure the motion intensity of the image block, as shown in formula (2). Among them, N represents the number of 8×8 non-overlapping blocks contained in the current CU coding unit, and MV i (x) and MV i (y) represent the horizontal and vertical motions of the ith 8×8 block motion vector respectively offset size
本文利用DVC解码获得的边信息及其重建帧的差值来代替当前帧HEVC编码对应的预测残差,如式(3)所示,通过当前CU中每个像素值与像素均值计算每个CU编码单元的离散度,如式(4)所示。其中,R(x,y)代表当前解码重建帧的像素值,SI(x,y)代表当前帧在解码过程中生成的边信息,Pres(x,y)则代表边信息残差值大小,n代表的是当前CU编码单元中像素个数,p代表的是所有像素的平均值。In this paper, the side information obtained by DVC decoding and the difference between its reconstructed frame are used to replace the prediction residual corresponding to HEVC encoding of the current frame. The degree of dispersion of coding units is shown in formula (4). Among them, R(x,y) represents the pixel value of the currently decoded and reconstructed frame, SI(x,y) represents the side information generated during the decoding process of the current frame, and P res (x,y) represents the residual value of the side information , n represents the number of pixels in the current CU coding unit, and p represents the average value of all pixels.
Pres(x,y)=R(x,y)-SI(x,y) (3)P res (x,y)=R(x,y)-SI(x,y) (3)
本文选择了基于Fisher判别分析式学习模型,分界阈值Th通过式(5)获得,其中代表的是各类样本经过向量w投影后的均值,当Th确定后,对于测试集的CU单元划分,利用式(6)进行决策,其中Sgn函数为符号函数,当f(X)等于-1时代表“划分”,反之则代表“不划分”。This paper chooses a learning model based on Fisher's discriminant analysis, and the cut-off threshold Th is obtained by formula (5), where Represents the mean value of various samples after vector w projection. When Th is determined, for the CU unit division of the test set, use formula (6) to make a decision, where the Sgn function is a symbolic function. When f(X) is equal to -1 When it means "divided", otherwise it means "not divided".
f(X)=Sgn(wTX-T) (6)f(X)=Sgn(w T XT) (6)
所述步骤(4)中,初始准确率ACCinit计算公式如式(7)所示,其中,CountCU表示转码后不同尺寸(包括“64×64”、“32×32”、“16×16”、“8×8”)CU的个数,CountCUHEVC表示采用级联转码算法后不同CU的个数,CountCU本文表示采用本文所提快速转码算法后不同CU 的个数。In the step (4), the calculation formula of the initial accuracy rate ACC init is shown in formula (7), wherein, CountCU represents different sizes after transcoding (including "64×64", "32×32", "16×16 ", "8×8") the number of CUs, CountCU HEVC indicates the number of different CUs after adopting the cascade transcoding algorithm, and CountCU in this paper indicates the number of different CUs after adopting the fast transcoding algorithm proposed in this paper.
所述步骤(11)中,加权平均准确率ACCweight可通过式(8)计算得到。In the step (11), the weighted average accuracy rate ACC weight can be calculated by formula (8).
为证明本发明算法的有效性,我们对其进行了实验验证,其结果如图4~7所示。图4~7 为本发明的基于Fisher判别分析法的DVC-HEVC视频转码方法与传统的级联转码算法的率失真曲线对比结果,比较的具体过程如下:In order to prove the effectiveness of the algorithm of the present invention, we have carried out experimental verification on it, and the results are shown in Figures 4-7. Figures 4 to 7 are the comparison results of the rate-distortion curves between the DVC-HEVC video transcoding method based on the Fisher discriminant analysis method of the present invention and the traditional cascade transcoding algorithm. The specific process of the comparison is as follows:
(1)对视频序列进行DVC编解码,视频序列选择标准的HEVC测试视频,它们的名称、分辨率分别为:BasketballDrill(832×480)、FourPeople(1280×720),帧率为30帧/ 秒。其中,量化步长(QP)值分别取22、26、30、34。(1) DVC codec is performed on the video sequence, and the standard HEVC test video is selected for the video sequence. Their names and resolutions are: BasketballDrill (832×480), FourPeople (1280×720), and the frame rate is 30 frames per second . Wherein, the quantization step size (QP) values are 22, 26, 30, 34 respectively.
(2)同时打开两个方法的程序并设置好相同的配置文件,HEVC编码参数版本选择HM16.5,量化步长(QP)值分别取22、26、30、34。本发明将与传统的级联转码算法进行比较。本文选择三种视频编码性能:峰值信噪比(PSNR)、比特率以及编码时间(其中PSNR 体现视频的客观视频质量,视频编码时间体现编码的计算复杂度)进行比较分析,为了更加直观表现本文所提转码加速情况,采用以下三个指标对性能差距进行评价:(2) Open the programs of the two methods at the same time and set the same configuration file. The HEVC encoding parameter version is selected as HM16.5, and the quantization step (QP) value is 22, 26, 30, and 34 respectively. The present invention will be compared with traditional cascaded transcoding algorithms. This paper selects three kinds of video coding performance: peak signal-to-noise ratio (PSNR), bit rate and coding time (PSNR reflects the objective video quality of the video, and video coding time reflects the computational complexity of coding) for comparative analysis. In order to express this paper more intuitively For the proposed transcoding acceleration, the following three indicators are used to evaluate the performance gap:
ΔPSNR=PSNR本文-PSNR级联 ΔPSNR = PSNR paper - PSNR cascade
其中,ΔPSNR表示本发明的方法与传统级联转码算法峰值信噪比的差值,ΔBR表示本发明的方法与传统级联转码算法比特率差值的百分率,ΔT表示本发明的方法与传统级联转码算法时间差值的百分率。Among them, ΔPSNR represents the difference between the peak signal-to-noise ratio of the method of the present invention and the traditional cascaded transcoding algorithm, ΔBR represents the percentage of the bit rate difference between the method of the present invention and the traditional cascaded transcoding algorithm, and ΔT represents the difference between the method of the present invention and the traditional cascaded transcoding algorithm. The percentage of the time difference of the traditional cascade transcoding algorithm.
(3)输入2个相同的步骤1中得到的DVC重建视频序列;(3) import the DVC reconstructed video sequence that obtains in 2 identical steps 1;
(4)分别对其进行视频编码;(4) carry out video encoding to it respectively;
(5)利用传统级联转码算法对视频序列在HEVC方式下进行视频编码;(5) Using the traditional cascade transcoding algorithm to encode the video sequence in the HEVC mode;
(6)利用本发明方法对视频序列在HEVC方式下进行视频编码;(6) Utilize the method of the present invention to carry out video coding to video sequence under HEVC mode;
(7)两个程序分别输出视频编码后的视频序列以及各自的比特率、PSNR值以及总的视频编码时间,上述三个指标的结果如表1-3所示,统计显示本发明方法与传统级联转码方法在比特率方面变化了0.6%~3.8%,在PSNR方面降低了0.01dB~0.16dB,在编码计算复杂度方面降低了41.4%~68.9%。实验验证结果表明,本发明方法与传统的级联转码算法相比,在视频压缩率(由比特率下降程度来体现)和视频质量(由PSNR值得下降程度来体现)损失很小的前提下,较大程度地降低了视频编码的计算复杂度(由编码时间下降程度来体现,如表1~3所示)。(7) two programs respectively output video sequences after video encoding and respective bit rate, PSNR value and total video encoding time, the results of the above three indicators are as shown in table 1-3, statistics show that the method of the present invention and traditional The cascade transcoding method changes the bit rate by 0.6%-3.8%, reduces the PSNR by 0.01dB-0.16dB, and reduces the encoding calculation complexity by 41.4%-68.9%. Experimental verification results show that compared with the traditional cascade transcoding algorithm, the method of the present invention has very little loss in video compression rate (reflected by the degree of bit rate decrease) and video quality (reflected by the degree of decrease in PSNR value) , which greatly reduces the computational complexity of video coding (reflected by the degree of reduction in coding time, as shown in Tables 1-3).
表1本发明算法与传统级联转码算法比特率的比较Table 1 Comparison of the bit rate between the algorithm of the present invention and the traditional cascade transcoding algorithm
表2本发明算法与传统级联转码算法算法之间PSNR值的比较Table 2 Comparison of PSNR values between the algorithm of the present invention and the traditional cascade transcoding algorithm
表3本发明算法与传统级联转码算法之间视频编码时间的比较Table 3 Comparison of video encoding time between the algorithm of the present invention and the traditional cascade transcoding algorithm
。 .
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810573870.6A CN108769696A (en) | 2018-06-06 | 2018-06-06 | A kind of DVC-HEVC video transcoding methods based on Fisher discriminates |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810573870.6A CN108769696A (en) | 2018-06-06 | 2018-06-06 | A kind of DVC-HEVC video transcoding methods based on Fisher discriminates |
Publications (1)
Publication Number | Publication Date |
---|---|
CN108769696A true CN108769696A (en) | 2018-11-06 |
Family
ID=63999058
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810573870.6A Pending CN108769696A (en) | 2018-06-06 | 2018-06-06 | A kind of DVC-HEVC video transcoding methods based on Fisher discriminates |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108769696A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109274965A (en) * | 2018-11-27 | 2019-01-25 | 广东工业大学 | Fast prediction mode selection method based on statistical properties of pixel values in HEVC |
CN109743575A (en) * | 2018-12-05 | 2019-05-10 | 四川大学 | A DVC-HEVC Video Transcoding Method Based on Naive Bayes |
CN110650342A (en) * | 2019-08-31 | 2020-01-03 | 电子科技大学 | Quick coding method based on multi-feature analysis of coding unit |
CN113347415A (en) * | 2020-03-02 | 2021-09-03 | 阿里巴巴集团控股有限公司 | Coding mode determining method and device |
WO2024012263A1 (en) * | 2022-07-14 | 2024-01-18 | 广州市百果园信息技术有限公司 | Video coding processing method, apparatus and device, and storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150063469A1 (en) * | 2013-08-30 | 2015-03-05 | Arris Enterprises, Inc. | Multipass encoder with heterogeneous codecs |
CN105430407A (en) * | 2015-12-03 | 2016-03-23 | 同济大学 | A fast inter-mode decision method for transcoding from H.264 to HEVC |
CN105681797A (en) * | 2016-01-12 | 2016-06-15 | 四川大学 | Prediction residual based DVC-HEVC (Distributed Video Coding-High Efficiency Video Coding) video transcoding method |
CN107018412A (en) * | 2017-04-20 | 2017-08-04 | 四川大学 | A kind of DVC HEVC video transcoding methods based on key frame coding unit partition mode |
CN107734331A (en) * | 2017-11-17 | 2018-02-23 | 南京理工大学 | A kind of video transcoding method based on HEVC standard |
-
2018
- 2018-06-06 CN CN201810573870.6A patent/CN108769696A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150063469A1 (en) * | 2013-08-30 | 2015-03-05 | Arris Enterprises, Inc. | Multipass encoder with heterogeneous codecs |
CN105430407A (en) * | 2015-12-03 | 2016-03-23 | 同济大学 | A fast inter-mode decision method for transcoding from H.264 to HEVC |
CN105681797A (en) * | 2016-01-12 | 2016-06-15 | 四川大学 | Prediction residual based DVC-HEVC (Distributed Video Coding-High Efficiency Video Coding) video transcoding method |
CN107018412A (en) * | 2017-04-20 | 2017-08-04 | 四川大学 | A kind of DVC HEVC video transcoding methods based on key frame coding unit partition mode |
CN107734331A (en) * | 2017-11-17 | 2018-02-23 | 南京理工大学 | A kind of video transcoding method based on HEVC standard |
Non-Patent Citations (1)
Title |
---|
DONGDONG ZHANG等: "Fast CU partition for H.264 AVC to HEVC transcoding based on fisher discriminant analysis", 《2016 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP)》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109274965A (en) * | 2018-11-27 | 2019-01-25 | 广东工业大学 | Fast prediction mode selection method based on statistical properties of pixel values in HEVC |
CN109274965B (en) * | 2018-11-27 | 2021-07-20 | 广东工业大学 | A Fast Prediction Mode Selection Method Based on Pixel Value Statistical Characteristics in HEVC |
CN109743575A (en) * | 2018-12-05 | 2019-05-10 | 四川大学 | A DVC-HEVC Video Transcoding Method Based on Naive Bayes |
CN110650342A (en) * | 2019-08-31 | 2020-01-03 | 电子科技大学 | Quick coding method based on multi-feature analysis of coding unit |
CN113347415A (en) * | 2020-03-02 | 2021-09-03 | 阿里巴巴集团控股有限公司 | Coding mode determining method and device |
WO2024012263A1 (en) * | 2022-07-14 | 2024-01-18 | 广州市百果园信息技术有限公司 | Video coding processing method, apparatus and device, and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110087087B (en) | VVC inter-frame coding unit prediction mode early decision and block division early termination method | |
Liu et al. | Neural video coding using multiscale motion compensation and spatiotemporal context model | |
CN114501010B (en) | Image encoding method, image decoding method and related devices | |
CN108769696A (en) | A kind of DVC-HEVC video transcoding methods based on Fisher discriminates | |
CN105049850B (en) | HEVC bit rate control methods based on area-of-interest | |
Gao et al. | Recent standard development activities on video coding for machines | |
Pessoa et al. | End-to-end learning of video compression using spatio-temporal autoencoders | |
CN108989802B (en) | A method and system for quality estimation of HEVC video stream using inter-frame relationship | |
CN105681797B (en) | A kind of DVC-HEVC video transcoding methods based on prediction residual | |
CN106060539B (en) | A kind of method for video coding of low transmission bandwidth | |
CN102137263A (en) | Distributed video coding and decoding methods based on classification of key frames of correlation noise model (CNM) | |
CN111479110B (en) | Fast Affine Motion Estimation Method for H.266/VVC | |
WO2021129007A1 (en) | Method and device for determining video bitrate, computer apparatus, and storage medium | |
CN107018412B (en) | A kind of DVC-HEVC video transcoding method based on key frame coding unit partition mode | |
CN106791828A (en) | High performance video code-transferring method and its transcoder based on machine learning | |
CN116916036A (en) | Video compression method, device and system | |
CN107343202B (en) | Feedback-free distributed video encoding and decoding method based on additional code rate | |
CN105611301A (en) | Distributed video coding and decoding method based on wavelet domain residual errors | |
CN108737839A (en) | A kind of Lagrange factor computational methods controlled towards code check in frame | |
US20240291995A1 (en) | Video processing method and related apparatus | |
US11924449B2 (en) | Multivariate rate control for transcoding video content | |
CN116437089B (en) | A Deep Video Compression Method Based on Key Object | |
CN108769689A (en) | skip macro block judging method, video transcoding method, electronic equipment and storage medium | |
Amirpour et al. | A real-time video quality metric for HTTP adaptive streaming | |
CN100586185C (en) | A Mode Selection Method for H.264 Video Reduced Resolution Transcoding |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20181106 |