CN111447438B - Rapid intra-frame prediction mode judgment method for universal video coding - Google Patents
- Publication number
- CN111447438B (application CN202010331567.2A)
- Authority
- CN
- China
- Prior art keywords
- coding
- intra
- mode
- distortion
- prediction mode
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
The invention discloses a fast intra prediction mode decision method for Versatile Video Coding (VVC). The method uses rough mode decision (RMD) to acquire context information; predicts the coding rate $R_e$ mathematically from the probability distribution of the residuals; predicts the coding distortion $D_e$ in the pixel domain by combining the context information; and determines the intra prediction mode from the candidate mode set according to the rate-distortion cost $J_e$. Experimental results show that the method reduces coding complexity by about 29% with little impact on coding efficiency.
Description
Technical Field
The invention belongs to the technical field of video coding, and particularly relates to a fast intra-frame prediction mode judgment method for universal video coding.
Background
With the rapid development of network technology, video coding technology has matured, transmission technology has steadily improved, and video services are widely used. Achieving more efficient video coding has long been a research focus in the video field. The new-generation video coding standard, Versatile Video Coding (VVC), targets the large data volume of high-definition and ultra-high-definition video. Its advanced coding tools represent rich video content more flexibly, which saves a large amount of storage space and transmission bandwidth and helps meet the challenges of new services.
To further exploit spatial correlation and remove redundancy in video content, VVC introduces 67 intra prediction modes: multiple angular modes, the planar mode, and the DC mode. Intra prediction mode decision selects the optimal mode among them according to the texture characteristics of the video, so as to maximize the intra coding efficiency of VVC. The decision relies mainly on Lagrangian optimization: for each mode, the coding rate and coding distortion are obtained through prediction, transform, quantization, entropy coding, inverse quantization, inverse transform, and pixel reconstruction; the rate-distortion cost is then calculated, and the optimal prediction mode is determined by comparing the costs.
In VVC, the intra prediction mode decision algorithm is divided into two parts: rough mode decision (RMD) and rate-distortion optimization (RDO). The RMD module first uses the sum of absolute errors after the Hadamard transform, $D_s$, as the distortion measure to calculate the rate-distortion cost $J_s$ of an intra prediction mode:
$$J_s = D_s + \lambda_s \cdot R_s, \tag{1}$$
where $\lambda_s$ is the Lagrangian factor and $R_s$ is the number of bits consumed to encode the intra prediction mode. According to the result of equation (1), the M (M = 6) modes most likely to be the optimal intra prediction mode are selected to form the most probable (candidate) mode set. Each intra prediction mode in this set is then encoded in RDO, and the mode with the smallest rate-distortion cost $J_e$ is taken as the optimal intra prediction mode:
$$J_e = D_e + \lambda_e \cdot R_e. \tag{2}$$
Here $\lambda_e$ is the Lagrangian factor, $R_e$ is the sum of the number of coded bits of the intra prediction mode and the number of coded bits of the prediction residual, and $D_e$ is the sum of squared errors.
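The RMD stage described above can be sketched in a few lines. This is a minimal illustration, not the VVC reference encoder: it assumes square blocks whose side is a power of two, an unnormalized Hadamard transform, and hypothetical inputs `predictions` (mode index to predicted block) and `mode_bits` (mode index to $R_s$), none of which are named in the original text.

```python
import numpy as np

def hadamard_matrix(n):
    """Build an n x n Hadamard matrix (n is assumed to be a power of two)."""
    h = np.array([[1]])
    while h.shape[0] < n:
        h = np.block([[h, h], [h, -h]])
    return h

def satd(original, prediction):
    """D_s: sum of absolute Hadamard-transformed differences of a square block."""
    diff = original.astype(np.int64) - prediction.astype(np.int64)
    h = hadamard_matrix(diff.shape[0])
    return int(np.abs(h @ diff @ h.T).sum())

def rough_mode_decision(original, predictions, mode_bits, lam_s, m=6):
    """Return the m modes with the smallest J_s = D_s + lam_s * R_s (equation (1))."""
    costs = {mode: satd(original, pred) + lam_s * mode_bits[mode]
             for mode, pred in predictions.items()}
    return sorted(costs, key=costs.get)[:m]
```

The returned list plays the role of the candidate mode set that is passed on to the second stage.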
Although the many modes capture directional information within a frame well, they impose a heavy computational load and a long processing delay on the encoder, which hinders the deployment of VVC-based multimedia services. Jamali et al. use transform-domain residuals to predict the rate-distortion cost and exclude prediction modes that cannot be optimal, thereby reducing coding complexity (see Jamali M, Coulombe S. Fast HEVC intra mode decision based on RDO cost prediction [J]. IEEE Transactions on Broadcasting, 2018: 1-14). Tariq predicts the rate-distortion cost of the current block from the dynamic change of the rate-distortion cost of adjacent blocks and determines the optimal prediction mode by minimizing that cost (see Tariq J. RD-cost as statistical inference for early intra mode decision in HEVC [J]. Multimedia Tools and Applications, 2019, 78(12): 16783-16801). In addition, with the help of intelligent optimization algorithms, a new objective function for optimal prediction mode decision can be established by analyzing the characteristics of intra prediction mode decision (see Tariq J, Armghan A, Ijaz A, Ashraf I. Pure intra mode decision in HEVC using optimized firefly algorithm [J]. Journal of Visual Communication and Image Representation, 2020, vol. 68). These methods mainly optimize the decision among the 35 intra prediction modes of High Efficiency Video Coding (HEVC); their accuracy and coding efficiency are too low for them to be applied to the latest standard, VVC. It is therefore necessary to establish a fast intra mode decision method suited to VVC.
Disclosure of Invention
The invention aims to provide a fast intra prediction mode decision method for Versatile Video Coding that has lower coding complexity and higher coding efficiency.
The technical scheme adopted by the invention is a fast intra prediction mode decision method for Versatile Video Coding, which specifically comprises the following steps:
Step 1, first, the M (M = 6) candidate modes most likely to be the optimal intra prediction mode are selected from the 67 intra prediction modes using the RMD method. Then, the coding rate and the coding distortion of the M candidate modes are calculated according to step 2 and step 3, respectively, to obtain the total cost of each candidate mode, and the prediction mode with the minimum total cost is selected as the optimal intra prediction mode of the coding block.
Step 2, the transform-domain residual can be described by a zero-mean Gaussian distribution. With the sum of squared errors as the distortion measure, the theoretical rate-distortion model after quantization is as follows:
where x represents the residual, δ is the standard deviation of the residual, and e is the natural constant. At high bit rates (low distortion), the coding rate R is proportional to $\log_2 \delta$, so the coding rate of the residual is approximately:
$$R_r \approx \alpha \cdot \log_2 \delta, \tag{4}$$
where α is 0.5. The number of coded bits of the intra prediction mode can be obtained from the RMD module:
$$R_m = R_s. \tag{5}$$
The total number of coded bits $R_e$ is:
$$R_e \approx R_m + R_r. \tag{6}$$
Step 3, since VVC coding uses scalar quantization, a pixel cannot be restored to its original value after decoding, which results in distortion. According to the intra coding principle, the pixel distortion d at position (i, j) can be expressed as the difference between the original pixel value p and the reconstructed pixel value p':
$$d(i,j) = p - p' = (p_p + p_e) - (p_p + p'_e) = p_e - p'_e,$$
where $p_p$ is the pixel prediction value obtained with a given intra prediction mode, $p_e$ is the difference between the original pixel value and the predicted pixel value, and $p'_e$ is the difference between the reconstructed pixel value and the predicted pixel value. Considering that the current pixel is strongly correlated with the reference pixel and that both use the same quantization parameter, the coding distortion of the current pixel can be predicted from the coding distortion $d(r_i, r_j)$ of the reference pixel:
The total distortion $D_e$ of the coding block is finally:
The invention is further characterized in that
the parameter δ in formula (9) in step 2 is calculated as follows:
For a residual block r of size N × N, the transform coefficient matrix T is obtained after the DCT:
$$T = A \, r \, A^{T},$$
where A is the DCT matrix. Then:
where $[\cdot]_{u,u}$ denotes the coefficient value at position (u, u) of the matrix, and the symbol R' is defined as:
The parameter ρ measures the correlation between pixel values in the horizontal and vertical directions, and its value is set to 0.6. The parameter $\delta_p$ represents the standard deviation of the pixels within the residual block; this value can be approximated by means of the mean absolute difference (MAD):
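As an illustration of the two quantities named above (the DCT of a residual block and its MAD), the following sketch may help. It does not reconstruct the patent's formulas for δ, R' or the MAD approximation. It assumes a floating-point orthonormal DCT-II matrix rather than the integer transforms used by a real VVC encoder, and it assumes that MAD means the mean absolute deviation from the block mean; the function names are hypothetical.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix A of size n x n (floating-point assumption)."""
    k = np.arange(n).reshape(-1, 1)   # frequency index, one per row
    i = np.arange(n).reshape(1, -1)   # sample index, one per column
    a = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    a[0, :] = np.sqrt(1.0 / n)        # DC row uses a different scale factor
    return a

def transform_block(r):
    """Separable 2-D DCT of a square residual block: T = A r A^T."""
    a = dct_matrix(r.shape[0])
    return a @ r @ a.T

def mean_absolute_difference(r):
    """MAD of the residual block, assumed here to be the mean absolute
    deviation of the samples from the block mean."""
    return float(np.mean(np.abs(r - r.mean())))
```

For a flat block, only the DC coefficient `T[0, 0]` is non-zero and the MAD is zero, which matches the intuition that a perfectly predicted texture leaves little residual energy to code.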
Compared with the prior art, the invention has the following advantages:
(1) The invention predicts the coding rate from the perspective of rate-distortion theory, with high accuracy.
(2) The invention predicts the coding distortion from the perspective of rate-distortion theory, with high accuracy.
(3) The proposed fast intra prediction mode decision method takes the characteristics of the video content into account and achieves better coding efficiency.
(4) The proposed fast intra prediction mode decision method takes the VVC coding techniques into account and achieves better coding efficiency.
(5) The invention determines the optimal intra prediction mode using the predicted rate-distortion cost, and the coding complexity can be reduced by about 30% compared with existing methods.
Drawings
Fig. 1 is a flowchart of a fast intra mode decision method for general video coding according to the present invention.
Detailed Description
The invention will be described in detail below with reference to the drawings and the detailed description.
The invention is a fast intra mode decision method for Versatile Video Coding; the specific calculation flow is shown in Fig. 1.
Step 1, using the sum of absolute errors after the Hadamard transform, $D_s$, as the distortion measure, calculate the rate-distortion cost $J_s$ of each intra prediction mode:
$$J_s = D_s + \lambda_s \cdot R_s, \tag{1}$$
where $\lambda_s$ is the Lagrangian factor and $R_s$ is the number of bits consumed to encode the intra prediction mode. According to the result of equation (1), the M (M = 6) modes most likely to be the optimal intra prediction mode are selected to form the most probable (candidate) mode set.
Step 2, calculate the parameter δ in the i-th (i ∈ M) candidate mode:
For a residual block r of size N × N, the transform coefficient matrix T is obtained after the DCT:
$$T = A \, r \, A^{T},$$
where A is the DCT matrix. Then:
where $[\cdot]_{u,u}$ denotes the coefficient value at position (u, u) of the matrix, and the symbol R' is defined as:
The parameter ρ measures the correlation between pixel values in the horizontal and vertical directions, and its value is set to 0.6. The parameter $\delta_p$ represents the standard deviation of the pixels within the residual block; this value can be approximated by means of the mean absolute difference (MAD):
Step 3, calculate the coding rate in the i-th (i ∈ M) candidate mode.
Calculate the coding rate of the residual:
$$R_r \approx \alpha \cdot \log_2 \delta, \tag{4}$$
where α is 0.5. The coding rate of the intra prediction mode can be obtained from the RMD module:
$$R_m = R_s. \tag{5}$$
The total coding rate $R_e$ is:
$$R_e \approx R_m + R_r. \tag{6}$$
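Equations (4) to (6) amount to a one-line estimate. The sketch below is only an illustration: the function name and argument names are hypothetical, and the text does not say how δ ≤ 0 or δ < 1 (where the logarithm is undefined or negative) should be handled, so the guard below is an assumption.

```python
import math

ALPHA = 0.5  # value of alpha stated in the text

def estimate_rate(delta, mode_bits, alpha=ALPHA):
    """Estimate R_e ~ R_m + R_r, with R_r ~ alpha * log2(delta) and R_m = R_s.

    delta     -- standard deviation of the transform-domain residual
    mode_bits -- R_s, the mode bits already computed by the RMD stage
    """
    if delta <= 0:
        # Assumption: the logarithm is undefined here, so reject the input.
        raise ValueError("delta must be positive")
    r_r = alpha * math.log2(delta)   # equation (4)
    r_m = mode_bits                  # equation (5)
    return r_m + r_r                 # equation (6)
```

For example, with δ = 16 and $R_s$ = 3 bits, $R_r \approx 0.5 \cdot \log_2 16 = 2$, so $R_e \approx 5$.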
Step 4, calculate the coding distortion in the i-th (i ∈ M) candidate mode.
According to the intra coding principle, the pixel distortion d at position (i, j) can be expressed as the difference between the original pixel value p and the reconstructed pixel value p':
$$d(i,j) = p - p' = (p_p + p_e) - (p_p + p'_e) = p_e - p'_e,$$
where $p_p$ is the pixel prediction value obtained with a given intra prediction mode, $p_e$ is the difference between the original pixel value and the predicted pixel value, and $p'_e$ is the difference between the reconstructed pixel value and the predicted pixel value. Considering that the current pixel is strongly correlated with the reference pixel and that both use the same quantization parameter, the coding distortion of the current pixel can be predicted from the coding distortion $d(r_i, r_j)$ of the reference pixel:
The total distortion $D_e$ of the coding block is finally:
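The relation between p, p', $p_e$ and $p'_e$ described above can be checked numerically. The sketch below does not reproduce the patent's distortion prediction from reference pixels. It assumes a plain uniform scalar quantizer applied directly to the pixel-domain residual with a hypothetical step size `step` (a real VVC encoder quantizes transform coefficients), and it assumes that the block distortion is the sum of squared pixel distortions.

```python
import numpy as np

def reconstruct_block(original, prediction, step):
    """Quantize the residual with a uniform step and rebuild the block."""
    p_e = original.astype(np.float64) - prediction   # residual p_e = p - p_p
    p_e_rec = np.round(p_e / step) * step            # reconstructed residual p_e'
    return prediction + p_e_rec                      # p' = p_p + p_e'

def pixel_distortion(original, prediction, step):
    """Per-pixel distortion d = p - p', which equals p_e - p_e'."""
    return original - reconstruct_block(original, prediction, step)

def block_distortion(original, prediction, step):
    """Assumed block distortion: sum of squared pixel distortions."""
    d = pixel_distortion(original, prediction, step)
    return float(np.sum(d ** 2))
```

Because the prediction $p_p$ cancels in d, the distortion depends only on how the residual is quantized, which is why the text can relate the distortion of the current pixel to that of a reference pixel coded with the same quantization parameter.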
Step 5, calculate the rate-distortion cost in the i-th (i ∈ M) candidate mode according to equation (2).
Step 6, select the prediction mode with the minimum rate-distortion cost from the candidate mode set as the optimal intra prediction mode of the coding block.
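Steps 5 and 6 reduce to evaluating equation (2) for each candidate and taking the minimum. The following sketch assumes the predicted rates and distortions are already available as dictionaries keyed by mode index; the names `rate`, `distortion` and `lam_e` are hypothetical.

```python
def select_intra_mode(candidates, rate, distortion, lam_e):
    """Pick the candidate with the smallest J_e = D_e + lam_e * R_e.

    candidates -- iterable of mode indices from the RMD stage
    rate       -- dict: mode index -> predicted R_e
    distortion -- dict: mode index -> predicted D_e
    lam_e      -- Lagrangian factor of equation (2)
    """
    costs = {mode: distortion[mode] + lam_e * rate[mode] for mode in candidates}
    best = min(costs, key=costs.get)
    return best, costs
```

Since both $R_e$ and $D_e$ are predicted rather than measured, no candidate has to go through the full transform, quantization and entropy coding chain, which is where the reported time saving would come from.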
The effects of the present invention are further illustrated by experiments.
Experiments tested VVC intra coding performance in application scenarios with different resolutions. The statistics show that the invention reduces the average coding time by 31.2% while increasing the average number of coded bits by 1.53%. Faced with new video service characteristics and new coding techniques, the fast intra prediction mode decision method compresses rich video content more quickly, saves system coding time, and improves the ability to meet new service challenges in video playback, so that more users can take part in video applications and experience high-definition and ultra-high-definition video.
Claims (2)
1. A fast intra mode decision method for Versatile Video Coding, characterized by comprising the following steps:
step 1, selecting the M candidate modes most likely to be the optimal intra prediction mode from the 67 intra prediction modes by using the rough mode decision method;
step 2, calculating the standard deviation δ of the transform-domain residual in the i-th (i ∈ M) candidate mode; the parameter δ is calculated as follows:
for a residual block r of size N × N, the transform coefficient matrix T is obtained after the DCT:
$$T = A \, r \, A^{T},$$
where A is the DCT matrix, then:
where $[\cdot]_{u,u}$ denotes the coefficient value at position (u, u) of the matrix, and the symbol R' is defined as:
the parameter ρ measures the correlation between pixel values in the horizontal and vertical directions, and its value is set to 0.6; the parameter $\delta_p$ represents the standard deviation of the pixels within the residual block, and this value is approximated by means of the mean absolute difference (MAD):
step 3, calculating the coding rate of the i-th (i ∈ M) candidate mode; the coding rate is calculated as follows:
calculating the coding rate of the residual:
$$R_r \approx \alpha \cdot \log_2 \delta, \tag{4}$$
where α is 0.5; the coding rate of the intra prediction mode is obtained from the RMD module:
$$R_m = R_s; \tag{5}$$
the total coding rate $R_e$ is:
$$R_e \approx R_m + R_r; \tag{6}$$
step 4, calculating the coding distortion in the i-th (i ∈ M) candidate mode; the coding distortion is calculated as follows:
according to the intra coding principle, the pixel distortion d at position (i, j) is expressed as the difference between the original pixel value p and the reconstructed pixel value p':
$$d(i,j) = p - p' = (p_p + p_e) - (p_p + p'_e) = p_e - p'_e,$$
where $p_p$ is the pixel prediction value obtained with a given intra prediction mode, $p_e$ is the difference between the original pixel value and the predicted pixel value, and $p'_e$ is the difference between the reconstructed pixel value and the predicted pixel value; considering that the current pixel is strongly correlated with the reference pixel and that both use the same quantization parameter, the coding distortion of the current pixel is predicted from the coding distortion $d(r_i, r_j)$ of the reference pixel:
the total distortion $D_e$ of the coding block is finally:
step 5, calculating the rate-distortion cost in the i-th (i ∈ M) candidate mode;
step 6, selecting the prediction mode with the minimum rate-distortion cost from the candidate mode set as the optimal intra prediction mode of the coding block.
2. The fast intra mode decision method for general video coding according to claim 1, wherein in step 1, M takes a value of 6.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010331567.2A CN111447438B (en) | 2020-04-24 | 2020-04-24 | Rapid intra-frame prediction mode judgment method for universal video coding |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010331567.2A CN111447438B (en) | 2020-04-24 | 2020-04-24 | Rapid intra-frame prediction mode judgment method for universal video coding |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111447438A CN111447438A (en) | 2020-07-24 |
CN111447438B true CN111447438B (en) | 2023-05-26 |
Family
ID=71651901
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010331567.2A Active CN111447438B (en) | 2020-04-24 | 2020-04-24 | Rapid intra-frame prediction mode judgment method for universal video coding |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111447438B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112689146B (en) * | 2020-12-18 | 2022-07-22 | 重庆邮电大学 | Heuristic learning-based VVC intra-frame prediction rapid mode selection method |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108737839A (en) * | 2018-04-11 | 2018-11-02 | 西安理工大学 | A kind of Lagrange factor computational methods controlled towards code check in frame |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100566427C (en) * | 2007-07-31 | 2009-12-02 | 北京大学 | The choosing method and the device that are used for the intraframe predictive coding optimal mode of video coding |
CN103581682B (en) * | 2013-08-14 | 2017-08-01 | 北京交通大学 | A kind of fast mode decision algorithm of HEVC intraframe codings and its application |
US10142626B2 (en) * | 2014-10-31 | 2018-11-27 | Ecole De Technologie Superieure | Method and system for fast mode decision for high efficiency video coding |
CN104853192B (en) * | 2015-05-08 | 2018-02-13 | 腾讯科技(深圳)有限公司 | Predicting mode selecting method and device |
CN106534870B (en) * | 2016-12-19 | 2019-12-03 | 国网新疆电力公司电力科学研究院 | A kind of rate-distortion optimal coding method based on RGB source video |
CN108696750A (en) * | 2017-04-05 | 2018-10-23 | 深圳市中兴微电子技术有限公司 | A kind of decision method and device of prediction mode |
US11064203B2 (en) * | 2018-03-12 | 2021-07-13 | Nvidia Corporation | SSIM-based rate distortion optimization for improved video perceptual quality |
CN112689146B (en) * | 2020-12-18 | 2022-07-22 | 重庆邮电大学 | Heuristic learning-based VVC intra-frame prediction rapid mode selection method |
Non-Patent Citations (1)
Title |
---|
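HEVC intra coding mode selection based on rate-distortion optimization; Qiao Li; Li Bo; Liu Weijie; Modern Electronics Technique (No. 12); full text * |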
Also Published As
Publication number | Publication date |
---|---|
CN111447438A (en) | 2020-07-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8144770B2 (en) | Apparatus and method for encoding moving picture | |
US10298954B2 (en) | Device and method for image encoding/decoding using prediction direction conversion and selective encoding | |
CN103248893B (en) | From H.264/AVC standard to code-transferring method and transcoder thereof the fast frame of HEVC standard | |
CN103220527B (en) | Method for encoding images and device and its coding/decoding method and device | |
CN103327325B (en) | The quick self-adapted system of selection of intra prediction mode based on HEVC standard | |
US10091526B2 (en) | Method and apparatus for motion vector encoding/decoding using spatial division, and method and apparatus for image encoding/decoding using same | |
US20130301717A1 (en) | Intra prediction method and apparatus, and image encoding/decoding method and apparatus using same | |
KR20070012168A (en) | Method and apparatus for video signal encoding and decoding with extending directional intra prediction | |
US20100290521A1 (en) | Method and Device For Selecting Best Mode Of Intra Predictive Coding For Video Coding | |
CN104320657B (en) | The predicting mode selecting method of HEVC lossless video encodings and corresponding coding method | |
CN103997646A (en) | Rapid intra-frame prediction mode selection method in high-definition video coding | |
CA2886995C (en) | Rate-distortion optimizers and optimization techniques including joint optimization of multiple color components | |
CN100591136C (en) | Video frequency intraframe coding method based on null field decomposition | |
CN105850124B (en) | Method and apparatus for encoding and decoding video signal using additional control of quantization error | |
KR20130068479A (en) | Method of encoding intra mode by choosing most probable mode with high hit rate and apparatus for the same, and method of decoding and apparatus for the same | |
CN107864380A (en) | 3D HEVC fast intra-mode prediction decision-making techniques based on DCT | |
CN102291582A (en) | Distributed video encoding method based on motion compensation refinement | |
CN105681797A (en) | Prediction residual based DVC-HEVC (Distributed Video Coding-High Efficiency Video Coding) video transcoding method | |
US20230020946A1 (en) | Cross-codec encoding optimizations for video transcoding | |
CN103442228A (en) | Quick frame inner transcoding method from H.264/AVC standard to HEVC standard and transcoder thereof | |
US20050226329A1 (en) | Motion estimation method using multilevel succesive elimination algorithm | |
CN1206994A (en) | Signal encoding device and method, signal recording medium and signal transmission method | |
CN111447438B (en) | Rapid intra-frame prediction mode judgment method for universal video coding | |
CN101867818B (en) | Selection method and device of macroblock mode | |
Song et al. | Fast CU Partition for VVC Intra-Frame Coding via Texture Complexity |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||