CN106254870B - Video encoding method, system and computer-readable recording medium using adaptive color conversion - Google Patents
Video encoding method, system and computer-readable recording medium using adaptive color conversion Download PDFInfo
- Publication number
- CN106254870B CN106254870B CN201610357374.8A CN201610357374A CN106254870B CN 106254870 B CN106254870 B CN 106254870B CN 201610357374 A CN201610357374 A CN 201610357374A CN 106254870 B CN106254870 B CN 106254870B
- Authority
- CN
- China
- Prior art keywords
- coding
- size
- mode
- unit
- color space
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 135
- 238000006243 chemical reaction Methods 0.000 title claims description 29
- 230000003044 adaptive effect Effects 0.000 title claims description 22
- 241000023320 Luma <angiosperm> Species 0.000 claims description 32
- OSWPMRLSEDHDFF-UHFFFAOYSA-N methyl salicylate Chemical compound COC(=O)C1=CC=CC=C1O OSWPMRLSEDHDFF-UHFFFAOYSA-N 0.000 claims description 32
- 238000012545 processing Methods 0.000 claims description 4
- 230000009466 transformation Effects 0.000 abstract description 6
- 238000013519 translation Methods 0.000 abstract description 5
- 238000002474 experimental method Methods 0.000 description 39
- 238000004364 calculation method Methods 0.000 description 19
- 230000008569 process Effects 0.000 description 15
- 238000013139 quantization Methods 0.000 description 15
- 238000000638 solvent extraction Methods 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 8
- 230000011218 segmentation Effects 0.000 description 8
- 238000010219 correlation analysis Methods 0.000 description 7
- 230000008901 benefit Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 238000005192 partition Methods 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000013500 data storage Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- VBRBNWWNRIMAII-WYMLVPIESA-N 3-[(e)-5-(4-ethylphenoxy)-3-methylpent-3-enyl]-2,2-dimethyloxirane Chemical compound C1=CC(CC)=CC=C1OC\C=C(/C)CCC1C(C)(C)O1 VBRBNWWNRIMAII-WYMLVPIESA-N 0.000 description 1
- 101150114515 CTBS gene Proteins 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012854 evaluation process Methods 0.000 description 1
- 238000012812 general test Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/12—Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
- H04N19/122—Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N9/00—Details of colour television systems
- H04N9/64—Circuits for processing colour signals
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Discrete Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
一种视频编码方法及系统,该方法包括以下步骤。接收一原始视频画面(source video frame)。分割原始视频画面为一编码树单元(coding tree unit)。从编码树单元决定一编码单元(coding unit)。启用或禁用编码单元的一编码模式(coding mode)。若启用编码模式,则在启用的编码模式判断是否估计一转换单元(transform unit)的尺寸。在启用的编码模式决定编码单元的转换单元。编码单元的尺寸为NxN。
A video encoding method and system, the method includes the following steps. Receive a source video frame. Split the original video picture into a coding tree unit (coding tree unit). A coding unit (coding unit) is determined from the coding tree unit. Enable or disable a coding mode of the coding unit. If the encoding mode is enabled, it is determined whether to estimate the size of a transformation unit (transform unit) in the enabled encoding mode. The enabled encoding mode determines the encoding unit's translation unit. The size of the coding unit is NxN.
Description
技术领域technical field
本公开涉及视频编码与解码方法及系统。The present disclosure relates to video encoding and decoding methods and systems.
背景技术Background technique
对于高质量图像的需求逐渐增加。随着4K及8K等视频规格的来临,极需提升视频编码与解码效率。此外,消费者期待能够通过各种传输媒介来传输与接收高质量图像。举例来说,消费者希望能够通过网络于携带式装置(如智能手机、平板计算机、笔记型计算机)及家用电视与计算机上观看高质量图像。消费者也希望能够在视频会议及屏幕共享的过程中显示高质量图像。The demand for high-quality images is gradually increasing. With the advent of video standards such as 4K and 8K, it is extremely necessary to improve the efficiency of video encoding and decoding. In addition, consumers expect to be able to transmit and receive high-quality images through a variety of transmission media. For example, consumers want to be able to view high-quality images on portable devices (eg, smart phones, tablet computers, notebook computers) and home TVs and computers via the Internet. Consumers also expect high-quality images to be displayed during video conferencing and screen sharing.
高效率视频编码标准(High Efficiency Video Coding,HEVC)H.265在改进视频压缩的编码与解码效能上提供了一个新的标准。相较于原先的AVC (Advanced VideoCoding)标准,由ISO/IEC JTC 1/SC 29/WG 11 MPEG (Moving Picture Experts Group)及ITU-T SG16 VCEG(Video Coding Experts Group)所建立的HEVC能够降低压缩高质量视频的数据率。AVC标准亦称为H.264。High Efficiency Video Coding (HEVC) H.265 provides a new standard for improving the encoding and decoding performance of video compression. Compared with the original AVC (Advanced Video Coding) standard, HEVC established by ISO/IEC JTC 1/SC 29/WG 11 MPEG (Moving Picture Experts Group) and ITU-T SG16 VCEG (Video Coding Experts Group) can reduce compression Data rate for high quality video. The AVC standard is also known as H.264.
HEVC利用帧间预测技术(Inter prediction)及帧内预测技术(Intraprediction)等各种编码工具来压缩视频。帧间预测技术利用视频串流的不同视频画面之间的时间冗余(temporal redundancies)来压缩视频数据。举例来说,含有相似内容的已编码及已解码的视频画面可用来编码目前的视频画面。这些已编码及已解码的视频画面可以用来预测目前视频画面的编码区域。相对地,帧内预测技术仅利用目前编码视频画面的内部数据来压缩视频数据。帧内预测技术并不使用不同视频画面的时间冗余。举例来说,目前视频画面是利用同一画面的另一部分来进行编码。帧内预测技术包括35种帧内模式,包含平面模式(Planar mode)、DC模式及33种定向模式(directional modes)。HEVC utilizes various coding tools such as Inter prediction and Intraprediction to compress video. Inter prediction techniques utilize temporal redundancies between different video pictures of a video stream to compress video data. For example, encoded and decoded video pictures containing similar content may be used to encode the current video picture. These encoded and decoded video pictures can be used to predict the encoding region of the current video picture. In contrast, intra-prediction techniques utilize only the internal data of currently encoded video pictures to compress video data. Intra prediction techniques do not use the temporal redundancy of different video frames. For example, current video pictures are encoded using another part of the same picture. The intra-frame prediction technology includes 35 intra-frame modes, including a Planar mode, a DC mode, and 33 directional modes.
相较于AVC标准,HEVC标准对各个输入视频画面采用扩张分割技术 (expansivepartitioning and dividing)。AVC标准在编码及解码时仅利用输入视频画面的大区块(macroblock)进行分割。相反地,HEVC标准可以分割输入视频画面成不同尺寸的数据单元及区块,相关说明如后。相较于AVC标准, HEVC标准对动态、多细节及多边缘的视频画面的编码及解码程序提供了更多的弹性。Compared with the AVC standard, the HEVC standard adopts an expansion partitioning and dividing technique for each input video picture. The AVC standard uses only macroblocks of the input video picture for segmentation during encoding and decoding. Conversely, the HEVC standard can divide the input video frame into data units and blocks of different sizes, as described below. Compared with the AVC standard, the HEVC standard provides more flexibility for the encoding and decoding procedures of dynamic, multi-detail and multi-edge video images.
一些能够改善视频编码程序的编码工具亦列入于HEVC标准中。此些编码工具被称为编码扩展(coding extensions)。屏幕内容编码扩展(Screen Content Codingextension,SCC extension)专注于改善HEVC标准下的视频屏幕内容的处理效能。屏幕内容为图案、文字或动画所成像(render)的视频,而不是照相机所提取的视频场景。成像的图案、文字或动画可以是动态或静态,且可以提供于照相机所提取的视频场景内的视频。SCC的应用实例可以包含屏幕镜射(Screen mirroring)、云端游戏(cloud gaming)、无线显示内容 (wireless display of content)、远端计算机存取时的显示(displays generatedduring remote computer desktop access)及屏幕共享(screen sharing)(例如是视频会议的即时屏幕共享)。Some coding tools that can improve the video coding process are also included in the HEVC standard. Such coding tools are called coding extensions. The Screen Content Coding extension (SCC extension) focuses on improving the processing performance of video screen content under the HEVC standard. The content of the screen is a video rendered by a pattern, text or animation, rather than a video scene extracted by a camera. The imaged patterns, text or animation can be dynamic or static and can be provided in video within the video scene captured by the camera. Application examples of SCC can include screen mirroring, cloud gaming, wireless display of content, displays generated during remote computer desktop access, and screen sharing (screen sharing) (eg instant screen sharing for video conferencing).
SCC内的一编码工具为自适性色彩转换(adaptive color transform,ACT)。 ACT为应用于编码单元(coding unit,CU)的残差像素样本(residue pixel samples)的色彩空间转换。对特定的色彩空间而言,已存在一编码单元(CU) 的一像素的色彩元素(colorcomponents)的相关性。当像素的色彩元素的相关性高时,像素执行ACT可以通过去相关性(de-correlating)来帮助相关的色彩元素集中能量。这种集中能量的作法能够提高编码效率,并降低编码成本。因此,ACT能够在HEVC编码过程中增进编码效能。An encoding tool within the SCC is adaptive color transform (ACT). ACT is a color space transformation applied to residual pixel samples (residue pixel samples) of a coding unit (CU). For a particular color space, correlations exist for color components of a pixel of a coding unit (CU). When the correlation of color elements of a pixel is high, the pixel performing ACT can help related color elements concentrate energy by de-correlating. This way of concentrating energy can improve coding efficiency and reduce coding cost. Therefore, ACT can improve coding performance during HEVC coding.
然而,在编码过程中,需要额外的码率失真函数(rate distortionoptimization,RDO)来评估是否启用ACT。RDO用来评估码率失真(rate distortion,RD)的成本。这些评估过程可能会增加编码复杂度及编码时间。再者,当像素的色彩元素已经去相关时,ACT可能就不是必须的。在这种情况下,由于执行ACT的成本高于编码的效益,进一步对色彩元素进行的去相关性程序可能无法带来任何好处。However, during encoding, an additional rate distortion optimization (RDO) function is required to evaluate whether ACT is enabled. RDO is used to estimate the cost of rate distortion (RD). These evaluation processes may increase coding complexity and coding time. Furthermore, ACT may not be necessary when the color elements of the pixels have been decorrelated. In this case, further decorrelation procedures on color elements may not bring any benefit since the cost of performing ACT outweighs the benefits of encoding.
发明内容SUMMARY OF THE INVENTION
根据本公开的一方面,提供一种视频编码方法。视频编码方法包括以下步骤。接收一原始视频画面(source video frame)。分割原始视频画面为一编码树单元(coding treeunit)。从编码树单元决定一编码单元(coding unit)。启用或禁用编码单元的一编码模式(coding mode)。若启用编码模式,则在启用该编码模式判断是否估计一转换单元(transform unit)的尺寸。在启用的编码模式决定编码单元的转换单元。编码单元的尺寸为NxN。According to an aspect of the present disclosure, a video encoding method is provided. The video encoding method includes the following steps. A source video frame is received. The original video picture is divided into a coding tree unit. A coding unit is determined from the coding tree unit. Enables or disables a coding mode of the coding unit. If the encoding mode is enabled, it is determined whether to estimate the size of a transform unit when the encoding mode is enabled. Determines the translation unit of the coding unit in the enabled coding mode. The size of the coding unit is NxN.
根据本公开的另一方面,提供一种视频编码系统。视频编码系统包括一存储器及一处理器。存储器用以存储一组指令。处理器用以执行此组指令。此组指令包括以下步骤。接收一原始视频画面(source video frame)。分割原始视频画面为一编码树单元(codingtree unit)。从编码树单元决定一编码单元(coding unit)。启用或禁用编码单元的一编码模式(coding mode)。若启用编码模式,则在启用的编码模式判断是否估计一转换单元(transform unit) 的尺寸。在启用的编码模式决定编码单元的转换单元。编码单元的尺寸为NxN。According to another aspect of the present disclosure, a video encoding system is provided. The video encoding system includes a memory and a processor. Memory is used to store a set of instructions. The processor is used to execute the set of instructions. This set of instructions includes the following steps. A source video frame is received. The original video picture is divided into a coding tree unit. A coding unit is determined from the coding tree unit. Enables or disables a coding mode of the coding unit. If the encoding mode is enabled, it is determined whether to estimate the size of a transform unit in the enabled encoding mode. Determines the translation unit of the coding unit in the enabled coding mode. The size of the coding unit is NxN.
根据本公开的另一方面,提供一种非暂态计算机可读取记录介质。非暂态计算机可读取记录介质用以存储一组指令。此组指令由一或多个处理器执行,以执行一视频编码方法。此视频编码方法包括以下步骤。接收一原始视频画面(source video frame)。分割原始视频画面为一编码树单元(coding tree unit)。从编码树单元决定一编码单元(codingunit)。启用或禁用编码单元的一编码模式(coding mode)。若启用编码模式,则在启用的编码模式判断是否估计一转换单元(transform unit)的尺寸。在启用的编码模式决定编码单元的转换单元。编码单元的尺寸为NxN。According to another aspect of the present disclosure, a non-transitory computer-readable recording medium is provided. A non-transitory computer-readable recording medium stores a set of instructions. The set of instructions are executed by one or more processors to perform a video encoding method. This video encoding method includes the following steps. A source video frame is received. The original video picture is divided into a coding tree unit. A coding unit is determined from the coding tree unit. Enables or disables a coding mode of the coding unit. If the encoding mode is enabled, it is determined whether to estimate the size of a transform unit in the enabled encoding mode. Determines the translation unit of the coding unit in the enabled coding mode. The size of the coding unit is NxN.
为了对本公开的上述及其他方面有更佳的了解,下文特举多个实施例,并配合附图,作详细说明如下:In order to have a better understanding of the above-mentioned and other aspects of the present disclosure, a number of embodiments are exemplified below, and are described in detail as follows in conjunction with the accompanying drawings:
附图说明Description of drawings
图1A~1J绘示视频画面及根据本公开数个实施例的相关的分割。1A-1J illustrate video frames and related segmentations according to several embodiments of the present disclosure.
图2绘示本公开的视频编码器。2 illustrates a video encoder of the present disclosure.
图3说明根据本公开一实施例的编码方法。FIG. 3 illustrates an encoding method according to an embodiment of the present disclosure.
图4说明根据本公开另一实施例的编码方法。FIG. 4 illustrates an encoding method according to another embodiment of the present disclosure.
图5说明根据本公开另一实施例的编码方法。FIG. 5 illustrates an encoding method according to another embodiment of the present disclosure.
图6说明根据本公开另一实施例的编码方法。FIG. 6 illustrates an encoding method according to another embodiment of the present disclosure.
图7说明非444色度格式的IPM的演算流程。FIG. 7 illustrates the calculation flow of IPM in non-444 chroma format.
图8绘示执行本公开的编码与解码方法的系统。FIG. 8 illustrates a system implementing the encoding and decoding methods of the present disclosure.
【符号说明】【Symbol Description】
101:视频画面(video frame)101: Video frame (video frame)
102:编码树单元(coding tree unit,CTU)102: coding tree unit (coding tree unit, CTU)
103:亮度编码树区块(luma coding tree block,luma CTB)103: Luma coding tree block (luma coding tree block, luma CTB)
104:Cb CTB104: Cb CTB
105:Cr CTB105: Cr CTB
106、111:相关说明106, 111: related instructions
107-1、107-2、107-3、107-4:亮度编码区块(luma coding block,luma CB)107-1, 107-2, 107-3, 107-4: Luma coding block (luma coding block, luma CB)
108:编码单元(Coding unit,CU)108: Coding unit (Coding unit, CU)
109:Cb CB109: Cb CB
110:Cr CB110: Cr CB
112:亮度预测区块(luma prediction block,PB)112: Luma prediction block (luma prediction block, PB)
113-1、113-2、113-3、113-4:转换区块(transform block,TB)113-1, 113-2, 113-3, 113-4: Transform block (TB)
114:转换单元(Transform unit,TU)114: Transformation unit (Transform unit, TU)
200:视频编码器200: Video encoder
202:画面分割模块(Frame Dividing Module)202: Frame Dividing Module
204:帧间预测启用ACT模块(Inter Prediction enabling adaptive colortransformation Module)204: Inter Prediction enabling adaptive colortransformation Module
206:帧间预测禁用ACT模块(Inter Prediction disabling ACT Module)206: Inter Prediction disabling ACT Module
208:画面寄存器(Frame Buffer)208: Picture register (Frame Buffer)
210:模式决定模块(Mode Decision Module)210: Mode Decision Module
212:帧内预测启用ACT模块(Intra Prediction enabling ACT Module)212: Intra Prediction enabling ACT Module (Intra Prediction enabling ACT Module)
214:帧内预测禁用ACT模块(Intra Prediction disabling ACT Module)214: Intra Prediction disabling ACT Module
216、218:加总模块(Summing Module)216, 218: Summing Module
220:切换器220: Switcher
222:自适性色彩转换(ACT)模块222: Adaptive Color Conversion (ACT) module
224:CCP、转换及量化模块(CCP,Transform,and Quantization Module)224: CCP, Transform, and Quantization Module
226:熵编码模块(Entropy Coding Module)226: Entropy Coding Module
228:逆运算CCP、转换及量化模块(Inverse CCP,Transform,and QuantizationModule)228: Inverse CCP, Transform, and Quantization Module (Inverse CCP, Transform, and QuantizationModule)
230:切换器230: Switcher
232:逆运算ACT模块(Inverse ACT Module)232: Inverse ACT Module (Inverse ACT Module)
300、400、500、600、700、800:编码方法300, 400, 500, 600, 700, 800: Coding method
304:分量相关性分析(component correlation analysis)304: Component correlation analysis
306:概略模式决定(Rough mode decision)306: Rough mode decision
308:结束308: end
310:码率失真函数模式决定(rate distortion optimization mode decision,RDO mode decision)310: Rate distortion optimization mode decision (RDO mode decision)
311:色度格式是否为非444(non-444)的判断311: Judging whether the chroma format is non-444 (non-444)
312:CU尺寸是否小于临界值T1的判断312: Judgment of whether the CU size is smaller than the critical value T1
314:TU尺寸决定(TU size decision)314: TU size decision
316:色度模式决定(chroma mode decision)316: chroma mode decision
402:CU尺寸是否小于临界值T2的判断402: Judgment of whether the CU size is smaller than the critical value T2
702:非暂态计算机可读取介质702: Non-transitory computer readable media
704:处理器704: Processor
具体实施方式Detailed ways
以下将搭配附图详细地说明示例性的实施例。在下面描述的附图中,除非另有说明,在不同附图的相同标号代表相同或近似的元件。以下提出的实施例并非代表本公开的所有实施情况。事实上,这些实施例仅仅是对应于权利要求书的系统与方法的一些实例。Exemplary embodiments will be described in detail below in conjunction with the accompanying drawings. In the drawings described below, unless otherwise indicated, the same reference numbers in different drawings represent the same or similar elements. The examples presented below are not representative of all implementations of the present disclosure. In fact, these embodiments are merely examples of systems and methods corresponding to the claims.
图1A~1J说明根据本公开的实施例的视频画面及其相关的分割。1A-1J illustrate a video picture and its associated segmentation according to embodiments of the present disclosure.
图1A绘示视频画面101。视频画面101包括数个像素。视频画面101被分割为数个编码树单元(coding tree units,CTUs)102。每个CTU 102的尺寸是根据L个垂直样本及L个水平样本(LxL)来决定。每个样本于CTU的不同像素位置对应于一像素值。举例来说,L可以是16、32、或64。像素位置可以是像素于CTU所在的位置或像素之间的位置。当像素位置是像素之间的位置,像素值可以是像素位置附近的一或多个像素的内插值。各个CTU 102 包括亮度编码树区块(luma coding tree block,luma CTB)、色度编码树区块 (chroma CTB)及相关说明(associated syntax)。FIG. 1A shows a
图1B绘示数个CTB可以被包含于图1A的一个CTU 102中。举例来说, CTU 102可以包含亮度CTB(luma CTB)103、色度CTB(chroma CTB)(含Cb CTB 104Cr CTB 105)。CTU 102也可以包括相关说明(associated syntax) 106。Cb CTB 104为蓝色色差CTB(bluedifference chroma component CTB),其表示CTB在蓝色的变化。Cr CTB 105为红色色差CTB(red difference chroma component CTB),其表示CTB在红色的变化。相关说明106包含亮度CTB 103、 Cb CTB 104及Cr CTB 105如何被编码的信息、以及亮度CTB 103、Cb CTB104及Cr CTB 105的进一步分割。CTB 103、Cb CTB 104及Cr CTB 105的尺寸可以相同于CTU102的尺寸。或者,亮度CTB 103的尺寸可以相同于CTU 102的尺寸,但Cb CTB 104及Cr CTB105的尺寸可以小于CTU 102的尺寸。FIG. 1B shows that several CTBs may be included in one
帧内预测(intra prediction)、帧间预测(inter prediction)及其他等编码工具运作于编码区块(coding blocks,CBs)上。为了决定编码的程序是要采用帧内预测还是帧间预测,CTB可以被分割为一或多个CB。CTB分割为 CB的程序是根据四分树分割(quad-treepartitioning)技术。因此,CTB可以分割为四个CB,各个CB可以再分割为四个CB。根据CTB的尺寸,可以继续进行这样的分割程序。Coding tools such as intra prediction, inter prediction and others operate on coding blocks (CBs). In order to decide whether the coding procedure is to use intra-frame prediction or inter-frame prediction, the CTB can be divided into one or more CBs. The procedure for partitioning CTB into CB is based on quad-tree partitioning techniques. Therefore, the CTB can be divided into four CBs, and each CB can be further divided into four CBs. Depending on the size of the CTB, such a segmentation procedure may continue.
图1C绘示图1B的亮度CTB 103被分割为一或多个亮度CB 107-1、107-2、 107-3或107-4。以64x64的亮度CTB为例,对应的亮度CB 107-1、107-2、 107-3或107-4可以是NxN的尺寸,例如是64x64、32x32、16x16或8x8。在图1C中,亮度CTB 103的尺寸为64x64。而亮度CTB103的尺寸可以为32x32 或16x16。FIG. 1C shows the
图1D绘示图1B的亮度CTB 103进行四分树分割的一实例,其中亮度 CTB 103分割为图1C的亮度CB 107-1、107-2、107-3或107-4。在图1D中,亮度CTB 103的尺寸为64x64。然而,亮度CTB 103的尺寸也可以是32x32 或16x16。FIG. 1D illustrates an example of quadtree partitioning performed by the
在图1D中,亮度CTB 103分割为四个32x32的亮度CB 107-2。各个32x32 的亮度CB可以更分割为四个16x16的亮度CB 107-3。各个16x16的亮度CB 可以更分割为四个8x8的亮度CB 107-4。In Figure ID, the
编码单元(Coding unit,CU)用以编码CB。CTB可以包括唯一一个 CU、或者分割为数个CU。因此CU的尺寸也可以是NxN,例如是64x64、 32x32、16x16或8x8。各个CU包括一个亮度CB、两个色度CB及相关说明。于编码及解码程序中产生的残差CU的尺寸可相同于其对应的CU的尺寸。A coding unit (Coding unit, CU) is used to encode the CB. A CTB may include a single CU, or be divided into several CUs. Therefore, the size of the CU can also be NxN, such as 64x64, 32x32, 16x16 or 8x8. Each CU includes one luma CB, two chrominance CBs, and related descriptions. The size of the residual CU generated in the encoding and decoding process may be the same as the size of its corresponding CU.
图1E绘示CB(图1C的亮度CB 107-1)的示意图,此些CB可以是CU 108的一部分。举例来说,CU 108可以包括亮度CB 107-1及色度CB(Cb CB 109)及色度CB(Cr CB 110)。CU108可以包括相关说明111。相关说明111 包含如何对亮度CB 107-1、Cb CB 109及Cr CB110进行编码的信息,例如是四分树信息的说明(亮度CB及色度CB的尺寸、位置与进一步的分割)。各个CU 108可于亮度CB 107-1、Cb CB 109及Cr CB 110具有相关的预测区块(prediction blocks,PBs)。预测区块组合成预测单元(prediction units,PUs)。FIG. 1E shows a schematic diagram of CBs (luminance CB 107 - 1 of FIG. 1C ), which may be part of
图1F绘示图1D的CB 107-1分割为亮度PB 112的各种可能情况。亮度 CB 107-1例如是根据亮度CB 107-1的不同区域的可预测性来分割为亮度PB 112。举例来说,亮度CB107-1可以包含单一个亮度PB 112,其尺寸相同于亮度CB 107-1。或者,亮度CB 107-1可以垂直地或水平地分割为两个偶数亮度PB 112。或者亮度CB 107-1可以垂直地或水平地分割为四个亮度PB 112。需说明的是图1F仅仅作为示例。在HEVC标准下的任何分割为PB的方式都属于本公开的范围。图1F所绘示将亮度CB 107-1分割为亮度PB 112的方式是互斥的。举例来说,在HEVC的帧内预测模式中,64x64、32x32及16x16 的CB可能被分割为单一个PB,其尺寸相同于CB。然而,8x8的CB可能被分割为单一个8x8的PB或四个4x4的PB。FIG. 1F illustrates various possible scenarios of partitioning of CB 107-1 into
一但采用帧内预测或帧间预测,由预测区块与来源视频图像区块之间不同处所产生的残差信号(residual signal)被转换至另一域(domain),以进一步进行离散余弦转换(discrete cosine transform,DCT)或离散正弦变换(discrete sine transform,DST)的编码。为了提供这些转换,各个CU或各个CB需要利用一或多个转换区块(transform block,TB)。Once intra-frame prediction or inter-frame prediction is used, the residual signal generated by the difference between the prediction block and the source video image block is converted to another domain for further discrete cosine transform (discrete cosine transform, DCT) or discrete sine transform (discrete sine transform, DST) encoding. To provide these transformations, each CU or each CB needs to utilize one or more transform blocks (TBs).
图1G绘示图1E或图1F的亮度CB 107-1如何被分割为不同的TB 113-1、 113-2、113-3及113-4。若亮度CB 107-1为64x64的CB,TB 113-1为32x32 的TB,TB 113-2为16x16的TB,TB 113-3为8x8的TB,并且TB 113-4为 4x4的TB。亮度CB 107-1可以被分割为4个TB113-1、16个TB 113-2、64 个TB 113-3及256个TB 113-4。一个亮度CB 107-1可以被分割为相同尺寸的TB 113或不同尺寸的TB 113。1G illustrates how the luminance CB 107-1 of FIG. 1E or FIG. 1F is divided into different TBs 113-1, 113-2, 113-3, and 113-4. If luminance CB 107-1 is a 64x64 CB, TB 113-1 is a 32x32 TB, TB 113-2 is a 16x16 TB, TB 113-3 is an 8x8 TB, and TB 113-4 is a 4x4 TB. The luminance CB 107-1 can be divided into 4 TBs 113-1, 16 TBs 113-2, 64 TBs 113-3, and 256 TBs 113-4. One luminance CB 107-1 may be divided into TBs 113 of the same size or TBs 113 of different sizes.
将CB分割为TB的程序根据四分树分割(quad-tree splitting)。因此,一个CB可以被分割为一或多个TB,其中各个TB可以更进一步被分割为4个 TB。这样的分割程序可以根据CB的尺寸来继续进行。The procedure for splitting CBs into TBs is based on quad-tree splitting. Thus, a CB can be partitioned into one or more TBs, where each TB can be further partitioned into 4 TBs. Such a segmentation procedure may continue depending on the size of the CB.
图1H绘示图1E或图1F的亮度CB 107-1的四分树分割,其利用各种分割方式分割为图1G的TB 113-1、113-2、113-3或113-4。在图1H中,亮度 CB 107-1的尺寸为64x64。然而,亮度CB 107-1的尺寸也可以是32x32或16x16。1H illustrates a quadtree partition of luma CB 107-1 of FIG. 1E or FIG. 1F, which is partitioned into TBs 113-1, 113-2, 113-3, or 113-4 of FIG. 1G using various partitioning methods. In FIG. 1H, the size of luminance CB 107-1 is 64x64. However, the size of the luminance CB 107-1 may also be 32x32 or 16x16.
在图1H中,亮度CB 107-1被分割为四个32x32的TB 113-1。各个32x32 的TB可以更进一步被分割为4个16x16的TB 113-2。各个16x16的TB可以更进一步被分割为4个8x8的TB113-3。各个8x8的TB可以更进一步被分割为4个4x4的TB 113-4。In Figure 1H, the luminance CB 107-1 is divided into four 32x32 TBs 113-1. Each 32x32 TB can be further divided into four 16x16 TBs 113-2. Each 16x16 TB can be further divided into four 8x8 TB113-3. Each 8x8 TB can be further divided into four 4x4 TBs 113-4.
TB 113接着以进行DCT或任何HEVC标准的转换。转换单元(Transform units,TUs)汇总TB 113。一或多个TB被各个CB采用。CB形成各个CU。因此,转换单元(TU)的结构于不同的CU 108是不同的,并且是由CU 108 来决定的。TB 113 is then followed by conversion to DCT or any HEVC standard. Transform units (TUs) summarize TB 113. One or more TBs are employed by each CB. CBs form individual CUs. Therefore, the structure of the translation unit (TU) is different for
图1I绘示TU 114各种不同分割的TB 113-1、113-2、113-3及113-4。各个TU汇总图1G或图1H分割的TB。32x32的TU 114可以采用32x32的单一个TB 113-1、或一或多个16x16的TB 113-2、8x8的TB 113-3、或4x4的 TB 113-4。对采用HEVC的帧间预测的CU而言,TU可以大于PU,使得TU 可以包含PU边界(boundaries)。然而,对采用HEVC的帧内预测的CU而言,TU可以不穿越(cross)PU边界。FIG. 1I depicts TBs 113-1, 113-2, 113-3, and 113-4 of various partitions of
图1J绘示图1I的TU 114的四分树分割,其利用图1I的各种TB 113-1、 113-2、113-3或113-4。在图1J中,TU 114的尺寸为32x32。然而,TU的尺寸可以是16x16、8x8、或4x4。FIG. 1J depicts a quadtree partition of the
在图1J中,TU 114被分割为一个32X32的TB 113-1及4个16x16的TB 113-2。各个16x16的TB可以更进一步分割为4个8x8的TB 113-3。各个8x8 的TB可以更进一步分割为4个4x4的TB 113-4。In Figure 1J,
本公开所述的CTU、CTB、CB、CU、PU、PB、TU或TB皆可以包括 HEVC标准的任何特征(feature)、尺寸(size)及特性(property)。第1C、1E及1F图所述的分割也可以应用于色度CTB(Cb CTB 104)、色度CTB(Cr CTB 105)及色度CB(Cb CB 109)、色度CB(Cr CB 110)。The CTU, CTB, CB, CU, PU, PB, TU, or TB described in this disclosure may include any feature, size, and property of the HEVC standard. The segmentation described in Figures 1C, 1E and 1F can also be applied to Chroma CTB (Cb CTB 104), Chroma CTB (Cr CTB 105) and Chroma CB (Cb CB 109), Chroma CB (Cr CB 110) .
图2绘示执行本公开的编码方法的视频编码器200。视频编码器200可以包括一或多个附加元件,其提供HEVC-SCC的附加的编码功能,如调色盘模式(palette mode)、样本自适性偏移(sample adaptive offset)及去块滤波 (de-blocking filtering)。此外,本公开考虑到ACT的帧内预测模式及其他编码模式,例如是ACT的帧间预测模式。FIG. 2 illustrates a
视频编码器200接收输入的一原始视频画面(source video frame)。此输入原始视频画面先输入至画面分割模块(Frame Dividing Module)202。画面分割模块202分割原始视频画面为至少一原始CTU(source CTU)。原始CU (source CU)再由原始CTU来获得。原始CTU的尺寸及原始CU的尺寸由画面分割模块202来决定。接着,以逐CU的方式进行编码。原始CU由画面分割模块202输出后,输入至帧间预测启用ACT模块(Inter Predictionenabling adaptive color transformation Module)204、帧间预测禁用ACT模块 (InterPrediction disabling ACT Module)206、帧内预测启用ACT模块(Intra Predictionenabling ACT Module)212及帧内预测禁用ACT模块(Intra Prediction disabling ACTModule)214。The
输入画面的原始CU被帧间预测启用ACT模块204编码,其利用帧间预测技术且启用自适性色彩转换(ACT)自输入画面决定一原始CU的预测。输入画面的原始CU也被帧间预测禁用ACT模块206编码,其利用帧间预测技术且不启用自适性色彩转换(ACT)自输入画面决定一原始CU的预测(即禁用ACT)。The original CU of the input picture is encoded by the inter prediction enabled
在帧间预测时可以使用存储于画面寄存器(Frame Buffer)208的参考CU。原始PU及PB也由原始CU来获得,且使用于帧间预测启用ACT模块204 及帧间预测禁用ACT模块206的帧间预测程序。帧间预测利用不同时间的视频画面的区域来进行运动检测。帧间预测启用ACT模块204及帧间预测禁用 ACT模块206的已编码帧间预测CU预定为最高画面质量。已编码帧间预测 CU接着被输入至模式决定模块(Mode Decision Module)210。The reference CU stored in the
输入画面的原始CU也藉由帧内预测启用ACT模块212进行编码,其利用帧内预测技术且启用自适性色彩转换(ACT)自输入画面决定一原始CU 的预测。The original CU of the input picture is also encoded by the intra-prediction enabled
输入画面的原始CU也藉由帧内预测禁用ACT模块214进行编码,其利用帧内预测技术且不启用自适性色彩转换(ACT)自输入画面决定一原始CU 的预测(即禁用ACT)。The original CU of the input picture is also encoded by the intra-prediction
帧内预测启用ACT模块212及帧内预测禁用ACT模块214进行帧内预测时,可使用存储于画面寄存器208的同一画面的原始CU。原始PU及PB 也由原始CU来获得,且使用于帧内预测启用ACT模块212及帧内预测禁用 ACT模块214的帧内预测程序。已编码的帧内预测CU预定为最高画面质量。从帧内预测启用ACT模块212及帧内预测禁用ACT模块214输出的已编码帧内预测CU输入至模式决定模块210。The intra-prediction-enabled
在模式决定模块210中,采用帧间预测启用ACT、帧间预测禁用ACT、帧内预测启用ACT及帧内预测禁用ACT等方式进行原始CU的编码的成本搭配预测CU的质量进行比较。根据比较的结果,决定哪一编码模式的预测 CU(例如是帧间预测CU或帧内预测CU)。被选择的预测CU接着被传送至加总模块(Summing Module)216、218。In the
在加总模块216中,原始CU减去已选择的预测CU,以提供一剩余CU (residualCU)。若已选择的预测CU是来自在帧间预测启用ACT模块204 及帧内预测启用ACT模块121的其中之一,切换器(switch)220切换至位置A。在位置A,剩余CU输入至ACT模块(ACT Module)222,并接着输入至CCP、转换及量化模块224(CCP,Transform,and Quantization Module)224。然而,若已选择的预测CU是来自在帧间预测禁用ACT模块206及帧内预测禁用ACT模块214的其中之一,切换器220切换至位置B。在位置B, ACT模块222被跳过且在编码过程中不被执行。剩余CU从加总模块216直接被输入至CCP、转换及量化模块224。In the summing
在ACT模块222,自适性色彩转换(adaptive color transform)执行于剩余CU。ACT模块222的输出接入至CCP、转换及量化模块224。At
CCP、转换及量化模块224执行跨组件预测(cross component prediction, CCP)、转换(如离散续余弦转换(Discrete Cosine Transform,DCT)或离散正弦转换(DiscreteSine Transform,DST)、及剩余CU的量化。CCP、转换及量化模块224的输出接入至熵编码模块(Entropy Coding Module)226及逆运算CCP、转换及量化模块(Inverse CCP,Transform,and Quantization Module)228。The CCP, transform and
熵编码模块226执行剩余熵编码(entropy encoding)。举例来说,全文自适应二进位算术编码(Context Adaptive Binary Arithmetic Coding,CABAC) 可以被执行以编码剩余CU。HEVC所提供的任何其他熵编码程序皆可执行于熵编码模块226中。The
在执行熵编码之后,输入视频画面的CU的已编码比特流自视频编码器 200输出。输出的已编码比特流可以存储于一存储器、通过传输线广播或网络、或提供至一显示器等。After performing entropy encoding, the encoded bitstream of the CU of the input video picture is output from the
在逆运算CCP、转换及量化模块228中,执行CCP、转换及量化模块224 的相反决定于剩余CU,以提供一重建的剩余CU。In the inverse CCP, transform and
若已选择的预测CU是来自在帧间预测启用ACT模块204或帧内预测启用ACT模块212,切换器(switch)230切换至位置C。在位置C,重建的剩余CU输入至逆运算ACT模块(Inverse ACT Module)232并接着输入至加总模块(Summing Module)218。然而,若已选择的预测CU是来自在帧间预测禁用ACT模块206或帧内预测禁用ACT模块214,切换器230切换至位置D。在位置D,逆运算ACT模块232被跳过而不被执行,且重建的剩余CU被直接输入至加总模块218。The
逆运算ACT模块232对重建的剩余CU执行ACT模块232的自适性色彩转换的逆运算。逆运算ACT模块232的输出输入至加总模块218。Inverse operation The
在加总模块218中,来自模式决定模块210的已选择的预测CU加上重建的剩余CU,以提供重建的原始CU(reconstructed source CU)。重建的原始 CU接着被存储于画面寄存器208,以供其他CU的帧间预测及帧内预测使用。In the
以下说明的编码方法300、400及500如何执行于帧内预测启用ACT模块212内。编码方法300、400及500能够改善编码效率与编码时间。The
帧间预测启用ACT模块204、帧间预测禁用ACT模块206、帧内预测启用ACT模块212及帧内预测禁用ACT模块214并非局限于以平行的方式排列。在一实施例中,帧间预测启用ACT模块204、帧间预测禁用ACT模块 206、帧内预测启用ACT模块212及帧内预测禁用ACT模块214可以依序排列。帧间预测启用ACT模块204、帧间预测禁用ACT模块206、帧内预测启用ACT模块212及帧内预测禁用ACT模块214的排列方式可以变更。The inter-prediction-enabled
图3说明根据本公开一实施例的编码方法300,其决定TU尺寸的估计 (TU sizeevaluation)是否需执行在启用ACT的帧内预测编码程序(ACT enabled intra predictionencoding process)中。更具体来说,编码方法300利用关于CU尺寸的临界值计算(threshold calculation)来决定是否需执行TU尺寸的估计。FIG. 3 illustrates an
在步骤304中,分量相关性分析(component correlation analysis)执行于一原始CU,以决定CU的ACT的编码模式是否需被启用。CU内的各个像素的色彩分量的相关性被分析出来。在各个像素中,色彩分量的相关性与一像素相关临界值(pixel correlationthreshold)进行比较,以分析出相关性是否高于、等于或低于像素相关性临界值。In
在一CU中,计算出高于像素相关性临界值的像素的总数量,其中等于像素相关性临界值的像素也视为高于像素相关性临界值而被计算在内。像素的总数量接着与一CU相关性临界值(CU correlation threshold)进行比较。In a CU, the total number of pixels above the pixel correlation threshold is calculated, wherein pixels equal to the pixel correlation threshold are also counted as being above the pixel correlation threshold. The total number of pixels is then compared to a CU correlation threshold.
若像素的总数量低于CU相关性临界值,则判定CU的色彩分量具有低相关性。因此,CU并不需要ACT,故流程进入步骤308,而在CU的编码禁用ACT。If the total number of pixels is below the CU correlation threshold, it is determined that the color components of the CU have low correlation. Therefore, the CU does not need ACT, so the flow proceeds to step 308, and ACT is disabled in the coding of the CU.
然而,若像素的总数量高于CU相关性临界值,则判定CU的色彩分量具有高相关性。在这种情况下,ACT是需要用来去除CU的各个像素的分量相关性。当确认为高相关性,ACT被启用且流程进入步骤306。在步骤306,在帧内预测启用ACT之下,进行一概略模式决定。However, if the total number of pixels is higher than the CU correlation threshold, it is determined that the color components of the CU have high correlation. In this case, ACT is the component correlation needed to remove the individual pixels of the CU. When high correlation is confirmed, ACT is enabled and the flow proceeds to step 306 . At
步骤304的相关性分析可以更进一步或可选择性地根据CU的色彩空间 (colorspace)来进行。举例来说,在步骤304,CU内的像素的色彩分量可以进行分析,且CU的色彩空间可以进行判定。色彩空间可以判定为红色、绿色及蓝色(RGB)空间或亮度与色度(luminance and chrominance,YUV) 空间。The correlation analysis of
当判定为色彩空间为RGB色彩空间,流程进入步骤306。在步骤306,在帧内预测启用ACT之下,进行概略模式决定(Rough mode decision)。由于RGB像素分量通常具有高相关性,需要进行ACT来去除CU内各个像素的分量的相关性,以将像素能量(pixel energy)隔离为单一成份(single component)。When it is determined that the color space is the RGB color space, the flow proceeds to step 306 . At
相对地,当色彩空间判定为YUV色彩空间,流程进入步骤308,而禁用 ACT。这是由于YUV像素分量通常具有低相关性,且多数像素能量(pixel energy)存储于单一像素分量(single pixel component)。由于CU像素分量的进一步去相关性动作(de-correlation)不会产生额外的编码效益,故并不需要于YUV像素分量启用ACT。On the other hand, when the color space is determined to be the YUV color space, the flow proceeds to step 308, and ACT is disabled. This is because YUV pixel components generally have low correlation, and most pixel energy is stored in a single pixel component. It is not necessary to enable ACT on YUV pixel components since further de-correlation of CU pixel components does not yield additional coding benefits.
在帧内预测启用ACT模块212中,在编码方法300禁用ACT时,帧内预测启用ACT的编码模式被禁用,且在帧内预测启用ACT模块212不会输出预测至模式决定模块210。In the intra-prediction-enabled
在帧间预测启用ACT模块204,在帧间预测编码禁用ACT时,帧间预测启用ACT的编码模式被禁用,且帧间预测启用ACT模块204不会输出预测至模式决定模块210。In the inter prediction enabled
在步骤306中,帧内预测启用ACT下进行概略模式决定。概略模式决定可以是一成本模式决定(cost-based mode decision)。举例来说,在概略模式决定中,可以决定为低复杂度成本的已选择编码模式,以快速做出决定,其通常具有最高质量及最低编码成本。In
在步骤310中,在启用ACT的编码模式下,进行码率失真函数模式决定 (ratedistortion optimization mode decision,RDO mode decision)。在此,当 ACT、CCP、转换、量化及熵编码执行时,计算原始视频的变异(deviation) 及编码模式的比特成本。变异可以由错误计算(error calculation)来获得,例如是均方差(mean squared error,MSE)。接着,ROD分析选择出具有最低编码成本及最高编码质量的编码模式由。In
举例来说,在帧内预测启用ACT模块212中,35个帧内预测模式(intra predictionmodes,IPMs)可供编码。帧内预测启用ACT模块212在步骤306 的概略模式决定中,采用简单、低复杂度编码成本决定法来从这些帧内预测模式选择出最低编码成本及最高编码质量者。举例来说,绝对转换误差和(sum of absolute transform distortion,SATD)成本可用来决定出各个IPM的低复杂度编码成本。举例来说,最低编码成本及最高偏码质量的选择可以是选择 3个IPM或选择8个IPM。帧内预测启用ACT模块212在步骤310的RDO 模式决定中,对各个已选择IPM进行RDO模式决定。当ACT、CCP、转换、量化及熵编码执行时,计算各个已选择IPM的原始视频的变异及编码的比特成本。变异可以由错误计算(error calculation)来获得,例如是均方差(mean squared error,MSE)。接着,藉由ROD分析从已选择IPM中选择出具有最低编码成本及最高编码质量的IPM。For example, in the intra prediction enabled
上述相关于帧内预测启用ACT模块212的变化流程也可以执行在帧间预测启用ACT模块204。举例来说,当帧间预测启用ACT模块204执行编码方法300,在步骤306,进行时间相邻的视频画面的最佳帧间预测的概略模式决定,其提供最低编码成本及最高编码质量。在步骤310,进行帧间预测的 RDO模式决定。在此,当ACT、CCP、转换、量化及熵编码执行时,计算帧间预测的原始视频的变异(deviation)及编码比特成本。变异可以由错误计算(errorcalculation)来获得,例如是均方差(mean squared error,MSE)。接着,ROD分析选择出具有最低编码成本及最高编码质量的帧间预测。Variations of the flow described above with respect to enabling the
在步骤312,计算目前处理的CU的CU尺寸。CU的尺寸可以是NxN,其中N可以是4、8、16、32或64。CU的N值与临界值T1比较。临界值 T1可以是4、8、16、32或64。根据比较结果,判定出CU尺寸是否小于临界值T1,并藉此估计出欲启用编码模式的转换单元的尺寸。若CU尺寸小于临界值T1,流程进入步骤314,以进行TU尺寸的决定(TU size decision)。然而,若CU尺寸等于或大于临界值T1,流程进入步骤316,而跳过步骤314 的TU尺寸决定步骤。在步骤312,当CU尺寸大于临界值T1,对于CU的 TU尺寸可以大于临界值T1。若CU尺寸CU等于或大于临界值T1,TU四分树结构(quadtree structure)可以决定为最大可能的TU尺寸。举例来说,当 CU尺寸等于或大于临界值T1,对于尺寸为64x64的CU,可以决定出四个 32x32的TU。在另一实施例中,当CU尺寸等于或大于临界值T1,对于32x32、 16x16、8x8或4x4的CU来说,TU可以与CU相同尺寸。举例来说,若CU 的尺寸为32x32,对应的PU尺寸可以是32x32。In
由于TU尺寸的决定耗费时间且增加编码成本,步骤312可改善编码时间及效率。因此,若TU尺寸的决定能够省略,则可解省编码成本即时间。再者,CU尺寸等于或大于临界值T1表示CU的内容并不复杂。举例来说, CU尺寸大于临界值T1可能表示视频图像有大范围区域没有边界、动态或复杂图像。因此,TU尺寸的决定可以不需要进行,以有效率地进行高视频质量的CU的编码。Since TU size determination is time consuming and increases encoding cost, step 312 may improve encoding time and efficiency. Therefore, if the determination of the TU size can be omitted, the coding cost, that is, time can be saved. Furthermore, the CU size equal to or greater than the threshold T1 indicates that the content of the CU is not complicated. For example, a CU size larger than the threshold T1 may indicate that the video image has a large area without borders, dynamic or complex images. Therefore, the determination of the TU size may not need to be performed to efficiently perform encoding of CUs with high video quality.
在步骤314中,若CU尺寸低于临界值T1,则执行TU尺寸的决定。在此,决定了原始CU的TU。藉由步骤310的RDO成本估计,分析出TU尺寸,已获得最高效率及高视频质量的CU的ACT转换。举例来说,可分析出 4x4、8x8、16x16及32x32的TU尺寸。当能够获得最高效率的ACT转换的 TU尺寸被决定出来,此TU尺寸被选择用来作CU的ACT转换并进入步骤 316。已选择的TU尺寸作为最佳的TU四分树结构尺寸。In
在步骤316,进行色度模式决定(chroma mode decision)。色度模式的决定是依据步骤310的预测模式的决定来进行,且利用已决定的预测模式 (determined predictionmode)来使色度预测(chroma prediction)产生色度 PU(chroma PU)及对应的色度TU(chroma TU)。从步骤312或步骤314决定的TU也可用来产生色度TU。色度TU亦根据色度格式(chroma format) 进行二次采样(subsample)。因此,在一实施例中,当色度格式为4:2:0,且亮度TU的尺寸为32x32,决定的色度TU为16x16的色度TU。At
在步骤308,帧内预测启用ACT模块的选择最佳帧内预测模式及选择最佳TU四分树结构尺寸的程序已完成。预测及RDO成本已产生,且输入至模式决定模块210,以与其他预测模块输入至模式决定模块210的RDO成本进行比较。举例来说,帧间预测启用ACT模块204可能产生启用ACT的CU 的预测及RDO成本,并输入预测CU及RDO成本至模式决定模块210。帧间预测禁用ACT模块206及帧内预测禁用ACT模块214也产生预测CU及 RDO成本,并输入其各自的预测CU及RDO成本至模式决定模块210。模式决定模块210比较帧间预测启用ACT模块204、帧间预测禁用ACT模块206、帧内预测启用ACT模块212及帧内预测禁用ACT模块214所输入的预测CU 及RDO成本,并决定将要输入至加总模块216、218的预测CU。At
图4说明根据本公开另一实施例的编码方法400,其根据本公开的另一实施例决定ACT是否需要启用。更具体来说,编码方法400利用了关于CU 尺寸的临界值计算(thresholdcalculation)及CU像素的色彩分量的相关性的决定。根据临界值计算,ACT可以启用或禁用。相同标号的元件可参考前述相关说明。FIG. 4 illustrates an
在步骤304,分量相关性分析(component correlation analysis)执行于原始CU,以决定ACT是否需启用或禁用。步骤304如同编码方法300的说明。若CU的色彩分量的相关性高,则启用ACT且流程进入步骤306、310、314、 316及308(同上述编码步骤300)。然而,若相关性低,则流程进入步骤402。At
在步骤402,决定目前处理的CU的尺寸。如前所述,CU尺寸为NxN,其中N可以是4、8、16、32或64。CU的N值与临界值T2进行比较,以比较出CU尺寸是否小于临界值T2。临界值T2可以是4、8、16、32或64。若 CU尺寸小于临界值T2,则启用ACT且流程进入步骤310,如同编码方法300 的步骤310的RDO模式决定。然而,若CU尺寸等于或大于临界值T2,流程进入步骤308,而禁用ACT。In
在帧间预测启用ACT模块204,当编码方法400中ACT被禁用,帧间预测启用ACT模块204的输出为未应用ACT的帧间预测CU。因此,在这种情况下,帧间预测启用ACT模块204输出的CU相同于帧间预测禁用ACT 模块206的输出。同样地,在帧内预测启用ACT模块212,当编码方法400 中ACT被禁用,帧内预测启用ACT模块212的输出为未应用ACT的帧内预测CU。因此,在这种情况下,帧内预测启用ACT模块212的输出CU相同于帧内预测禁用ACT模块214的输出。In the inter prediction enabled
由于CU尺寸相同或大于临界值T2表示CU的内容不复杂,步骤402可改善编码时间及编码效率。CU尺寸大于临界值T2可能表示视频图像有大范围区域没有边界、动态或复杂图像。在组合已经充分去相关性的色彩分量下,为了有效率地编码CU,可能不需要ACT。Since the size of the CU is the same or larger than the threshold T2, it means that the content of the CU is not complicated, step 402 can improve the coding time and coding efficiency. A CU size larger than the critical value T2 may indicate that the video image has a large area without borders, dynamic or complex images. In combining color components that have been sufficiently decorrelated, ACT may not be required in order to encode the CU efficiently.
图5说明根据本公开另一实施例的编码方法500,其根据本公开的另一实施例决定ACT是否需要启用以及是否需要通过两个临界值计算来进行TU 尺寸估计。更具体来说,编码方法500使用关于CU尺寸的第一临界值计算 (first threshold calculation)以及用以判断是否要启用ACT的CU像素色彩分量的相关性决定。编码方法500也使用关于CU尺寸的第二临界值计算(second threshold calculation),以决定TU尺寸的估计是否需执行。相同标号的元件可参考前述相关说明。5 illustrates an
在步骤304,分量相关性分析(component correlation analysis)执行于原始CU,以决定ACT是否需启用或禁用。步骤304如同编码方法300的说明。若CU的色彩分量的相关性高,则启用ACT且流程进入步骤306,以进行概略模式决定及步骤310的RDO模式决定。步骤306及310如同前述编码方法 300的叙述。然而,若相关性低,则流程进入步骤402。At
在步骤402,决定目前处理的CU的尺寸(如前述图4的编码方法400 所述)。若CU尺寸小于临界值T2,则启用ACT,并进入步骤310,以进行 RDO模式决定。然而,若CU尺寸等于或大于临界值T2,则流程进入步骤 308,而禁用ACT。In
在帧间预测启用ACT模块204,当编码方法500中ACT被禁用,帧间预测启用ACT模块204的输出为未应用ACT的帧间预测CU。因此,在这种情况下,帧间预测启用ACT模块204输出的CU相同于帧间预测禁用ACT 模块206的输出。In the inter prediction enabled
同样地,在帧内预测启用ACT模块212,当编码方法500中ACT被禁用,帧内预测启用ACT模块212的输出为未应用ACT的帧内预测CU。因此,在这种情况下,帧内预测启用ACT模块212的输出CU相同于帧内预测禁用 ACT模块214的输出。Likewise, in the intra-prediction-enabled
在步骤310,RDO模式决定如同前述编码方法300所述的内容。At
在步骤312,目前处理的CU尺寸的计算如同前述编码方法300所述的内容,来决定CU尺寸是否小于临界值T1。若CU尺寸小于临界值T1,则流程进入步骤314,以进行TU尺寸决定。然而,若CU尺寸等于或大于临界值 T1,流程进入步骤316,而跳过步骤314的TU尺寸决定。步骤314、316的决定过程如同前述的编码方法300。In
临界值T1及T2可以设定为相同或不同值。The thresholds T1 and T2 can be set to the same or different values.
图5的编码方法500结合临界值计算来改善编码效率及时间。如上所述, CU尺寸等于或大于临界值T2表示CU的内容不复杂,且可预期有大范围区域的无边界、动态或复杂图样。在组合已经充分去相关性的色彩分量下,为了有效率地编码CU,可能不需要ACT。再者,步骤314的TU尺寸决定被省略后,能够节省编码成本。The
图6说明根据本公开另一实施例的编码方法600(类似于编码方法300),其根据本公开的另一实施例决定是否需要在启用ACT的帧内预测程序中执行TU尺寸估计。更具体来说,编法方法600使用关于CU尺寸的临界值计算(threshold calculation),并根据临界值计算判断是否需要执行TU尺寸估计。6 illustrates an encoding method 600 (similar to encoding method 300) according to another embodiment of the present disclosure for determining whether TU size estimation needs to be performed in an ACT-enabled intra prediction procedure according to another embodiment of the present disclosure. More specifically, the
在步骤304,分量相关性分析(component correlation analysis)执行于原始CU,以决定ACT是否需启用或禁用。步骤304如同编码方法300的说明。若CU的色彩分量的相关性高,则启用ACT且流程进入步骤306,以进行概略模式决定及步骤310的RDO模式决定。步骤306及310如同前述编码方法 300的叙述。然而,若在步骤304的相关性低,或色彩空间判断为YUV色彩空间,则启用ACT的编码模式并直接进入步骤310,但不执行步骤306的概略模式决定。在此,对于低相关性像素分量或YUV色彩空间,ACT仍然启用,以确认像素分量的去相关性可能会产生附加的编码效益。At
在步骤310,RDO模式决定的计算如同前述编码方法300。At
在步骤312,目前处理的CU尺寸的计算如同前述编码方法300所述的内容,来决定CU尺寸是否小于临界值T1。若CU尺寸小于临界值T1,则流程进入步骤314,以进行TU尺寸决定。然而,若CU尺寸等于或大于临界值 T1,流程进入步骤316,而跳过步骤314的TU尺寸决定。步骤314、316的决定过程如同前述的编码方法300。In
临界值T1及T2可以设定为相同或不同值。The thresholds T1 and T2 can be set to the same or different values.
执行编码方法300、400、500、600的相反步骤的解码方法可以有效率地对编码方法300、400、500、600所编码的视频进行解码。因此,本公开的上述内容足以了解执行编码方法300、400、500、600的相反步骤的解码方法。本公开上述内容亦足以了解对编码方法300、400、500、600所编码的视频进行解码所需的其他解码程序。A decoding method that performs the reverse steps of the
如果大CU使用IPM作为屏幕虚拟内容(screen visual content),则可能表示该区域的内容并不复杂,且并不需要估计TU的尺寸。因此,非444色度格式的IPM被禁止部分大CU的TU分割。图7说明非444色度格式的IPM 的演算流程。步骤306及310如同前述编码方法300的叙述。在步骤310, RDO模式决定的计算如同前述编码方法300。If a large CU uses IPM as screen visual content, it may indicate that the content of this area is not complex and the size of the TU does not need to be estimated. Therefore, IPMs in non-444 chroma formats are prohibited from TU partitioning for some large CUs. FIG. 7 illustrates the calculation flow of IPM in non-444 chroma format.
在步骤311,判断色度格式是否为非444。若色度格式为非444,则进入步骤312。若色度格不是非444,则进入步骤314,以近行TU尺寸决定。In
在步骤312,目前处理的CU尺寸的计算如同前述编码方法300所述的内容,来决定CU尺寸是否小于临界值T1。若CU尺寸小于临界值T1,则流程进入步骤314,以进行TU尺寸决定。然而,若CU尺寸等于或大于临界值 T1,流程进入步骤316,而跳过步骤314的TU尺寸决定。步骤314、316的决定过程如同前述的编码方法300。In
临界值T1及T2可以设定为相同或不同值。The thresholds T1 and T2 can be set to the same or different values.
图8绘示执行本公开的编码与解码方法的系统700。系统700包括一非暂态计算机可读取介质(non-transitory computer-readable medium)702,其可以是存储数组指令的存储器。此些指令可被处理器704执行。值得注意的是,一或多个非暂态计算机可读取介质702和/或一或多个处理器704可以选择性地采用,以执行本公开的编码与解码方法。FIG. 8 illustrates a
非暂态计算机可读取介质702可以是任何类型的非暂态计算机可读取记录介质(non-transitory computer-readable storage medium,non-transitory CRM)。非暂态计算机可读取记录介质可以包括软性磁盘(floppy disk)、可挠性盘片 (flexible disk)、硬盘(hard disk)、硬盘机(hard drive)、固态硬盘(solid state drive)、磁带(magnetictape)、任何磁性数据存储介质(magnetic data storage medium)、光碟机(CD-ROM)、任何光学数据存储介质(optical data storage medium)、任何具有孔洞图样的物理性介质、动态随机存取存储器(RAM)、可编程只读存储器(PROM)、可抹除可编程只读存储器(EPROM)、快闪可抹除可编程只读存储器(FLASH-EPROM)、任何快闪存储器、非易失性存储器(NVRAM)、快取(cache)、寄存器(register)、存储器芯片(memory chip)、胶卷(cartridge)及网络。计算机可读取记录介质可存储由至少一处理器执行的数组指令。此些指令包含令处理器去执行本公开的编码与解码方法的步骤或阶段。再者,一或多个计算机可读取记录介质可以用来实现本公开的编码与解码方法。“计算机可读取记录介质”包含有形物体但不包含载体载波信号和瞬态信号。The non-transitory computer-
处理器704可以是任何形式的数字信号处理器(digital signal processor,DSP)、特定应用集成电路(application specific integrated circuit,ASIC)、数字信号处理装置(digital signal processing device,DSPD)、可编程逻辑装置 (programmablelogic device,PLD)、可编程逻辑阵列(field programmable gate arrays,FPGA)、控制器(controller)、微控制器(micro-controller)、微处理器(micro-processor)、计算机或任何其他能够执行本公开的编码与解码方法的电子元件。The
实验结果Experimental results
以下说明本公开的编码方法的实验结果。The experimental results of the encoding method of the present disclosure are described below.
此处的实验室采用HEVC SCC参考模型、一般测试条件(common test conditions,CTC)下的SCM 4.0。本公开的编码方法的编码效能是与HEVC 的参考模型进行比较。HEVC参考模型花费了编码时间A来进行编码。本公开的测试编码方法花费了来进行编码时间B来进行编码。编码时间百分比为编码时间B除以编码时间A。实验可采用HEVC一般测试流程。视频可混合文字、图像、动态画面、混合内容、动画、照相机提取内容。视频可以是具有720p、1080p、或1440p的画质的RGB色彩空间及YUV色彩空间。实验采用有损条件(lossycondition)下的全帧内预测、随机存取及低B预测(low-B prediction)。全帧内预测采用目前正被压缩的画面内的信息来压缩视频画面,而随机存取及低B预测采用先前已编码的画面及目前正被压缩的画面的信息来压缩视频画面。在以下的说明中,低B预测也可以是指低延迟B预测(low delay B prediction)。在每次的实验中,编码时间及解码时间都被以百分比记录下来,此些百分比表示相对于参考模型的编码方法与解码方法的比例。相对于原始视频源,关于各个G/Y、B/U及R/V分量的正的百分比表示比特率编码损失(bit rate codingloss),负的百分比表示比特率编码增益(bit rate coding gain)。举例来说,G/Y分量的0.1%数值表示已编码视频的G/Y分量相对于原始视频的G/Y分量的编码损失为0.1%。在另一实例中,G/Y分量的 -0.1%数值表示已编码视频的G/Y分量相对于原始视频的G/Y分量的编码增益为0.1%。The laboratory here uses the HEVC SCC reference model, SCM 4.0 under common test conditions (CTC). The coding performance of the coding method of the present disclosure is compared with the reference model of HEVC. The HEVC reference model took encoding time A to encode. The test encoding method of the present disclosure took to encode time B to encode. The encoding time percentage is encoding time B divided by encoding time A. The experiment can adopt the general test process of HEVC. Video can be mixed with text, images, motion pictures, mixed content, animation, and camera-extracted content. Video can be in RGB color space and YUV color space with 720p, 1080p, or 1440p quality. The experiments use full intra-frame prediction, random access and low-B prediction under lossy conditions. Full intra prediction uses information within the picture currently being compressed to compress video pictures, while random access and low-B prediction uses information from previously encoded pictures and the picture currently being compressed to compress video pictures. In the following description, low B prediction may also refer to low delay B prediction. In each experiment, encoding time and decoding time were recorded as percentages, which represented the ratio of encoding method to decoding method relative to the reference model. A positive percentage for each G/Y, B/U, and R/V component represents a bit rate coding loss relative to the original video source, and a negative percentage represents a bit rate coding gain. For example, a value of 0.1% for the G/Y component means that the encoding loss of the G/Y component of the encoded video relative to the G/Y component of the original video is 0.1%. In another example, a value of -0.1% for the G/Y component represents a coding gain of 0.1% for the G/Y component of the encoded video relative to the G/Y component of the original video.
请参考图5的编码方法500及下表1。在编码方法500,实验室执行于以下三种设定之下。在设定一,临界值T2及临界值T1皆设定为64。在设定二,临界值T2设定为64,临界值T1设定为32。在设定三,临界值T2设定为64,临界值T1设定为16。帧内预测为预定的编码模式。Please refer to the
在设定一,当像素分量具有低相关性,尺寸大于或等于64x64的CU被以不启用ACT的方式编码。尺寸小于64x64的CU被以启用ACT的方式编码。再者,在CU尺寸大于64x64的情况下,省略TU尺寸决定的步骤314。对于CU尺寸小于64x64的情况下,执行TU尺寸决定的步骤314。In setting one, when pixel components have low correlation, CUs with size greater than or equal to 64x64 are encoded without ACT enabled. CUs with sizes smaller than 64x64 are encoded in an ACT-enabled manner. Furthermore, when the CU size is larger than 64×64,
在设定二,当像素分量具有低相关性,尺寸大于或等于64x64的CU被以不启用ACT的方式编码。尺寸小于64x64的CU被以启用ACT的方式编码。再者,在CU尺寸大于32x32的情况下,省略TU尺寸决定的步骤314。对于CU尺寸小于32x32的情况下,执行TU尺寸决定的步骤314。In setting two, when the pixel components have low correlation, CUs of size greater than or equal to 64x64 are encoded without ACT enabled. CUs with sizes smaller than 64x64 are encoded in an ACT-enabled manner. Furthermore, when the CU size is larger than 32×32,
在设定三,当像素分量具有低相关性,尺寸大于或等于64x64的CU被以不启用ACT的方式编码。尺寸小于64x64的CU被以启用ACT的方式编码。再者,在CU尺寸大于16x16的情况下,省略TU尺寸决定的步骤314。对于CU尺寸小于16x16的情况下,执行TU尺寸决定的步骤314。In setting three, when the pixel components have low correlation, CUs with size greater than or equal to 64x64 are encoded without ACT enabled. CUs with sizes smaller than 64x64 are encoded in an ACT-enabled manner. Furthermore, when the CU size is larger than 16×16,
表1Table 1
如表1所示,设定一、设定二及设定三的编码效能均有改善。设定一降低了3%的编码复杂度(encoding complexity),设定二降低了6%的编码复杂度。设定三降低了9%的编码复杂度(设定三降低最多)。因此,所有的设定都能够改善编码效率。各个设定在比特率的最小损失(minimal loss of bit rate) 下,编码时间及效率都有改善。As shown in Table 1, the encoding performance of Setting 1, Setting 2 and Setting 3 are improved. Setting one reduces encoding complexity by 3%, setting two reduces encoding complexity by 6%. Setting three reduces coding complexity by 9% (setting three reduces the most). Therefore, all settings can improve coding efficiency. Each setting improves encoding time and efficiency at a minimal loss of bit rate.
请参照编码方法500及下表2、3。在此,实验是在全帧内、随机存取及低延迟B(lowdelay B)之下进行。在实验一,临界值T2及临界值T1皆设定为32。在实验二,临界值T2及临界值T1皆设定为16。如同编码方法500,在实验一,尺寸大于或等于32x32的CU禁用TU估计(TU evaluation),且尺寸大于或等于32x32以不启用ACT的方式编码。在实验二,尺寸大于或等于16x16的CU禁用TU估计,且尺寸大于或等于16x16以不启用ACT的方式编码。尺寸小于16x16的CU在启用ACT的方式编码。实验是在有损条件 (lossy condition)及全画面帧内区块复制技术(full frame intra block copy) 下进行。Please refer to the
表2Table 2
如表2所述,在实验一,全帧内模式(all intra mode)降低了5%的编码复杂度。随机存取及低延迟B各降低了1%的编码复杂度。各个设定显示出非常低的比特率损失,全帧内及随机存取几乎没有改变比特率。As described in Table 2, in Experiment 1, all intra mode reduces coding complexity by 5%. Random access and low latency B each reduce coding complexity by 1%. The individual settings showed very low bitrate loss, with full intraframe and random access barely changing the bitrate.
在实验二,全帧内模式降低了8%的编码复杂度。随机存取降低了1%的编码复杂度。低延迟B没有改变编码复杂度。相较于实验一,各个模式具有较多的比特率损失,但比特率损失仍然维持在最小(仅在百分比的小数范围内)。相较于原始视频,已编码视频仅略微降低比特率,故仅损失少部分的视频质量。由于编码方法500改善了编码时间,故这样的视频质量在大部分的应用是可以接受的。In experiment two, the full intra mode reduces the coding complexity by 8%. Random access reduces coding complexity by 1%. Low latency B does not change the coding complexity. Compared with experiment 1, each mode has more bit rate loss, but the bit rate loss is still kept to a minimum (only in the decimal range of the percentage). Compared to the original video, the bitrate of the encoded video is only slightly reduced, so only a small amount of video quality is lost. Since the
表3table 3
如表3所述,在实验一及实验二,各个模式在全部或平均来看,对比特率都没有改变。全帧内降低了最多比率的编码复杂度(在各实验均降低1%)。As shown in Table 3, in Experiment 1 and Experiment 2, each mode does not change the bit rate in total or on average. Encoding complexity is reduced by the most ratio (1% reduction in each experiment) in full frame.
请参考图5的编码方法500及下表4。在此,实验是在有损条件(lossy condition)、4-CTU帧内区块复制技术(Intra block copy)及4:4:4色度模式下进行。帧内区块复制技术利用运动向量从先前已编码CU复制一区块至目前编码视频画面。4-CTU指出运动向量能够搜寻的范围。Please refer to the
在实验一,临界值T2及临界值T1皆设定为32。在实验二,临界值T2 及临界值T1皆设定为16。如同编码方法500,在实验一,尺寸大于或等于 32x32的CU禁用TU估计。在实验二,尺寸大于或等于16x16的CU禁用 TU估计。在实验一,尺寸大于32x32的CU启用ACT,尺寸大于或等于32x32 的CU禁用ACT。在实验二,尺寸小于16x16的CU启用ACT,尺寸大于或等于16x16的CU禁用ACT。In experiment 1, both the threshold value T2 and the threshold value T1 are set to 32. In experiment 2, the threshold value T2 and the threshold value T1 are both set to 16. As with the
表4Table 4
如表4所述,在实验一及实验二,全帧内、随机存取或低延迟B模式皆为最小比特率改变。全帧内降低了最多的编码复杂度,其在实验一降低了5%,在实验二降低了8%。As described in Table 4, in Experiment 1 and Experiment 2, full intra-frame, random access or low-latency B-mode are all the minimum bit rate changes. Intra-frame reduces the coding complexity the most, by 5% in experiment one and 8% in experiment two.
请参考图4的编码方法400及以下的表5.1及表5.2。在此,临界值T2 设定为64。因此,当步骤304的分量相关性分析分析出CU的色彩分量具有低相关性时,执行步骤402,以判断CU尺寸是否小于64x64。若CU尺寸小于64x64,则启用ACT且执行步骤310的RDO模式决定。若CU尺寸大于或等于64x64,则禁用ACT且进入步骤308。实验一采用全画面帧内区块复制技术(full frame intra block copy)的有损全帧内编码模式(lossy all intra encodingmode),实验二采用4CTU IBC技术的有损全帧内编码模式。色度模式于各个实验选择为4:4:4。Please refer to the
表5.1:实验一Table 5.1: Experiment 1
表5.2:实验二Table 5.2: Experiment 2
如表5.1所示,在YUV色彩空间且全帧内、有损、全画面帧内区块复制技术下,编码方法400在最小比特率损失下,降低了1%到3%的编码时间。如表5.2所示,在全帧内、有损、4CTU帧内区块复制技术下,编码方法400 在最小比特损失下,降低编码时间的比率近似于表5.1的实验一。As shown in Table 5.1, the
请参考编码方法400及下表6。在此,临界值T2设定为64。在4:4:4 的色度模式执行无损帧内编码(Lossless intra encoding)。Please refer to the
表6Table 6
在YUV色彩空间,编码方法节省了0%到2%的编码时间。In YUV color space, the encoding method saves 0% to 2% of encoding time.
请参考图3的编码方法300及下表7。在此,临界值T1在实验一设定为 32,在实验二设定为16。如同编码方法300,在实验一,CU尺寸大于或等于 32x32时,将省略步骤314的TU尺寸决定;CU尺寸小于32x32时,则执行步骤314的TU尺寸决定。在实验二,CU尺寸大于或等于16x16时,将省略步骤314的TU尺寸决定;CU尺寸小于16x16时,则执行步骤314的TU尺寸决定。实验执行启用ACT的有损全帧内编码。Please refer to the
表7Table 7
实验一的编码时间节省了3%到6%。实验二的编码时间节省了6%到10%。因此,仅在CU尺寸低于32x32或16x16之下才允许进行TU尺寸决定,以帮助编码效率。The encoding time for experiment one was saved by 3% to 6%. The encoding time for experiment two was saved by 6% to 10%. Therefore, TU size decisions are only allowed under CU sizes below 32x32 or 16x16 to help coding efficiency.
上述内容用以说明本公开的技术,然其并非用以局限本发明的内容。实施例的修改与调整均落于本公开的范围。举例来说,所公开的实施例包含软件及硬件,但本公开的系统与方法可以仅以硬件来实现。The above content is used to illustrate the technology of the present disclosure, but it is not intended to limit the content of the present invention. Modifications and adjustments of the embodiments fall within the scope of the present disclosure. For example, the disclosed embodiments include both software and hardware, but the systems and methods of the present disclosure may be implemented in hardware only.
软件开发者可基于本公开的方法开发一计算机程序,其可采用各种计算机程序技术来开发。举例来说,程序片段或程序模块可以藉由Java、C、C++、组合语言或任何其他程序语言来开发。一或多个软件片段与模块可以安装于一计算机系统、非暂态计算机可读取介质、或现存的通信软件。A software developer can develop a computer program based on the methods of the present disclosure, which can be developed using various computer programming techniques. For example, program segments or program modules may be developed in Java, C, C++, assembly language, or any other programming language. One or more software segments and modules may be installed on a computer system, non-transitory computer readable medium, or existing communications software.
再者,虽然上述以公开各种实施例,然本公开的范围包含各种元件的均等、修改、省略、组合(例如不同实施例之间的组合)、应用、或选择。权利要求书的元件以最广的范围来作解释,而不局限于实施例的内容。此外,方法的步骤可以进行修改(包含调整顺序、插入或删除步骤)。虽然本公开已以优选实施例公开如上,然其并非用以限定本公开。本公开的保护范围当视所附权利要求书界定范围为准。Furthermore, while the foregoing discloses various embodiments, the scope of the present disclosure includes equivalents, modifications, omissions, combinations (eg, between different embodiments), applications, or selections of various elements. The elements of the claims are to be construed in the broadest sense and are not limited to the content of the embodiments. Furthermore, the steps of the method may be modified (including reordering, insertion or deletion of steps). Although the present disclosure has been disclosed above in terms of preferred embodiments, it is not intended to limit the present disclosure. The scope of protection of the present disclosure should be determined by the scope defined by the appended claims.
本发明所属领域技术人员也可根据本公开的说明了解其他的实施例。本公开的范围包含结合一般知识的各种变化、实施与应用。说明书与实施例仅仅作为示例,本公开的保护范围当视所附权利要求书界定范围为准。Those skilled in the art to which the present invention pertains will also recognize other embodiments from the description of the present disclosure. The scope of the present disclosure includes various changes, implementations, and applications incorporating general knowledge. The description and the embodiments are only used as examples, and the protection scope of the present disclosure should be determined by the scope defined by the appended claims.
Claims (14)
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201562172256P | 2015-06-08 | 2015-06-08 | |
US62/172,256 | 2015-06-08 | ||
US14/757,556 US20160360205A1 (en) | 2015-06-08 | 2015-12-24 | Video encoding methods and systems using adaptive color transform |
US14/757,556 | 2015-12-24 | ||
US201662290992P | 2016-02-04 | 2016-02-04 | |
US62/290,992 | 2016-02-04 | ||
TW105114323 | 2016-05-09 | ||
TW105114323A TWI597977B (en) | 2015-06-08 | 2016-05-09 | Video encoding methods and systems using adaptive color transform |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106254870A CN106254870A (en) | 2016-12-21 |
CN106254870B true CN106254870B (en) | 2020-08-18 |
Family
ID=57626642
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610357374.8A Active CN106254870B (en) | 2015-06-08 | 2016-05-26 | Video encoding method, system and computer-readable recording medium using adaptive color conversion |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106254870B (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106851272B (en) * | 2017-01-20 | 2019-11-12 | 杭州当虹科技股份有限公司 | A kind of method of HDR and SDR adaptive rate control |
US10820017B2 (en) * | 2017-03-15 | 2020-10-27 | Mediatek Inc. | Method and apparatus of video coding |
WO2019076138A1 (en) | 2017-10-16 | 2019-04-25 | Huawei Technologies Co., Ltd. | METHOD AND APPARATUS FOR ENCODING |
CN108174214A (en) * | 2017-12-08 | 2018-06-15 | 重庆邮电大学 | A Remote Desktop Sharing Method Based on Screen Content Video Coding |
CN111758255A (en) | 2018-02-23 | 2020-10-09 | 华为技术有限公司 | Position dependent spatially varying transforms for video coding |
IL279095B2 (en) | 2018-05-31 | 2024-12-01 | Huawei Tech Co Ltd | Spatially varying transform with a type of adaptive transform |
CN117579830A (en) | 2019-06-21 | 2024-02-20 | 北京字节跳动网络技术有限公司 | Selective use of adaptive intra-annular color space conversion and other codec tools |
WO2021018082A1 (en) * | 2019-07-26 | 2021-02-04 | Beijing Bytedance Network Technology Co., Ltd. | Determination of picture partition mode based on block size |
CN115152219A (en) | 2019-11-07 | 2022-10-04 | 抖音视界有限公司 | Quantization characteristics of adaptive in-loop color space transforms for video coding |
TWI743919B (en) * | 2020-08-03 | 2021-10-21 | 緯創資通股份有限公司 | Video processing apparatus and processing method of video stream |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20130136180A1 (en) * | 2011-11-29 | 2013-05-30 | Futurewei Technologies, Inc. | Unified Partitioning Structures and Signaling Methods for High Efficiency Video Coding |
-
2016
- 2016-05-26 CN CN201610357374.8A patent/CN106254870B/en active Active
Non-Patent Citations (3)
Title |
---|
AHG6: On Adaptive Color Transform (ACT) in SCM2.0;PoLin Lai, Shan Liu, Shawmin Lei;《19.JCT-VC MEETING》;20141008;摘要,第2-4小节 * |
High Efficiency Video Coding (HEVC) Test Model 16 (HM 16) Encoder Description;K. McCann;《18.JCT-VC MEETING》;20141014;4.1.3小节,4.1.5小节,图4-4,图4-6 * |
Screen content coding test model 2 (SCM 2);Rajan Joshi;《17.JCT-VC MEETING》;20141017;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN106254870A (en) | 2016-12-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106254870B (en) | Video encoding method, system and computer-readable recording medium using adaptive color conversion | |
CN113812148B (en) | Reference picture resampling and inter coding tools for video coding | |
TWI839662B (en) | Video decoding method | |
US20160360205A1 (en) | Video encoding methods and systems using adaptive color transform | |
EP3598758B1 (en) | Encoder decisions based on results of hash-based block matching | |
US10390020B2 (en) | Video encoding methods and systems using adaptive color transform | |
KR102606414B1 (en) | Encoder, decoder and corresponding method to derive edge strength of deblocking filter | |
JP6355715B2 (en) | Encoding device, decoding device, encoding method, decoding method, and program | |
CN112385233B (en) | Intra-frame smoothing (MDIS) of combined dependent modes and interpolation filter switching with position dependent intra-frame prediction combining (PDPC) | |
WO2015138954A1 (en) | Color-space inverse transform both for lossy and lossless encoded video | |
CN113545051B (en) | Reconstruction of blocks of video data using block size restriction | |
GB2582929A (en) | Residual signalling | |
JP7592968B2 (en) | Encoder, decoder and corresponding method of deblocking filter adaptation | |
CA2878440A1 (en) | Restricted intra deblocking filtering for video coding | |
CN112673636A (en) | Rounding motion vectors to adaptive motion vector difference resolution and improve motion vector storage accuracy in video coding | |
CN113330743A (en) | Encoder, decoder and corresponding method for deblocking filter adaptation | |
CN114598873B (en) | Decoding method and device for quantization parameter | |
EP3104606B1 (en) | Video encoding methods and systems using adaptive color transform | |
TWI597977B (en) | Video encoding methods and systems using adaptive color transform | |
WO2015142618A1 (en) | Systems and methods for low complexity encoding and background detection | |
US20150117521A1 (en) | Method and apparatus for inter color component prediction | |
CN111327899A (en) | Video decoder and corresponding method | |
KR20230123947A (en) | Adaptive loop filter with fixed filters | |
EP2899975A1 (en) | Video encoder with intra-prediction pre-processing and methods for use therewith | |
RU2786626C2 (en) | Method and device for image separation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |