WO2020005007A1 - 비디오 신호 처리 방법 및 장치 - Google Patents
비디오 신호 처리 방법 및 장치 Download PDFInfo
- Publication number
- WO2020005007A1 WO2020005007A1 PCT/KR2019/007881 KR2019007881W WO2020005007A1 WO 2020005007 A1 WO2020005007 A1 WO 2020005007A1 KR 2019007881 W KR2019007881 W KR 2019007881W WO 2020005007 A1 WO2020005007 A1 WO 2020005007A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- merge candidate
- block
- merge
- motion information
- prediction
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/577—Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/513—Processing of motion vectors
- H04N19/517—Processing of motion vectors by encoding
- H04N19/52—Processing of motion vectors by encoding by predictive encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/137—Motion inside a coding unit, e.g. average field, frame or block difference
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
Definitions
- the present invention relates to a video signal processing method and apparatus.
- High efficiency image compression techniques can be used to solve these problems caused by high resolution and high quality image data.
- An inter-screen prediction technique for predicting pixel values included in the current picture from a picture before or after the current picture using an image compression technique an intra prediction technique for predicting pixel values included in a current picture using pixel information in the current picture
- An object of the present invention is to provide a method and apparatus for efficiently performing inter prediction on an encoding / decoding target block in encoding / decoding a video signal.
- An object of the present invention is to provide a method and apparatus for obtaining bidirectional motion information in encoding / decoding a video signal.
- An object of the present invention is to provide a method and apparatus for rearranging merge candidates in encoding / decoding a video signal.
- the video signal decoding method and apparatus may derive at least one merge candidate based on at least one of a spatial neighboring block or a temporal neighboring block of the current block, generate a merge candidate list including the merge candidate, Obtain LX direction motion information of the current block from a first merge candidate included in the merge candidate list, and obtain L (1-X) direction motion information of the current block from a second merge candidate different from the first merge candidate.
- the inter prediction may be performed based on the LX direction motion information and the L (1-X) direction motion information.
- the video signal encoding method and apparatus may derive at least one merge candidate based on at least one of a spatial neighboring block or a temporal neighboring block of the current block, generate a merge candidate list including the merge candidate, Obtain LX direction motion information of the current block from a first merge candidate included in the merge candidate list, and obtain L (1-X) direction motion information of the current block from a second merge candidate different from the first merge candidate.
- the inter prediction may be performed based on the LX direction motion information and the L (1-X) direction motion information.
- the second merge A merge candidate corresponding to a value obtained by adding 1 to an index may be determined as the second merge candidate.
- the second merge candidate may be selected from an additional merge candidate list generated by extracting only merge candidates having L (X-1) motion information from the merge candidate list. Can be.
- the LX direction prediction of the current block may include the LX motion information and the LX motion information of the second merge candidate. It can be performed based on.
- the LX direction prediction may include a first LX prediction based on the LX motion information and a second LX prediction based on the LX motion information of the second merge candidate. Can be.
- the LX direction prediction includes: a first LX motion vector for the LX motion information and a second LX motion vector for the LX motion information of the second merge candidate; It may be performed based on the third LX motion vector derived based.
- inter prediction based on the LX motion information is performed on a first partition of the current block, and on the basis of the L (1-X) motion information on a second partition. Inter prediction may be performed.
- inter prediction efficiency can be improved by performing motion compensation using a plurality of merge candidate lists.
- inter prediction efficiency can be improved by using bidirectional motion information.
- a method of efficiently encoding / decoding a merge index can be provided by rearranging merge candidates.
- FIG. 1 is a block diagram illustrating an image encoding apparatus according to an embodiment of the present invention.
- FIG. 2 is a block diagram illustrating an image decoding apparatus according to an embodiment of the present invention.
- FIG. 3 is a diagram illustrating a partition mode candidate that can be applied to a coding block when the coding block is encoded by inter-screen prediction.
- FIG. 4 illustrates an example in which coding blocks are hierarchically divided based on a tree structure according to an embodiment to which the present invention is applied.
- FIG. 5 is a diagram illustrating a partition form in which binary tree based partitioning is allowed as an embodiment to which the present invention is applied.
- FIG. 7 illustrates an example in which only a specific type of binary tree based partitioning is allowed.
- FIG. 8 is a diagram for describing an example in which information related to a binary tree split permission count is encoded / decoded according to an embodiment to which the present invention is applied.
- FIG. 9 is a flowchart illustrating an inter prediction method according to an embodiment to which the present invention is applied.
- FIG. 10 is a diagram illustrating a process of deriving motion information of a current block when a merge mode is applied to the current block.
- FIG. 11 is a diagram illustrating an example of a spatial neighboring block.
- FIG. 12 is a diagram for describing an example of deriving a motion vector of a temporal merge candidate.
- FIG. 13 illustrates positions of candidate blocks that can be used as collocated blocks.
- FIG. 14 is a diagram illustrating a process of deriving motion information of a current block when an AMVP mode is applied to the current block.
- FIG. 15 illustrates an example of deriving a merge candidate from a second merge candidate block when the first merge candidate block is not available.
- FIG. 16 illustrates an example of deriving a merge candidate from a second merge candidate block located on the same line as the first merge candidate block.
- 17 to 20 are diagrams illustrating a search order of merge candidate blocks.
- 21 illustrates an example in which a merge candidate of a non-square block is derived based on the square block.
- FIG. 22 illustrates an example of deriving a merge candidate based on an upper node block.
- FIG. 23 is a diagram for describing an example in which availability of spatial neighboring blocks is determined based on a merge induction region.
- FIG. 24 is a diagram illustrating an example in which a merge candidate is derived based on a merge induction region.
- 25 is a diagram illustrating an embodiment of a multiple inter prediction method.
- FIG. 26 illustrates an example in which a multi-inter prediction method is performed when a merge candidate has bidirectional information.
- first and second may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another.
- the first component may be referred to as the second component, and similarly, the second component may also be referred to as the first component.
- FIG. 1 is a block diagram illustrating an image encoding apparatus according to an embodiment of the present invention.
- the image encoding apparatus 100 may include a picture splitter 110, a predictor 120 and 125, a transformer 130, a quantizer 135, a realigner 160, and an entropy encoder. 165, an inverse quantizer 140, an inverse transformer 145, a filter 150, and a memory 155.
- each of the components shown in FIG. 1 is independently illustrated to represent different characteristic functions in the image encoding apparatus, and does not mean that each of the components is made of separate hardware or one software component unit.
- each component is included in each component for convenience of description, and at least two of the components may be combined into one component, or one component may be divided into a plurality of components to perform a function.
- Integrated and separate embodiments of the components are also included within the scope of the present invention without departing from the spirit of the invention.
- the components may not be essential components for performing essential functions in the present invention, but may be optional components for improving performance.
- the present invention can be implemented including only the components essential for implementing the essentials of the present invention except for the components used for improving performance, and the structure including only the essential components except for the optional components used for improving performance. Also included in the scope of the present invention.
- the picture dividing unit 110 may divide the input picture into at least one processing unit.
- the processing unit may be a prediction unit (PU), a transform unit (TU), or a coding unit (CU).
- the picture dividing unit 110 divides one picture into a combination of a plurality of coding units, prediction units, and transformation units, and combines one coding unit, prediction unit, and transformation unit on a predetermined basis (eg, a cost function). You can select to encode the picture.
- one picture may be divided into a plurality of coding units.
- a recursive tree structure such as a quad tree structure may be used, and coding is divided into other coding units by using one image or a largest coding unit as a root.
- the unit may be split with as many child nodes as the number of split coding units. Coding units that are no longer split according to certain restrictions become leaf nodes. That is, when it is assumed that only square division is possible for one coding unit, one coding unit may be split into at most four other coding units.
- a coding unit may be used as a unit for encoding or may be used as a unit for decoding.
- the prediction unit may be split in the form of at least one square or rectangle having the same size in one coding unit, or the prediction unit of any one of the prediction units split in one coding unit is different from one another. It may be divided to have a different shape and / or size than the unit.
- the intra prediction may be performed without splitting into a plurality of prediction units NxN.
- the predictors 120 and 125 may include an inter predictor 120 that performs inter prediction and an intra predictor 125 that performs intra prediction. Whether to use inter prediction or intra prediction on the prediction unit may be determined, and specific information (eg, an intra prediction mode, a motion vector, a reference picture, etc.) according to each prediction method may be determined. In this case, the processing unit in which the prediction is performed may differ from the processing unit in which the prediction method and the details are determined. For example, the method of prediction and the prediction mode may be determined in the prediction unit, and the prediction may be performed in the transform unit. The residual value (residual block) between the generated prediction block and the original block may be input to the transformer 130.
- specific information eg, an intra prediction mode, a motion vector, a reference picture, etc.
- prediction mode information and motion vector information used for prediction may be encoded by the entropy encoder 165 together with the residual value and transmitted to the decoder.
- the original block may be encoded as it is and transmitted to the decoder without generating the prediction block through the prediction units 120 and 125.
- the inter prediction unit 120 may predict the prediction unit based on the information of at least one of the previous picture or the next picture of the current picture. In some cases, the inter prediction unit 120 may predict the prediction unit based on the information of the partial region in which the encoding is completed in the current picture. You can also predict units.
- the inter predictor 120 may include a reference picture interpolator, a motion predictor, and a motion compensator.
- the reference picture interpolator may receive reference picture information from the memory 155 and generate pixel information of an integer pixel or less in the reference picture.
- a DCT based 8-tap interpolation filter having different filter coefficients may be used to generate pixel information of integer pixels or less in units of 1/4 pixels.
- a DCT-based interpolation filter having different filter coefficients may be used to generate pixel information of an integer pixel or less in units of 1/8 pixels.
- the motion predictor may perform motion prediction based on the reference picture interpolated by the reference picture interpolator.
- various methods such as full search-based block matching algorithm (FBMA), three step search (TSS), and new three-step search algorithm (NTS) may be used.
- FBMA full search-based block matching algorithm
- TSS three step search
- NTS new three-step search algorithm
- the motion vector may have a motion vector value of 1/2 or 1/4 pixel units based on the interpolated pixels.
- the motion prediction unit may predict the current prediction unit by using a different motion prediction method.
- various methods such as a skip method, a merge method, an advanced motion vector prediction (AMVP) method, an intra block copy method, and the like may be used.
- AMVP advanced motion vector prediction
- the intra predictor 125 may generate a prediction unit based on reference pixel information around the current block, which is pixel information in the current picture. If the neighboring block of the current prediction unit is a block that has performed inter prediction, and the reference pixel is a pixel that has performed inter prediction, the reference pixel of the block that has performed intra prediction around the reference pixel included in the block where the inter prediction has been performed Can be used as a substitute for information. That is, when the reference pixel is not available, the unavailable reference pixel information may be replaced with at least one reference pixel among the available reference pixels.
- a prediction mode may have a directional prediction mode using reference pixel information according to a prediction direction, and a non-directional mode using no directional information when performing prediction.
- the mode for predicting the luminance information and the mode for predicting the color difference information may be different, and the intra prediction mode information or the predicted luminance signal information used for predicting the luminance information may be utilized to predict the color difference information.
- intra prediction When performing intra prediction, if the size of the prediction unit and the size of the transform unit are the same, the intra prediction on the prediction unit is performed based on the pixels on the left of the prediction unit, the pixels on the upper left, and the pixels on the top. Can be performed. However, when performing intra prediction, if the size of the prediction unit is different from that of the transform unit, intra prediction may be performed using a reference pixel based on the transform unit. In addition, intra prediction using NxN division may be used only for a minimum coding unit.
- the intra prediction method may generate a prediction block after applying an adaptive intra smoothing (AIS) filter to a reference pixel according to a prediction mode.
- AIS adaptive intra smoothing
- the type of AIS filter applied to the reference pixel may be different.
- the intra prediction mode of the current prediction unit may be predicted from the intra prediction mode of the prediction unit existing around the current prediction unit.
- the prediction mode of the current prediction unit is predicted by using the mode information predicted from the neighboring prediction unit, if the intra prediction mode of the current prediction unit and the neighboring prediction unit is the same, the current prediction unit and the neighboring prediction unit using the predetermined flag information If the prediction modes of the current prediction unit and the neighboring prediction unit are different, entropy encoding may be performed to encode the prediction mode information of the current block.
- a residual block may include a prediction unit performing prediction based on the prediction units generated by the prediction units 120 and 125 and residual information including residual information that is a difference from an original block of the prediction unit.
- the generated residual block may be input to the transformer 130.
- the transform unit 130 converts the residual block including residual information of the original block and the prediction unit generated by the prediction units 120 and 125 into a discrete cosine transform (DCT), a discrete sine transform (DST), and a KLT. You can convert using the same conversion method. Whether to apply DCT, DST, or KLT to transform the residual block may be determined based on intra prediction mode information of the prediction unit used to generate the residual block.
- DCT discrete cosine transform
- DST discrete sine transform
- KLT KLT
- the quantization unit 135 may quantize the values converted by the transformer 130 into the frequency domain.
- the quantization coefficient may change depending on the block or the importance of the image.
- the value calculated by the quantization unit 135 may be provided to the inverse quantization unit 140 and the reordering unit 160.
- the reordering unit 160 may reorder coefficient values with respect to the quantized residual value.
- the reordering unit 160 may change the two-dimensional block shape coefficients into a one-dimensional vector form through a coefficient scanning method. For example, the reordering unit 160 may scan from DC coefficients to coefficients in the high frequency region by using a Zig-Zag scan method and change them into one-dimensional vectors.
- a vertical scan that scans two-dimensional block shape coefficients in a column direction instead of a zig-zag scan may be used, and a horizontal scan that scans two-dimensional block shape coefficients in a row direction. That is, according to the size of the transform unit and the intra prediction mode, it is possible to determine which scan method among the zig-zag scan, the vertical scan, and the horizontal scan is used.
- the entropy encoder 165 may perform entropy encoding based on the values calculated by the reordering unit 160.
- Entropy coding may use various coding methods such as, for example, Exponential Golomb, Context-Adaptive Variable Length Coding (CAVLC), and Context-Adaptive Binary Arithmetic Coding (CABAC).
- Exponential Golomb Context-Adaptive Variable Length Coding
- CABAC Context-Adaptive Binary Arithmetic Coding
- the entropy encoder 165 receives residual value coefficient information, block type information, prediction mode information, partition unit information, prediction unit information, transmission unit information, and motion of the coding unit from the reordering unit 160 and the prediction units 120 and 125.
- Various information such as vector information, reference frame information, interpolation information of a block, and filtering information can be encoded.
- the entropy encoder 165 may entropy encode a coefficient value of a coding unit input from the reordering unit 160.
- the inverse quantizer 140 and the inverse transformer 145 inverse quantize the quantized values in the quantizer 135 and inversely transform the transformed values in the transformer 130.
- the residual value generated by the inverse quantizer 140 and the inverse transformer 145 is reconstructed by combining the prediction units predicted by the motion estimator, the motion compensator, and the intra predictor included in the predictors 120 and 125. You can create a Reconstructed Block.
- the filter unit 150 may include at least one of a deblocking filter, an offset correction unit, and an adaptive loop filter (ALF).
- a deblocking filter may include at least one of a deblocking filter, an offset correction unit, and an adaptive loop filter (ALF).
- ALF adaptive loop filter
- the deblocking filter may remove block distortion caused by boundaries between blocks in the reconstructed picture.
- it may be determined whether to apply a deblocking filter to the current block based on the pixels included in several columns or rows included in the block.
- a strong filter or a weak filter may be applied according to the required deblocking filtering strength.
- horizontal filtering and vertical filtering may be performed in parallel when vertical filtering and horizontal filtering are performed.
- the offset correction unit may correct the offset with respect to the original image on a pixel-by-pixel basis for the deblocking image.
- the pixels included in the image are divided into a predetermined number of areas, and then, an area to be offset is determined, an offset is applied to the corresponding area, or offset considering the edge information of each pixel. You can use this method.
- Adaptive Loop Filtering may be performed based on a value obtained by comparing the filtered reconstructed image with the original image. After dividing the pixels included in the image into a predetermined group, one filter to be applied to the group may be determined and filtering may be performed for each group. For information related to whether to apply ALF, a luminance signal may be transmitted for each coding unit (CU), and the shape and filter coefficient of an ALF filter to be applied may vary according to each block. In addition, regardless of the characteristics of the block to be applied, the same type (fixed form) of the ALF filter may be applied.
- ALF Adaptive Loop Filtering
- the memory 155 may store the reconstructed block or picture calculated by the filter unit 150, and the stored reconstructed block or picture may be provided to the predictors 120 and 125 when performing inter prediction.
- FIG. 2 is a block diagram illustrating an image decoding apparatus according to an embodiment of the present invention.
- the image decoder 200 includes an entropy decoder 210, a reordering unit 215, an inverse quantizer 220, an inverse transformer 225, a predictor 230, 235, and a filter unit ( 240, a memory 245 may be included.
- the input bitstream may be decoded by a procedure opposite to that of the image encoder.
- the entropy decoder 210 may perform entropy decoding in a procedure opposite to that of the entropy encoding performed by the entropy encoder of the image encoder. For example, various methods such as Exponential Golomb, Context-Adaptive Variable Length Coding (CAVLC), and Context-Adaptive Binary Arithmetic Coding (CABAC) may be applied to the method performed by the image encoder.
- various methods such as Exponential Golomb, Context-Adaptive Variable Length Coding (CAVLC), and Context-Adaptive Binary Arithmetic Coding (CABAC) may be applied to the method performed by the image encoder.
- the entropy decoder 210 may decode information related to intra prediction and inter prediction performed by the encoder.
- the reordering unit 215 may reorder the entropy decoded bitstream by the entropy decoding unit 210 based on a method of rearranging the bitstream. Coefficients expressed in the form of a one-dimensional vector may be reconstructed by reconstructing the coefficients in a two-dimensional block form.
- the reordering unit 215 may be realigned by receiving information related to coefficient scanning performed by the encoder and performing reverse scanning based on the scanning order performed by the corresponding encoder.
- the inverse quantization unit 220 may perform inverse quantization based on the quantization parameter provided by the encoder and the coefficient values of the rearranged block.
- the inverse transform unit 225 may perform an inverse transform, i.e., an inverse DCT, an inverse DST, and an inverse KLT, for a quantization result performed by the image encoder, that is, a DCT, DST, and KLT. Inverse transformation may be performed based on a transmission unit determined by the image encoder.
- the inverse transform unit 225 of the image decoder may selectively perform a transform scheme (eg, DCT, DST, KLT) according to a plurality of pieces of information such as a prediction method, a size of a current block, and a prediction direction.
- a transform scheme eg, DCT, DST, KLT
- the prediction units 230 and 235 may generate the prediction block based on the prediction block generation related information provided by the entropy decoder 210 and previously decoded blocks or picture information provided by the memory 245.
- Intra prediction is performed on a prediction unit based on a pixel, but when intra prediction is performed, when the size of the prediction unit and the size of the transformation unit are different, intra prediction may be performed using a reference pixel based on the transformation unit. Can be. In addition, intra prediction using NxN division may be used only for a minimum coding unit.
- the predictors 230 and 235 may include a prediction unit determiner, an inter predictor, and an intra predictor.
- the prediction unit determination unit receives various information such as prediction unit information input from the entropy decoder 210, prediction mode information of the intra prediction method, and motion prediction related information of the inter prediction method, and distinguishes the prediction unit from the current coding unit, and predicts It may be determined whether the unit performs inter prediction or intra prediction.
- the inter prediction unit 230 predicts the current prediction based on information included in at least one of a previous picture or a subsequent picture of the current picture including the current prediction unit by using information required for inter prediction of the current prediction unit provided by the image encoder. Inter prediction may be performed on a unit. Alternatively, inter prediction may be performed based on information of some regions pre-restored in the current picture including the current prediction unit.
- a motion prediction method of a prediction unit included in a coding unit based on a coding unit includes a skip mode, a merge mode, an AMVP mode, and an intra block copy mode. It can be determined whether or not.
- the intra predictor 235 may generate a prediction block based on pixel information in the current picture.
- intra prediction may be performed based on intra prediction mode information of the prediction unit provided by the image encoder.
- the intra predictor 235 may include an adaptive intra smoothing (AIS) filter, a reference pixel interpolator, and a DC filter.
- the AIS filter is a part of filtering the reference pixel of the current block and determines whether to apply the filter according to the prediction mode of the current prediction unit.
- AIS filtering may be performed on the reference pixel of the current block by using the prediction mode and the AIS filter information of the prediction unit provided by the image encoder. If the prediction mode of the current block is a mode that does not perform AIS filtering, the AIS filter may not be applied.
- the reference pixel interpolator may generate a reference pixel having an integer value or less by interpolating the reference pixel. If the prediction mode of the current prediction unit is a prediction mode for generating a prediction block without interpolating the reference pixel, the reference pixel may not be interpolated.
- the DC filter may generate the prediction block through filtering when the prediction mode of the current block is the DC mode.
- the reconstructed block or picture may be provided to the filter unit 240.
- the filter unit 240 may include a deblocking filter, an offset correction unit, and an ALF.
- Information about whether a deblocking filter is applied to a corresponding block or picture, and when the deblocking filter is applied to the corresponding block or picture, may be provided with information about whether a strong filter or a weak filter is applied.
- the deblocking filter related information provided by the image encoder may be provided and the deblocking filtering of the corresponding block may be performed in the image decoder.
- the offset correction unit may perform offset correction on the reconstructed image based on the type of offset correction and offset value information applied to the image during encoding.
- the ALF may be applied to a coding unit based on ALF application information, ALF coefficient information, and the like provided from the encoder. Such ALF information may be provided included in a specific parameter set.
- the memory 245 may store the reconstructed picture or block to use as a reference picture or reference block, and may provide the reconstructed picture to the output unit.
- a coding unit is used as a coding unit for convenience of description, but may also be a unit for performing decoding as well as encoding.
- the current block represents a block to be encoded / decoded, and according to the encoding / decoding step, a coding tree block (or a coding tree unit), an encoding block (or a coding unit), a transform block (or a transform unit), or a prediction block. (Or prediction unit) or the like.
- 'unit' may indicate a basic unit for performing a specific encoding / decoding process
- 'block' may indicate a sample array having a predetermined size.
- 'block' and 'unit' may be used interchangeably.
- the coding block (coding block) and the coding unit (coding unit) may be understood to have the same meaning.
- One picture may be divided into square or non-square basic blocks and encoded / decoded.
- the basic block may be referred to as a coding tree unit.
- a coding tree unit may be defined as a coding unit of the largest size allowed in a sequence or slice.
- Information indicating whether the coding tree unit is square or non-square or information related to the size of the coding tree unit may be signaled through a sequence parameter set, a picture parameter set or a slice header.
- the coding tree unit may be divided into smaller sized partitions.
- the partition generated by dividing the coding tree unit is called depth 1
- the partition generated by dividing the partition having depth 1 may be defined as depth 2. That is, a partition generated by dividing a partition that is a depth k in a coding tree unit may be defined as having a depth k + 1.
- a partition of any size generated as the coding tree unit is split may be defined as a coding unit.
- the coding unit may be split recursively or split into basic units for performing prediction, quantization, transform, or in-loop filtering.
- an arbitrary size partition generated as a coding unit is divided may be defined as a coding unit or a transform unit or a prediction unit that is a basic unit for performing prediction, quantization, transform, or in-loop filtering.
- a prediction block having the same size as that of the coding block or a size smaller than the coding block may be determined through prediction division of the coding block.
- Part_mode partition mode
- Information for determining a partition index indicating any one of the partition mode candidates may be signaled through the bitstream.
- the partition index of the coding block may be determined based on at least one of the size, shape, or coding mode of the coding block. The size or shape of the predictive block may be determined based on the partition mode specified by the partition index.
- Partition mode candidates may include asymmetric partition types (eg, nLx2N, nRx2N, 2NxnU, 2NxnD).
- the number or type of asymmetric partition mode candidates that a coding block may use may be determined based on at least one of the size, shape, or coding mode of the coding block.
- FIG. 3 is a diagram illustrating a partition mode candidate that can be applied to a coding block when the coding block is encoded by inter-screen prediction.
- any one of eight partition mode candidates illustrated in FIG. 3 may be applied to the coding block.
- a coding block when a coding block is encoded by intra prediction, only square partition division may be applied to the coding block. That is, when the coding block is encoded by intra prediction, partition mode PART_2Nx2N or PART_NxN may be applied to the coding block.
- PART_NxN may be applied when the coding block has a minimum size.
- the minimum size of the coding block may be predefined in the encoder and the decoder.
- information about the minimum size of the coding block may be signaled through the bitstream.
- the minimum size of the coding block may be signaled through the slice header. Accordingly, the minimum size of the coding block for each slice may be determined differently.
- the partition mode candidates available to the coding block may be determined differently according to at least one of the size or shape of the coding block.
- the number or type of partition mode candidates that a coding block may use may be differently determined according to at least one of the size or shape of the coding block.
- the type or number of asymmetric partition mode candidates that the coding block may use may be determined based on the size or shape of the coding block.
- the number or type of asymmetric partition mode candidates that a coding block may use may be determined differently according to at least one of the size or shape of the coding block. For example, when the coding block has a non-square shape whose width is greater than the height, at least one of PART_2NxN, PART_2NxnU, or PART_2NxnD may not be used as a partition mode candidate of the coding block.
- PART_Nx2N When the coding block has a non-square shape whose height is greater than the width, at least one of PART_Nx2N, PART_nLx2N, and PART_nRx2N may not be used as a partition mode candidate of the coding block.
- the size of the prediction block may have a size of 64x64 to 4x4.
- the prediction block may not have a 4x4 size in order to reduce the memory bandwidth.
- the partition mode it is also possible to recursively split the coding blocks. That is, based on the partition mode determined by the partition index, the coding block may be divided, and each partition generated as a result of the division of the coding block may be defined as the coding block.
- a coding unit may mean a coding tree unit or a coding unit included in a coding tree unit.
- 'partition' generated as the coding block is divided may mean 'coding block'.
- the division method described below may be applied to dividing a coding block into a plurality of prediction blocks or a plurality of transform blocks.
- the coding unit may be divided by at least one line.
- the angle of the line dividing the coding unit may be a value within a range of 0 degrees to 360 degrees.
- the angle of the horizontal line may be 0 degrees
- the angle of the vertical line may be 90 degrees
- the angle of the diagonal line in the upper right direction may be 45 degrees
- the angle of the upper left diagonal line may be 135 degrees.
- the plurality of lines may all have the same angle. Alternatively, at least one of the plurality of lines may have a different angle than the other lines. Alternatively, the coding tree unit or the plurality of lines dividing the coding unit may have a predefined angle difference (eg, 90 degrees).
- Information about a line dividing the coding unit may be determined by the partition mode. Alternatively, information about at least one of the number, direction, angle, or position of a line in a block may be encoded.
- a coding unit is divided into a plurality of coding units using at least one of a vertical line or a horizontal line.
- the number of vertical or horizontal lines partitioning the coding unit may be at least one.
- a coding unit may be divided into two partitions by using one vertical line or one horizontal line.
- the coding unit may be divided into three partitions by using two vertical lines or two horizontal lines.
- one vertical line and one horizontal line may be used to divide the coding unit into four partitions that are 1/2 smaller in width and height than the coding unit.
- the partitions may have a uniform size.
- either partition may have a different size than the remaining partitions, or each partition may have a different size.
- the coding unit may be divided into three partitions.
- the width ratio or height ratio of the three partitions may be n: 2n: n, 2n: n: n, or n: n: 2n.
- the division of the coding unit into four partitions will be referred to as quad-tree based partitioning.
- the division of the coding unit into two partitions will be referred to as binary tree based partitioning.
- the division of the coding unit into three partitions will be referred to as triple tree-based partitioning.
- one vertical line and / or one horizontal line will be shown to be used to divide the coding unit, but with more vertical lines and / or more horizontal lines than shown, It would also be within the scope of the present invention to divide the coding unit into more partitions than shown or fewer partitions than shown.
- FIG. 4 illustrates an example of hierarchically dividing a coding block based on a tree structure according to an embodiment to which the present invention is applied.
- the input video signal is decoded in predetermined block units, and the basic unit for decoding the input video signal in this way is called a coding block.
- the coding block may be a unit for performing intra / inter prediction, transformation, and quantization.
- a prediction mode eg, an intra prediction mode or an inter prediction mode
- the coding block can be a square or non-square block with any size in the range 8x8 to 64x64, and can be a square or non-square block with a size of 128x128, 256x256 or more.
- the coding block may be hierarchically divided based on at least one of a quad tree splitting method, a binary tree splitting method, or a triple tree splitting method.
- Quad tree-based partitioning may refer to a method in which a 2N ⁇ 2N coding block is divided into four N ⁇ N coding blocks.
- Binary tree based partitioning may refer to a method in which one coding block is divided into two coding blocks.
- Triple tree based splitting may refer to a method in which one coding block is divided into three coding blocks. Even if a binary tree or triple tree based splitting is performed, there may be a square coding block at a lower depth.
- Partitions created due to binary tree based partitioning may be symmetrical or asymmetrical.
- the coding block divided based on the binary tree may be a square block or a non-square block (eg, a rectangle).
- the partition form of a coding block based on binary tree partitioning is a large symmetric type such as 2NxN (horizontal non-square coding unit) or Nx2N (vertical non-square coding unit) or asymmetric such as nLx2N, nRx2N, 2NxnU or 2NxnD. It may include an (asymmetric) type. Only one of the symmetric type or the asymmetric type may be allowed in the split form of the coding block.
- the triple tree splitting form may include at least one of splitting a coding block into two vertical lines or splitting the coding block into two horizontal lines. Three non-square partitions can be created by triple tree partitioning.
- Triple tree splitting may include splitting a coding block into two horizontal lines or splitting a coding block into two vertical lines.
- the width ratio or height ratio of the partitions resulting from the splitting of the coding block may be n: 2n: n, 2n: n: n or n: n: 2n.
- the location of the partition having the largest width or height among the three partitions may be predefined in the encoder and the decoder. Alternatively, information indicating a partition having the largest width or height among the three partitions may be signaled through the bitstream.
- dividing the coding unit into square partitions may constitute quad-tree CU partitioning, and dividing the coding unit into symmetric non-square partitions may correspond to binary tree partitioning. have. Dividing the coding tree unit into square partitions and symmetric non-square partitions may correspond to quad and binary tree CU partitioning (QTBT).
- QTBT quad and binary tree CU partitioning
- Binary tree or triple tree based splitting may be performed on coding blocks in which quadtree based splitting is no longer performed.
- the coding block generated as a result of the binary tree or triple tree based splitting may be split into smaller coding blocks.
- at least one of quad tree division, triple tree division, or binary tree division may not be applied to the coding block.
- binary tree splitting in a predetermined direction or triple tree splitting in a predetermined direction may not be allowed in the coding block.
- quad-tree splitting and triple-tree splitting may not be allowed in a coding block generated as a result of binary tree or triple tree based splitting. Only binary tree splitting may be allowed in the coding block.
- only the coding block having the largest size among the three coding blocks generated as a result of the triple tree based splitting may be divided into smaller coding blocks.
- binary tree based splitting or triple tree based splitting may be allowed only to a coding block having the largest size among three coding blocks generated as a result of triple tree based splitting.
- the divided form of the lower depth partition may be determined depending on the divided form of the upper depth partition. For example, when the upper partition and the lower partition are partitioned based on the binary tree, only the partition based on the binary tree of the same type as the binary tree partition of the upper depth partition may be allowed in the lower depth partition. For example, when the binary tree splitting shape of the upper depth partition is 2NxN type, the binary tree splitting shape of the lower depth partition may also be set to 2NxN shape. Alternatively, when the binary tree partition type of the upper depth partition is Nx2N type, the partition shape of the lower depth partition may also be set to Nx2N type.
- a binary tree partition identical to the partition direction of the upper depth partition or a triple tree partition identical to the partition direction of the upper depth partition may not be allowed in the partition having the largest size among partitions generated as a result of the triple tree based partitioning. have.
- the split type of the lower depth partition may be determined in consideration of the split type of the upper depth partition and the split type of the neighboring lower depth partition. Specifically, when the upper depth partition is partitioned based on the binary tree, the partition type of the lower depth partition may be determined so that the same result as that of partitioning the upper depth partition based on the quad tree does not occur. For example, when the partition type of the upper depth partition is 2NxN and the partition type of the neighboring lower depth partition is Nx2N, the partition type of the current lower depth partition cannot be set to Nx2N. This is because, when the partition type of the current lower depth partition has Nx2N, the same result as that of the quadtree partition of the upper depth partition of the NxN type is caused.
- the partition type of the upper depth partition is Nx2N and the partition shape of the neighboring lower depth partition is 2NxN
- the partition type of the current lower depth partition cannot be set to 2NxN. That is, when the binary tree splitting shape of the upper depth partition and the binary tree splitting shape of the neighboring lower depth partition are different, the binary tree splitting shape of the current lower depth partition may be set to be the same as the binary tree splitting shape of the upper depth partition.
- the binary tree split type of the lower depth partition may be set differently from the binary tree split type of the upper depth partition.
- the allowed binary tree splitting forms may be determined.
- the binary tree splitting type allowed for a coding tree unit may be limited to 2NxN or Nx2N type.
- the allowed partition type may be predefined in the encoder or the decoder.
- information about an allowed split type or a not allowed split type may be encoded and signaled through a bitstream.
- FIG. 7 illustrates an example in which only a specific type of binary tree based partitioning is allowed.
- FIG. 7A illustrates an example in which only Nx2N type binary tree based partitioning is allowed
- FIG. 7B illustrates an example in which only 2NxN type binary tree based partitioning is allowed.
- information about quad tree splitting, information about binary tree splitting, or information about triple tree splitting may be used.
- the information about quad tree splitting may include at least one of information indicating whether quadtree-based splitting is performed or information on the size / depth of a coding block allowing quadtree-based splitting.
- the information about the binary tree splitting includes information indicating whether a binary tree-based split is performed, information indicating whether the binary tree-based split is vertical or horizontal, and a coding block allowing binary-tree based split. It may include at least one of information about the size / depth of the information or the size / depth of the coding block that is not allowed to be divided based on the binary tree.
- the information on triple tree splitting includes information indicating whether tripletree-based splitting is performed, information indicating whether tripletree-based splitting is vertical or horizontal, coding blocks that allow tripletree-based splitting. It may include at least one of information about the size / depth of the information or the size / depth of the coding block that is not allowed to split based on the triple tree.
- the information about the size of the coding block may indicate a minimum or maximum value of at least one of the width, the height, the product of the width and the height, or the width and the height ratio of the coding block.
- the binary tree-based Partitioning may not be allowed.
- Partitioning may not be allowed when the width or height of the coding block is less than or equal to the minimum size allowed for triple tree splitting, or when the split depth of the coding block is larger than the maximum depth allowed for triple tree splitting.
- Information about a binary tree or triple tree based splitting permission condition may be signaled through a bitstream.
- the information may be encoded in a sequence, picture or fragment image unit.
- the fragment image may mean at least one of a slice, a tile group, a tile, a brick, a coding block, a prediction block, or a transform block.
- a syntax 'max_mtt_depth_idx_minus1' indicating a maximum depth that allows binary tree / triple tree splitting may be encoded / decoded through the bitstream.
- max_mtt_depth_idx_minus1 + 1 may indicate the maximum depth allowed for binary tree / triple tree splitting.
- At least one of the number of times binary tree / triple tree splitting is allowed, the maximum depth allowed for binary tree / triple tree splitting, or the number of depths allowed for binary tree / triple tree splitting, may be signaled at the sequence or slice level.
- at least one of the number of binary tree / triple tree splits, the maximum depth allowed for binary tree / triple tree splits, or the number of depths allowed for binary tree / triple tree splits of the first and second slices may be different.
- binary tree / triple tree splitting is allowed only at one depth
- binary tree / triple tree splitting may be allowed at two depths.
- binary tree splitting is performed on a coding unit having a depth of 2 and a coding unit having a depth of 3. Accordingly, information indicating the number of times binary tree splitting has been performed in the coding tree unit (2 times), information indicating the maximum depth (depth 3) of the partition generated by the binary tree splitting in the coding tree unit, or the binary tree in the coding tree unit. At least one of information indicating the number of partition depths (2, depth 2, and depth 3) to which division is applied may be encoded / decoded through a bitstream.
- the number of times that binary tree / triple tree splitting is allowed, the depth of which binary tree / triple tree splitting is allowed or the number of depths that allow binary tree / triple tree splitting may be predefined in the encoder and the decoder. Or, based on at least one of the index or the size / type of the coding unit or the index of the sequence or slice, the number of times binary tree / triple tree splitting is allowed, or the depth or binary tree / triple tree splitting allowed The number of depths allowed can be determined. For example, in the first slice, binary tree / triple tree splitting may be allowed at one depth, and in the second slice, binary tree / triple tree splitting may be allowed at two depths.
- At least one of the number of times that binary tree splitting is allowed, the depth of allowing binary tree splitting, or the number of depths allowing binary tree splitting may be differently set according to a temporal identifier (TemporalID) of a slice or picture.
- TemporalID may be used to identify each of a plurality of layers of an image having at least one scalability among a view, a spatial, a temporal, or a quality. will be.
- the first coding block 300 having a split depth of k may be divided into a plurality of second coding blocks based on a quad tree.
- the second coding blocks 310 to 340 are square blocks having half the width and the height of the first coding block, and the split depth of the second coding block may be increased to k + 1.
- the second coding block 310 having the division depth k + 1 may be divided into a plurality of third coding blocks having the division depth k + 2. Partitioning of the second coding block 310 may be selectively performed using either a quart tree or a binary tree according to a partitioning scheme.
- the splitting scheme may be determined based on at least one of information indicating splitting based on the quad tree or information indicating splitting based on the binary tree.
- the second coding block 310 When the second coding block 310 is divided on the basis of the quart tree, the second coding block 310 is divided into four third coding blocks 310a having half the width and the height of the second coding block, The split depth can be increased to k + 2.
- the second coding block 310 when the second coding block 310 is divided on a binary tree basis, the second coding block 310 may be split into two third coding blocks. In this case, each of the two third coding blocks is a non-square block having one half of the width and the height of the second coding block, and the split depth may be increased to k + 2.
- the second coding block may be determined as a non-square block in the horizontal direction or the vertical direction according to the division direction, and the division direction may be determined based on information about whether the binary tree-based division is the vertical direction or the horizontal direction.
- the second coding block 310 may be determined as an end coding block that is no longer split based on the quad tree or the binary tree, and in this case, the corresponding coding block may be used as a prediction block or a transform block.
- the third coding block 310a may be determined as an end coding block like the division of the second coding block 310, or may be further divided based on a quad tree or a binary tree.
- the third coding block 310b split based on the binary tree may be further divided into a vertical coding block 310b-2 or a horizontal coding block 310b-3 based on the binary tree, and corresponding coding
- the partition depth of the block can be increased to k + 3.
- the third coding block 310b may be determined as an end coding block 310b-1 that is no longer split based on the binary tree, in which case the coding block 310b-1 may be used as a prediction block or a transform block. Can be.
- the above-described partitioning process allows information about the size / depth of a coding block that allows quad-tree based partitioning, information about the size / depth of the coding block that allows binary tree-based partitioning, or binary-tree based partitioning. It may be limitedly performed based on at least one of the information about the size / depth of the coding block that is not.
- a size candidate that a coding block may have is limited to a predetermined number, or the size of a coding block in a predetermined unit may have a fixed value.
- the size of the coding block in the sequence or the size of the coding block in the picture may be limited to have any one of 256x256, 128x128, or 32x32.
- Information representing the size of a coding block in a sequence or picture may be signaled through a sequence header or picture header.
- the coding unit may take a square or a rectangle of any size.
- the first coding block 300 having a split depth of k may be divided into a plurality of second coding blocks based on a quad tree.
- the second coding blocks 310 to 340 are square blocks having half the width and the height of the first coding block, and the split depth of the second coding block may be increased to k + 1.
- the second coding block 310 having the division depth k + 1 may be divided into a plurality of third coding blocks having the division depth k + 2. Partitioning of the second coding block 310 may be selectively performed using either a quart tree or a binary tree according to a partitioning scheme.
- the splitting scheme may be determined based on at least one of information indicating splitting based on the quad tree or information indicating splitting based on the binary tree.
- the second coding block 310 When the second coding block 310 is divided on the basis of the quart tree, the second coding block 310 is divided into four third coding blocks 310a having half the width and the height of the second coding block, The split depth can be increased to k + 2.
- the second coding block 310 when the second coding block 310 is divided on a binary tree basis, the second coding block 310 may be split into two third coding blocks. In this case, each of the two third coding blocks is a non-square block having one half of the width and the height of the second coding block, and the split depth may be increased to k + 2.
- the second coding block may be determined as a non-square block in the horizontal direction or the vertical direction according to the division direction, and the division direction may be determined based on information about whether the binary tree-based division is the vertical direction or the horizontal direction.
- the second coding block 310 may be determined as an end coding block that is no longer split based on the quad tree or the binary tree, and in this case, the corresponding coding block may be used as a prediction block or a transform block.
- the third coding block 310a may be determined as an end coding block like the division of the second coding block 310, or may be further divided based on a quad tree or a binary tree.
- the third coding block 310b split based on the binary tree may be further divided into a vertical coding block 310b-2 or a horizontal coding block 310b-3 based on the binary tree, and corresponding coding
- the partition depth of the block can be increased to k + 3.
- the third coding block 310b may be determined as an end coding block 310b-1 that is no longer split based on the binary tree, in which case the coding block 310b-1 may be used as a prediction block or a transform block. Can be.
- the above-described partitioning process allows information about the size / depth of a coding block that allows quad-tree based partitioning, information about the size / depth of the coding block that allows binary tree-based partitioning, or binary-tree based partitioning. It may be limitedly performed based on at least one of the information about the size / depth of the coding block that is not.
- a size candidate that a coding block may have is limited to a predetermined number, or the size of a coding block in a predetermined unit may have a fixed value.
- the size of the coding block in the sequence or the size of the coding block in the picture may be limited to have any one of 256x256, 128x128, or 32x32.
- Information representing the size of a coding block in a sequence or picture may be signaled through a sequence header or picture header.
- the coding unit may take a square or a rectangle of any size.
- Transform skip may be set not to be used for a coding unit generated as a result of binary tree based splitting or triple tree based splitting.
- the transform skip may be set to be applied to the non-square coding unit only in at least one of the vertical direction and the horizontal direction. For example, when the transform skip is applied in the horizontal direction, only scaling is performed without transform / inverse transform in the horizontal direction, and transform / inverse transform using DCT or DST is performed in the vertical direction. When transform skip is applied in the vertical direction, only scaling is performed without transform / inverse transform in the vertical direction, and transform / inverse transform using DCT or DST is performed in the horizontal direction.
- Information on whether to skip the inverse transform in the horizontal direction or information indicating whether to skip the inverse transform in the vertical direction may be signaled through the bitstream.
- information indicating whether to skip the inverse transform in the horizontal direction is a 1-bit flag, 'hor_transform_skip_flag'
- information indicating whether to skip the inverse transform in the vertical direction is a 1-bit flag and 'ver_transform_skip_flag'.
- the encoder may determine whether to encode 'hor_transform_skip_flag' or 'ver_transform_skip_flag' according to the size and / or shape of the current block. As an example, when the current block is Nx2N type, hor_transform_skip_flag may be encoded, and encoding of ver_transform_skip_flag may be omitted. If the current block has a 2N ⁇ N form, ver_transform_skip_flag may be encoded and hor_transform_skip_flag may be omitted.
- the transform skip may be applied to the horizontal direction and the transform / inverse transform may be performed on the vertical direction.
- transform skip may be applied in the vertical direction and transform / inverse transform may be performed in the horizontal direction.
- the transform / inverse transform may be performed based on at least one of DCT or DST.
- coding blocks that are no longer divided may be used as prediction blocks or transform blocks. That is, it can be used as a coding block, prediction block, or transform block generated as a result of quad tree partitioning or binary tree partitioning.
- a prediction image may be generated in units of coding blocks, and a residual signal that is a difference between the original image and the prediction image may be converted in units of coding blocks.
- motion information may be determined based on a coding block, or an intra prediction mode may be determined based on a coding block.
- the coding block may be encoded using at least one of a skip mode, an intra prediction or an inter prediction.
- the plurality of coding blocks generated by dividing the coding blocks may be configured to share at least one of motion information, merge candidates, reference samples, reference sample lines, or intra prediction modes.
- partitions generated by dividing the coding block may include at least one of motion information, merge candidate, reference sample, reference sample line, or intra prediction mode according to the size or shape of the coding block.
- Can share Alternatively, only some of the plurality of coding blocks may share the information, and the remaining coding blocks may be configured not to share the information.
- FIG. 9 is a flowchart illustrating an inter prediction method according to an embodiment to which the present invention is applied.
- the motion information of the current block may be determined (S910).
- the motion information of the current block may include at least one of a motion vector of the current block, a reference picture index of the current block, an inter prediction direction of the current block, or a weighted prediction weight index.
- the inter prediction direction of the current block indicates at least one of whether prediction is performed in the L0 direction or prediction in the L1 direction.
- the weighted prediction weights indicate weights applied to the L0 reference block and weights applied to the L1 reference block.
- the weighted prediction weight index indicates any one of the plurality of weighted prediction weight candidates.
- the motion vector of the current block may be determined based on the information signaled through the bitstream.
- the motion vector precision indicates the display unit of the motion vector of the current block.
- the motion vector precision of the current block may be determined by at least one of an integer fel, 1 ⁇ 2 fel, 1 ⁇ 4 fel or 1/8 fel.
- the motion vector precision may be determined in a picture unit, a slice unit, a tile group unit, a tile unit, or a block unit.
- a block may represent a coding tree unit, a coding unit, a prediction unit, or a transform unit.
- the motion information of the current block may be obtained based on at least one of information signaled through a bitstream or motion information of a neighboring block neighboring the current block.
- FIG. 10 is a diagram illustrating a process of deriving motion information of a current block when a merge mode is applied to the current block.
- the merge mode represents a method of deriving motion information of the current block from neighboring blocks.
- a spatial merge candidate may be derived from a spatial neighboring block of the current block (S1010).
- the spatial neighboring block may include at least one of a top boundary of the current block, a left boundary of the current block, or a block adjacent to a corner of the current block (eg, at least one of an upper left corner, an upper right corner, or a lower left corner).
- FIG. 11 is a diagram illustrating an example of a spatial neighboring block.
- the spatial neighboring block includes a neighboring block A 1 neighboring to the left of the current block, a neighboring block B 1 neighboring to the top of the current block, and a lower left corner of the current block. At least one of an adjacent neighboring block A 0 , a neighboring block B 0 adjacent to the upper right corner of the current block, and a neighboring block B 2 adjacent to the upper left corner of the current block may be included.
- Block A 1 may include a sample of position (-1, H-1).
- Block B 1 may include a sample of position (W-1, -1).
- Block A 0 may comprise a sample of the position (-1, H).
- Block B 0 may include a sample of position (W, -1).
- Block B 2 may include a sample of position (-1, -1).
- the spatial merge candidate may be derived from a block neighboring the upper left sample and the block neighboring the upper center sample of the current block.
- a block neighboring the upper left sample of the current block may include at least one of a block including a sample at a position (0, -1) or a block including a sample at a position (-1, 0).
- a spatial merge candidate may be derived from at least one of a block neighboring the top center sample of the current block or a block neighboring the left center sample.
- a block neighboring the upper middle sample of the current block may include a sample at a position (W / 2, -1).
- a block neighboring the left center sample of the current block may include a sample at the position (-1, H / 2).
- the location of the top neighboring block and / or the left neighboring block used to derive the spatial merge candidate may be determined. For example, when the size of the current block is greater than or equal to a threshold value, a spatial merge candidate may be derived from a block neighboring the top center sample of the current block and a block neighboring the left center sample. On the other hand, when the size of the current block is smaller than the threshold, the spatial merge candidate may be derived from a block neighboring the upper right sample and the lower left sample of the current block.
- the size of the current block may be expressed based on at least one of the width, the height, the sum of the width and the height, the product of the width and the height or the ratio of the width and the height of the current block.
- the threshold may be an integer of 2, 4, 8, 16, 32 or 128.
- the availability of the extended spatial neighboring block may be determined. For example, if the current block is a non-square block having a width greater than the height, a block neighboring the upper left sample of the current block, a block neighboring the left center sample, or a block neighboring the lower left sample of the current block is used. It may be determined that it is impossible. On the other hand, if the current block is a block whose height is greater than the width, it may be determined that the block neighboring the upper left sample of the current block, the block neighboring the upper center sample, or the block neighboring the upper right sample of the current block is unavailable. .
- the motion information of the spatial merge candidate may be set to be the same as the motion information of the spatial neighboring block.
- the spatial merge candidate can be determined by searching for neighboring blocks in a predetermined order. For example, in the example shown in FIG. 11, a search for spatial merge candidate determination may be performed in the order of A 1 , B 1 , B 0 , A 0, and B 2 .
- the B 2 block may be used when at least one of the remaining blocks (that is, A 1 , B 1 , B 0, and A 0 ) does not exist or at least one is encoded in the intra prediction mode.
- the search order of the spatial merge candidates may be predefined in the encoder / decoder. Alternatively, the search order of the spatial merge candidates may be adaptively determined according to the size or shape of the current block. Alternatively, the search order of the spatial merge candidates may be determined based on the information signaled through the bitstream.
- a temporal merge candidate may be derived from a temporal neighboring block of the current block (S1020).
- a temporal neighboring block may mean a co-located block included in a collocated picture.
- the collocated picture has a different temporal order (Picture Order Count, POC) than the current picture containing the current block.
- POC Picture Order Count
- the collocated picture may be determined as a picture having a predefined index in the reference picture list or a picture having a smallest difference in output order (POC) from the current picture.
- the collocated picture may be determined by the information signaled from the bitstream.
- the information signaled from the bitstream may include at least one of information indicating a reference picture list (eg, an L0 reference picture list or an L1 reference picture list) containing a collocated picture and / or an index pointing to a collocated picture in the reference picture list. It may include. Information for determining a collocated picture may be signaled in at least one of a picture parameter set, a slice header, or a block level.
- a reference picture list eg, an L0 reference picture list or an L1 reference picture list
- Information for determining a collocated picture may be signaled in at least one of a picture parameter set, a slice header, or a block level.
- the motion information of the temporal merge candidate may be determined based on the motion information of the collocated block.
- the motion vector of the temporal merge candidate may be determined based on the motion vector of the collocated block.
- the motion vector of the temporal merge candidate may be set to be the same as the motion vector of the collocated block.
- the motion vector of the temporal merge candidate is based on the output order (POC) difference between the current picture and the reference picture of the current block and / or the output order (POC) difference between the collocated picture and the reference picture of the collocated picture. By scaling the motion vector of the collocated block.
- FIG. 12 is a diagram for describing an example of deriving a motion vector of a temporal merge candidate.
- tb represents the POC difference between the current picture (curr_pic) and the reference picture (curr_ref) of the current picture
- td represents between the collocated picture (col_pic) and the reference picture of the collocated block ( col_ref).
- the motion vector of the temporal merge candidate may be derived by scaling the motion vector of the collocated block col_PU based on tb and / or td.
- both the motion vector of the collocated block and the scaled motion vector may be used as the motion vector of the temporal merge candidate.
- the motion vector of the collocated block may be set as the motion vector of the first temporal merge candidate
- the scaled value of the motion vector of the collocated block may be set as the motion vector of the second temporal merge candidate.
- the inter prediction direction of the temporal merge candidate may be set to be the same as the inter prediction direction of the temporal neighboring block.
- the reference picture index of the temporal merge candidate may have a fixed value.
- the reference picture index of the temporal merge candidate may be set to '0'.
- the reference picture index of the temporal merge candidate may be adaptively determined based on at least one of the reference picture index of the spatial merge candidate and the reference picture index of the current picture.
- the collocated block may be determined as any block in the block having the same position and size as the current block in the collocated picture or a block adjacent to a block having the same position and size as the current block.
- FIG. 13 illustrates positions of candidate blocks that can be used as collocated blocks.
- the candidate block may include at least one of a block adjacent to the upper left corner position of the current block in the collocated picture, a block adjacent to the center sample position of the current block, or a block adjacent to the lower left corner position of the current block.
- the candidate block may include a block TL including the upper left sample position of the current block in the collocated picture, a block BR including the lower right sample position of the current block, and a lower right corner of the current block.
- a block including a location of a neighboring block adjacent to a predetermined boundary of a current block in a collocated picture may be selected as a collocated block.
- the number of temporal merge candidates may be one or more. For example, based on one or more collocated blocks, one or more temporal merge candidates may be derived.
- the maximum number of temporal merge candidates may be encoded and signaled in the encoder.
- the maximum number of temporal merge candidates may be derived based on the maximum number of merge candidates and / or the maximum number of spatial merge candidates that may be included in the merge candidate list.
- the maximum number of temporal merge candidates may be determined based on the number of available collocated blocks.
- the availability of candidate blocks may be determined according to a predetermined priority, and at least one collocated block may be determined based on the determination and the maximum number of temporal merge candidates. For example, when the block C3 including the center sample position of the current block and the block H adjacent to the lower right corner of the current block are candidate blocks, one of the C3 block and the H block is a collocated block. You can decide. If an H block is available, the H block may be determined as a collocated block.
- the C3 block may be determined as the collocated block.
- the unavailable block is replaced with another available block.
- Another block that replaces the unavailable block is at least one of a block adjacent to the center sample position of the current block (e.g., C0 and / or C3) or a block adjacent to the upper left corner position of the current block (e.g., TL) within the collocated picture. It may include one.
- an unusable block can be replaced with another available block.
- a merge candidate list including a spatial merge candidate and a temporal merge candidate may be generated (S1030).
- the merge candidate having the same motion information as the previously added merge candidate may be deleted from the merge candidate list.
- Information about the maximum number of merge candidates may be signaled through the bitstream. For example, information representing the maximum number of merge candidates may be signaled through a sequence parameter or a picture parameter. For example, if the maximum number of merge candidates is six, six may be selected by adding the spatial merge candidates and the temporal merge candidates. For example, five of five spatial merge candidates may be selected, and one of two temporal merge candidates may be selected.
- the maximum number of merge candidates may be predefined in the encoder and the decoder.
- the maximum number of merge candidates may be two, three, four, five, or six.
- the maximum number of merge candidates may be determined based on at least one of whether to perform merge with MVD (MMVD), mixed prediction, or triangular partitioning.
- the merge candidates included in the second merge candidate list may be added to the merge candidate list.
- the second merge candidate list may include a merge candidate derived based on motion information of a block encoded / decoded by inter prediction before the current block. For example, when motion compensation is performed on a block having an encoding mode of inter prediction, a merge candidate derived based on the motion information of the block may be added to the second merge candidate list. When encoding / decoding of the current block is completed, motion information of the current block may be added to the second merge candidate list for inter prediction of the next block.
- the second merge candidate list may be initialized on a CTU, tile or slice basis.
- the maximum number of merge candidates that may be included in the second merge candidate list may be predefined in the encoder and the decoder. Alternatively, information representing the maximum number of merge candidates that may be included in the second merge candidate list may be signaled through the bitstream.
- the indexes of merge candidates included in the second merge candidate list may be determined based on the order added to the second merge candidate list. For example, an index allocated to the merge candidate added to the Nth second merge candidate list may have a smaller value than an index allocated to the merge candidate added to the N + 1th merge candidate list. For example, the index of the N + 1th merge candidate may be set to a value greater than 1 by the index of the Nth merge candidate. Alternatively, the index of the Nth merge candidate may be set as the index of the N + 1th merge candidate, and the value of the index of the Nth merge candidate may be subtracted by one.
- the index allocated to the merge candidate added to the Nth second merge candidate list may have a larger value than the index allocated to the merge candidate added to the N + 1th merge candidate list.
- the index of the Nth merge candidate may be set as the index of the N + 1th merge candidate, and the value of the index of the Nth merge candidate may be increased by one.
- Whether to add a merge candidate derived from the block to the second merge candidate list based on whether the motion information of the block on which motion compensation is performed is identical to the motion information of the merge candidate included in the second merge candidate list You can decide. For example, when a merge candidate equal to the motion information of the block is included in the second merge candidate list, the merge candidate derived based on the motion information of the block may not be added to the second merge candidate list. Or, if a merge candidate equal to the motion information of the block is included in the second merge candidate list, the merge candidate is deleted from the second merge candidate list, and the merge candidate derived based on the motion information of the block is removed. Can be added to the merge candidate list.
- the merge candidate with the lowest index or the merge candidate with the highest index is deleted from the second merge candidate list and based on the motion information of the block.
- the merge candidate derived as may be added to the second merge candidate list. That is, after deleting the oldest merge candidate among the merge candidates included in the second merge candidate list, the merge candidate derived based on the motion information of the block may be added to the second merge candidate list.
- a merge candidate having a combination of two or more merge candidates or a merge candidate having a (0,0) zero motion vector It may be included in the merge candidate list.
- an average merge candidate obtained by averaging motion vectors of two or more merge candidates may be added to the merge candidate list.
- the average merge candidate may be derived by averaging motion vectors of two or more merge candidates included in the merge candidate list.
- the average merge candidate may be obtained by averaging the motion vector of the first merge candidate and the motion vector of the second merge candidate.
- the L0 motion vector of the average merge candidate is derived by averaging the L0 motion vector of the first merge candidate and the L0 motion vector of the second merge candidate
- the L1 motion vector of the average merge candidate is the L1 motion vector of the first merge candidate.
- the motion vector of the bidirectional merge candidate may be set as the L0 motion vector or the L1 motion vector of the average merge candidate. have.
- the L0 motion vector of the average merge candidate is the L0 motion vector and the first merge candidate.
- the average merge candidate L1 motion vector may be derived as the first merge candidate L1 motion vector.
- the motion vector of the first merge candidate or the second merge candidate is considered in consideration of the distance (ie, the POC difference) between the current picture and the reference picture of each merge candidate.
- the average merge candidate may be derived by averaging the motion vector of the first merge candidate and the motion vector of the scaled second merge candidate.
- the priority is set based on the size of the reference picture index of each merge candidate, the distance between the current block and the reference picture of each merge candidate, or whether bidirectional prediction is applied, and the like. Scaling may be applied to the motion vector of the candidate.
- the reference picture index of the average merge candidate may be set to point to a reference picture at a specific position in the reference picture list.
- the reference picture index of the average merge candidate may indicate the first or last reference picture of the reference picture list.
- the reference picture index of the average merge candidate may be set to be the same as the reference picture index of the first merge candidate or the second merge candidate.
- the reference picture index of the average merge candidate may be set to be the same as the reference picture index of the first merge candidate and the second merge candidate.
- the reference picture index of the merge candidate with the higher priority may be set as the reference picture index of the average merge candidate.
- the reference picture index of the first merge candidate to which the bidirectional prediction is applied may be determined as the reference picture index of the average merge candidate.
- the combining order for generating the average merge candidate may be determined.
- the priority may be predefined in the encoder and the decoder.
- the combination order may be determined based on whether the merge candidate is bidirectionally predicted or not. For example, the combination of merge candidates encoded by bidirectional prediction may be set to have a higher priority than the combination of merge candidates encoded by unidirectional prediction.
- the combining order may be determined based on the reference picture of the merge candidate. For example, the combination of merge candidates having the same reference picture may have a higher priority than the combination of merge candidates having different reference pictures.
- the merge candidate may be included in the merge candidate list according to a predefined priority. The higher the priority, the smaller the index assigned to the merge candidate.
- the spatial merge candidate may be added to the merge candidate list before the temporal merge candidate.
- the spatial merge candidates are adjacent to the spatial merge candidate of the left neighboring block, the spatial merge candidate of the upper neighboring block, the spatial merge candidate of the block adjacent to the upper right corner, the spatial merge candidate of the block adjacent to the lower left corner, and the upper left corner.
- the blocks may be added to the merge candidate list in the spatial merge candidate order.
- the spatial merge candidate derived from the neighboring block (B2 of FIG. 11) adjacent to the upper left corner of the current block may be set to be added to the merge candidate list in a lower order than the temporal merge candidate.
- priority may be determined between merge candidates according to the size or shape of the current block. For example, when the current block has a rectangular shape having a width greater than the height, the spatial merge candidate of the left neighboring block may be added to the merge candidate list before the spatial merge candidate of the upper neighboring block. On the other hand, if the current block has a rectangular shape whose height is greater than the width, the spatial merge candidate of the upper neighboring block may be added to the merge candidate list before the spatial merge candidate of the left neighboring block.
- the priority of the merge candidates may be determined according to the motion information of each of the merge candidates. For example, a merge candidate having bidirectional motion information may have a higher priority than a merge candidate having unidirectional motion information. Accordingly, a merge candidate having bidirectional motion information may be added to the merge candidate list before the merge candidate having unidirectional motion information.
- the merge candidates may be rearranged.
- Rearrangement may be performed based on the motion information of the merge candidates. For example, the rearrangement may be performed based on at least one of whether the merge candidate has bidirectional motion information, the size of the motion vector, the motion vector precision, or the temporal order (POC) between the current picture and the reference picture of the merge candidate. .
- the rearrangement may be performed to have a higher priority than the merge candidate having the unidirectional merge candidate than after the merge having the bidirectional motion information.
- rearrangement may be performed such that a merge candidate having a small motion vector precision has a higher priority than a merge candidate having a motion vector precision of an integer.
- At least one of the merge candidates included in the merge candidate list may be specified based on the merge candidate index (S1040).
- a merge candidate index for specifying at least one of the merge candidates included in the merge candidate list may be signaled through the bitstream.
- the motion information of the current block may be set to be the same as the motion information of the merge candidate specified by the merge candidate index (S1050).
- the motion information of the current block may be set to be the same as the motion information of the spatial neighboring block.
- the motion information of the current block may be set to be the same as the motion information of the temporal neighboring block.
- the partition may be a coding unit, a prediction unit or a transform unit.
- the plurality of partitions may be created by applying quad tree partitioning, binary tree partitioning, triple tree partitioning, or triangular partitioning to the coding unit.
- the partition can be square, non-square or triangular.
- the merge candidate derivation order between the plurality of partitions may be based on a priority or a predetermined order between the partitions.
- the priority or predetermined order may be determined based on at least one of an encoding / decoding order of partitions, a block scan order, a last scan order, a size, a shape, a partition index, or a location.
- a merge candidate derivation order may be determined based on an encoding / decoding order. For example, a partition having an earlier encoding / decoding order may derive a merge candidate before a partition having a later encoding / decoding order.
- a partition having a derivation order of merge candidates is referred to as a first partition
- a partition having a derivation order of merge candidates is referred to as a second partition.
- the merge candidate of the second partition may be determined in consideration of the motion information, the merge candidate, or the merge index of the first partition.
- a merge candidate ie, a merge candidate indicated by the merge index of the first partition
- the merge candidate used to derive the motion information of the first partition may be determined to be unavailable as the merge candidate of the second partition.
- a merge candidate having the same motion information as the motion information of the first partition among the merge candidates of the second coding unit may be set not to be used as the merge candidate of the second coding unit.
- a merge candidate having the same motion information as the motion information of the first partition may be determined to be unavailable as the merge candidate of the second partition.
- FIG. 14 is a diagram illustrating a process of deriving motion information of a current block when an AMVP mode is applied to the current block.
- At least one of the inter prediction direction or the reference picture index of the current block may be decoded from the bitstream (S1410). That is, when the AMVP mode is applied, at least one of the inter prediction direction or the reference picture index of the current block may be determined based on information encoded through the bitstream.
- a spatial motion vector candidate may be determined based on the motion vector of the spatial neighboring block of the current block (S1420).
- the spatial motion vector candidate may include at least one of a first spatial motion vector candidate derived from an upper neighboring block of the current block or a second spatial motion vector candidate derived from a left neighboring block of the current block.
- the upper neighboring block includes at least one of the blocks adjacent to the upper or upper right corner of the current block
- the left neighboring block of the current block includes at least one of the blocks adjacent to the left or lower left corner of the current block.
- the block adjacent to the upper left corner of the current block may be treated as the upper neighboring block, or may be treated as the left neighboring block.
- a spatial motion vector candidate may be derived from a spatial non-neighbor block not neighboring the current block. For example, a block located on the same vertical line as the block adjacent to the upper, upper right corner, or upper left corner of the current block, a block located on the same horizontal line as the block adjacent to the left, lower left corner, or upper left corner of the current block. Alternatively, at least one of blocks located on the same diagonal line as a block adjacent to a corner of the current block may be used to derive a spatial motion vector candidate of the current block. When the spatial neighboring block is unavailable, the spatial non-neighborhood block can be used to derive the spatial motion vector candidate.
- two or more spatial motion vector candidates may be derived using spatial neighboring blocks and spatial non-neighborhood blocks.
- a first spatial motion vector candidate and a second spatial motion vector candidate are derived based on neighboring blocks adjacent to the current block, while neighboring the neighboring blocks that are not neighboring the current block but are neighboring the neighboring blocks.
- a third spatial motion vector candidate and / or a fourth spatial motion vector candidate are derived using spatial neighboring blocks and spatial non-neighborhood blocks.
- the spatial motion vector may be obtained by scaling the motion vector of the spatial neighboring block.
- a temporal motion vector candidate may be determined based on the motion vector of the temporal neighboring block of the current block (S1430). If the reference picture is different between the current block and the temporal neighboring block, the temporal motion vector may be obtained by scaling the motion vector of the temporal neighboring block. In this case, temporal motion vector candidates may be derived only when the number of spatial motion vector candidates is less than or equal to a predetermined number.
- a motion vector candidate list including a spatial motion vector candidate and a temporal motion vector candidate may be generated (S1440).
- At least one of the motion vector candidates included in the motion vector candidate list may be specified based on information for specifying at least one of the motion vector candidate lists (S1450).
- the motion vector candidate specified by the information may be set as a motion vector prediction value of the current block, and the motion vector difference value is added to the motion vector prediction value to obtain a motion vector of the current block (S1460).
- the motion vector difference value may be parsed through the bitstream.
- motion compensation for the current block may be performed based on the obtained motion information (S920).
- motion compensation for the current block may be performed based on the inter prediction direction, the reference picture index, and the motion vector of the current block.
- the inter prediction direction indicates whether to predict the L0 direction, whether to predict the L1 direction, or whether to predict the bidirectional direction.
- the prediction block of the current block may be obtained based on a weighted sum operation or an average operation of the L0 reference block and the L1 reference block.
- the current block may be restored based on the generated prediction sample.
- a reconstructed sample may be obtained by adding the prediction sample and the residual sample of the current block.
- the merge candidate of the current block may be derived based on the motion information of the block encoded / decoded by inter prediction before the current block.
- a merge candidate of the current block may be derived based on the motion information of the neighboring block of the predefined location adjacent to the current block.
- the neighboring block includes a block adjacent to the left side of the current block, a block adjacent to the top of the current block, a block adjacent to the upper left corner of the current block, a block adjacent to the upper right corner of the current block, or a lower left corner of the current block. It may include at least one of the blocks adjacent to the corner.
- the merge candidate of the current block may be derived based on the motion information of blocks other than the neighboring block.
- a neighboring block at a predetermined position adjacent to the current block is called a first merge candidate block
- a block having a position different from the first merge candidate block is called a second merge candidate block.
- the second merge candidate block may include at least one of a block encoded / decoded by inter prediction before the current block, a block adjacent to the first merge candidate block, or a block located on the same line as the first merge candidate block.
- FIG. 15 illustrates a second merge candidate block adjacent to the first merge candidate block
- FIG. 16 illustrates a second merge candidate block located on the same line as the first merge candidate block.
- the merge candidate derived based on the motion information of the second merge candidate block may be added to the merge candidate list. Or, even if at least one of the spatial merge candidate or the temporal merge candidate is added to the merge candidate list, when the number of merge candidates included in the merge candidate list is smaller than the maximum merge candidate number, based on the motion information of the second merge candidate block The derived merge candidate may be added to the merge candidate list.
- FIG. 15 illustrates an example of deriving a merge candidate from a second merge candidate block when the first merge candidate block is not available.
- the merge candidate of the current block can be derived by replacing the unused first merge candidate block with the second merge candidate block.
- a block placed in a predetermined direction from the first merge candidate block among blocks adjacent to the first merge candidate block may be set as the second merge candidate block.
- the predefined direction may indicate a left direction, a right direction, an upper direction, a lower direction, or a diagonal direction.
- a predefined direction may be set for each first merge candidate block.
- the predefined direction of the first merge candidate block adjacent to the left side of the current block may be a left direction.
- the predefined direction of the first merge candidate block adjacent to the top of the current block may be a top direction.
- the predefined direction of the first merge candidate block adjacent to the corner of the current block may include at least one of a left direction, an upper direction, or a diagonal direction.
- a merge candidate of the current block may be derived based on B0 adjacent to A0. If A1 adjacent to the top of the current block is not available, a merge candidate of the current block may be derived based on B1 adjacent to A1. If A2 adjacent to the upper right corner of the current block is not available, a merge candidate of the current block may be derived based on B2 adjacent to A2. If A3 adjacent to the lower left corner of the current block is not available, a merge candidate of the current block may be derived based on B3 adjacent to A3. If A4 adjacent to the upper left corner of the current block is not available, a merge candidate of the current block may be derived based on at least one of B4 to B6 adjacent to A4.
- the illustrated example of FIG. 15 is only for describing an embodiment of the present invention, but does not limit the present invention.
- the position of the second merge candidate block may be set differently from the illustrated example of FIG. 15.
- the second merge candidate block adjacent to the first merge candidate block adjacent to the left side of the current block may be located in an upper direction or a lower direction of the first merge candidate block.
- the second merge candidate block adjacent to the first merge candidate block adjacent to the top of the current block may be located in the left direction or the right direction of the first merge candidate block.
- FIG. 16 illustrates an example of deriving a merge candidate from a second merge candidate block located on the same line as the first merge candidate block.
- the block located on the same line as the first merge candidate block may be a block located on the same horizontal line as the first merge candidate block, a block located on the same vertical line as the first merge candidate block, or the same as the first merge candidate block. It may include at least one of the blocks located on the diagonal.
- the y coordinate positions of blocks located on the same horizontal line are the same.
- the x coordinate positions of blocks located on the same vertical line are the same.
- the difference value of the x-coordinate position of blocks located on the same diagonal line is the same as the difference value of the y-coordinate position.
- the second merge candidate blocks positioned on the same vertical line as the first merge candidate block based on the rightmost block (eg, block A1 including (W-1, -1) coordinates) of the upper side of the coding block For example, the positions of B4, C6) are shown to be determined.
- the positions of the second merge candidate blocks may include a leftmost block (eg, a block including (0, -1) coordinates) at the top of the coding block or a block located at the top center of the coding block (eg, (W / 2, -1) a block including coordinates). Further, the positions of the second merge candidate blocks may be located at the topmost block (eg, a block including (-1, 0) coordinates) on the left side of the coding block or at a block (eg, (-1, H / 2) at the center left of the coding block ) Block containing the coordinates).
- a leftmost block eg, a block including (0, -1) coordinates
- the positions of the second merge candidate blocks may be located at the topmost block (eg, a block including (-1, 0) coordinates) on the left side of the coding block or at a block (eg, (-1, H / 2) at the center left of the coding block ) Block containing the coordinates).
- the second merge candidate block may be determined using all or part of the plurality of top neighboring blocks. For example, by using a block of a specific position among the plurality of upper neighboring blocks (eg, at least one of the upper neighboring block located at the leftmost side, the upper neighboring block located at the rightmost side, or the upper neighboring block located at the center) Two merge candidate blocks may be determined.
- the number of upper neighboring blocks used to determine the second merge candidate block among the plurality of upper neighboring blocks may be one, two, three, or more.
- the second merge candidate block may be determined using all or part of the plurality of left neighboring blocks.
- a second block may be formed by using a block (eg, at least one of a left neighboring block located at the bottommost side, a left neighboring block located at the topmost position, or a centered left neighboring block) among a plurality of left neighboring blocks.
- the merge candidate block may be determined.
- the number of left neighboring blocks used to determine the second merge candidate block among the plurality of left neighboring blocks may be one, two, three, or more.
- the position and / or number of the top neighboring block and / or the left neighboring block used to determine the second merge candidate block may be determined differently. For example, when the size of the current block is greater than or equal to a threshold value, the second merge candidate block may be determined based on the upper middle block and / or the left central block. On the other hand, when the size of the current block is smaller than the threshold value, the second merge candidate block may be determined based on the upper rightmost block and / or the left lowermost block.
- the threshold may be an integer of 8, 16, 32, 64 or 128.
- a first merge candidate list and a second merge candidate list may be configured, and motion compensation of the current block may be performed based on at least one of the first merge candidate list or the second merge candidate list.
- the first merge candidate list may include at least one of a spatial merge candidate derived based on motion information of a neighboring block at a predetermined position adjacent to the current block, or a temporal merge candidate derived based on motion information of a collocated block. It may include.
- the second merge candidate list may include a merge candidate derived based on the motion information of the second merge candidate block.
- the first merge candidate list includes a merge candidate derived from the first merge candidate block
- the second merge candidate list includes a merge candidate derived from the second merge candidate block.
- merge candidates derived from A0 to A4 blocks may be added to the first merge candidate list
- merge candidates derived from B0 to B6 blocks may be added to the second merge candidate list.
- merge candidates derived from A0 to A4 blocks are added to the first merge candidate list
- merge candidates derived from B0 to B5, C0 to C7 blocks are added to the second merge candidate list.
- the second merge candidate list may include a merge candidate derived based on motion information of a block encoded / decoded by inter prediction before the current block. For example, when motion compensation is performed on a block having an encoding mode of inter prediction, a merge candidate derived based on the motion information of the block may be added to the second merge candidate list. When encoding / decoding of the current block is completed, motion information of the current block may be added to the second merge candidate list for inter prediction of the next block.
- the indexes of merge candidates included in the second merge candidate list may be determined based on the order added to the second merge candidate list. For example, an index allocated to the merge candidate added to the Nth second merge candidate list may have a smaller value than an index allocated to the merge candidate added to the N + 1th merge candidate list. For example, the index of the N + 1th merge candidate may be set to a value greater than 1 by the index of the Nth merge candidate. Alternatively, the index of the Nth merge candidate may be set as the index of the N + 1th merge candidate, and the value of the index of the Nth merge candidate may be subtracted by one.
- the index allocated to the merge candidate added to the Nth second merge candidate list may have a larger value than the index allocated to the merge candidate added to the N + 1th merge candidate list.
- the index of the Nth merge candidate may be set as the index of the N + 1th merge candidate, and the value of the index of the Nth merge candidate may be increased by one.
- Whether to add a merge candidate derived from the block to the second merge candidate list based on whether the motion information of the block on which motion compensation is performed is identical to the motion information of the merge candidate included in the second merge candidate list You can decide. For example, when a merge candidate equal to the motion information of the block is included in the second merge candidate list, the merge candidate derived based on the motion information of the block may not be added to the second merge candidate list. Or, if a merge candidate equal to the motion information of the block is included in the second merge candidate list, the merge candidate is deleted from the second merge candidate list, and the merge candidate derived based on the motion information of the block is removed. Can be added to the merge candidate list.
- the merge candidate with the lowest index or the merge candidate with the highest index is deleted from the second merge candidate list and based on the motion information of the block.
- the merge candidate derived as may be added to the second merge candidate list. That is, after deleting the oldest merge candidate among the merge candidates included in the second merge candidate list, the merge candidate derived based on the motion information of the block may be added to the second merge candidate list.
- the second merge candidate list may be initialized on a CTU, tile or slice basis. That is, a block included in a different CTU, a different tile, or a different slice than the current block may be set as unavailable as the second merge candidate block.
- the maximum number of merge candidates that may be included in the second merge candidate list may be predefined in the encoder and the decoder. Alternatively, information representing the maximum number of merge candidates that may be included in the second merge candidate list may be signaled through the bitstream.
- One of the first merge candidate list and the second merge candidate list may be selected, and inter prediction of the current block may be performed using the selected merge candidate list. Specifically, based on the index information, any one of the merge candidates included in the merge candidate list may be specified, and motion information of the current block may be obtained from the selected merge candidate.
- Information specifying one of the first merge candidate list and the second merge candidate list may be signaled through the bitstream.
- the decoder may select one of the first merge candidate list and the second merge candidate list based on the information.
- a merge candidate list having a larger number of available merge candidates among the first merge candidate list and the second merge candidate list may be selected.
- one of the first merge candidate list and the second merge candidate list may be selected based on at least one of the size, shape, or split depth of the current block.
- a merge candidate list configured by adding (or appending) one of the first merge candidate list and the second merge candidate list to another one may be used.
- inter prediction may be performed based on a merge candidate list including at least one merge candidate included in the first merge candidate list and at least one merge candidate included in the second merge candidate list.
- a merge candidate included in the second merge candidate list may be added to the first merge candidate list.
- a merge candidate included in the first merge candidate list may be added to the second merge candidate.
- the merge candidates included in the second merge candidate list may be added to the first merge candidate list. have.
- a merge candidate derived from a block adjacent to the first merge candidate block among merge candidates included in the second merge candidate list may be added to the first merge candidate list.
- a merge candidate derived based on motion information of B0 among merge candidates included in the second merge candidate list may be added to the first merge candidate list.
- the merge candidate derived based on the motion information of B1 among the merge candidates included in the second merge candidate list may be added to the first merge candidate list.
- the merge candidate derived based on the motion information of B2 among the merge candidates included in the second merge candidate list may be added to the first merge candidate list.
- the merge candidate derived based on the motion information of B3 among the merge candidates included in the second merge candidate list may be added to the first merge candidate list. If A4 is not available, the merge candidate derived based on the motion information of B4, B5, or B6 among the merge candidates included in the second merge candidate list may be added to the first merge candidate list.
- the merge candidate to be added to the first merge candidate list may be determined according to the priority of the merge candidates included in the second merge candidate list.
- the priority may be determined based on an index value assigned to each merge candidate. For example, when the number of merge candidates included in the first merge candidate list is smaller than the maximum number or when the first merge candidate block is unavailable, the merge candidates having the smallest index value among the merge candidates included in the second merge candidate list Alternatively, a merge candidate having the largest index value may be added to the first merge candidate list.
- the merge candidate having the highest priority is the first merge candidate list. May not be added to And a merge candidate having a priority of the next priority (for example, an index value larger than 1 assigned to the index value assigned to the merge candidate having the highest priority) or 1 than the index value assigned to the merge candidate having the highest priority. It may be determined whether a merge candidate to which this small index value is assigned may be added to the first merge candidate list.
- a merge candidate list including both a merge candidate derived based on the motion information of the first merge candidate block and a merge candidate derived based on the motion information of the second merge candidate block may be generated.
- the merge candidate list may be a combination of a first merge candidate list and a second merge candidate list.
- the merge candidate list may be generated by searching for the first merge candidate block and the second merge candidate block according to a predetermined search order.
- 17 to 20 are diagrams illustrating a search order of merge candidate blocks.
- B5 and B6 may be searched only when the B4 block is not available or when the number of merge candidates included in the merge candidate list is equal to or less than a preset number.
- a search order different from the example of FIGS. 17-20 may be set.
- a combined merge candidate list including at least one merge candidate included in the first merge candidate list and at least one merge candidate included in the second merge candidate list may be generated.
- the combined merge candidate list may include N of the merge candidates included in the first merge candidate list and M of the merge candidates included in the second merge candidate list.
- N and M may represent the same number or different numbers.
- at least one of N or M may be determined based on at least one of the number of merge candidates included in the first merge candidate list or the number of merge candidates included in the second merge candidate list.
- information for determining at least one of N or M may be signaled through the bitstream. Either N or M may be derived by subtracting the other one from the maximum merge candidate number of the combined merge candidate list.
- Merge candidates added to the combined merge candidate list may be determined according to a predefined priority.
- the predefined priority may be determined based on an index assigned to merge candidates.
- a merge candidate to be added to the combined merge candidate list may be determined based on the association between the merge candidates. For example, if A0 included in the first merge candidate list is added to the combined merge candidate list, the merge candidate (eg, B0) adjacent to A0 may not be added to the combined merge list.
- the number of merge candidates included in the first merge candidate list is less than N
- more than M merge candidates among the merge candidates included in the second merge candidate list may be added to the combined merge candidate list. For example, when N is 4 and M is 2, four of the merge candidates included in the first merge candidate list are added to the combined merge candidate list, and two of the merge candidates included in the second merge candidate list are added. Can be added to the combined merge candidate list. If the number of merge candidates included in the first merge candidate list is smaller than four, two or more merge candidates among merge candidates included in the second merge candidate list may be added to the combined merge candidate list. If the number of merge candidates included in the second merge candidate list is less than 2, four or more of the merge candidates included in the first merge candidate list may be added to the combined merge candidate list.
- N or M may be adjusted according to the number of merge candidates included in each merge candidate list.
- the total number of merge candidates included in the combined merge candidate list may be fixed.
- the combined merge candidate, the average merge candidate, or the zero motion vector candidate may be added.
- the rectangular block may be divided into a plurality of triangular blocks. Merge candidates of the triangular blocks may be derived based on the rectangular block block including the triangular blocks. Triangular blocks may share the same merge candidate.
- the merge index may be signaled for each of the triangular blocks.
- the triangular blocks may be set not to use the same merge candidate.
- the merge candidate used in the first triangular block may not be used as the merge candidate of the second triangular block.
- the merge index of the second triangular block may specify any one of the remaining merge candidates except the merge candidate selected from the first triangular block.
- Merge candidates may be derived based on blocks of a predetermined shape or a predetermined size or more.
- the merge candidate of the current block may be derived based on a predetermined shape including the current block or a block of a predetermined size or more.
- the predetermined form may be square or non-square.
- the merge candidate for the non-square type coding unit may be derived based on the square type coding unit including the non-square type coding unit.
- 21 illustrates an example in which a merge candidate of a non-square block is derived based on the square block.
- the merge candidate of the non-square block may be derived based on the square block including the non-square block.
- a merge candidate of the non-square coding block 0 and the non-square coding block 1 may be derived based on a square block including coding block 0 and coding block 1. That is, the position of the spatial neighboring block may be determined based on the position, width / height, or size of the square block.
- the merge candidates of the coding block 0 and the coding block 1 may be derived based on at least one of the spatial neighboring blocks A0, A1, A2, A3, or A4 adjacent to the square block.
- a temporal merge candidate may also be determined based on a square block. That is, the temporal neighboring block may be determined based on the position, width / height, or size of the square block.
- the merge candidates of the coding block 0 and the coding block 1 may be derived based on a temporal neighboring block determined based on a square block.
- one of the spatial merge candidate and the temporal merge candidate may be derived based on the square block, and the other merge candidate may be derived based on the non- forward block.
- the spatial merge candidate of coding block 0 may be derived based on the forward block
- the temporal merge candidate of coding block 0 may be derived based on coding block 0.
- a plurality of blocks included in a block having a predetermined shape or a predetermined size or more may share a merge candidate.
- a merge candidate For example, in the example illustrated in FIG. 21, at least one of the spatial merge candidate or the temporal merge candidate of the coding block 0 and the coding block 1 may be the same.
- the predetermined form may be non-square, such as 2NxN or Nx2N. If the predetermined shape is non-square, the merge candidate of the current block may be derived based on the non-square block including the current block. For example, when the current block has a 2Nxn type (where n is 1 / 2N), a merge candidate of the current block may be derived based on a 2NxN type non-square block. Alternatively, when the current block is of type nx2N, the merge candidate for the current block may be derived based on an Nx2N type of non-square block.
- Information indicative of a predetermined shape or a predetermined size may be signaled through the bitstream. For example, information indicating either non-square or square may be signaled through the bitstream.
- the predetermined form or the predetermined size may be determined according to a rule defined in the encoder and the decoder.
- the merge candidate of the child node may be derived based on the parent node satisfying the predetermined condition.
- the predetermined condition may include at least one of whether the block is a result of quadtree splitting, the size of the block, the shape of the block, whether the picture is out of the boundary of the picture, or whether the depth difference between the child node and the parent node is greater than or equal to a predetermined value. It may include one.
- the predetermined condition may include whether the block is a block generated as a result of quadtree splitting and whether the block is a square coding block of a predetermined size or more. If the current block is generated by binary tree splitting or triple tree splitting, a merge candidate of the current block may be derived based on an upper node block including the current block and satisfying the predetermined condition. If there is no upper node block that satisfies the predetermined condition, a higher node block including a current block, a block of a predetermined size or more including the current block, or a depth difference from the current block and the current block is 1; As a criterion, a merge candidate of the current block can be derived.
- FIG. 22 illustrates an example of deriving a merge candidate based on an upper node block.
- Blocks 0 and 1 are generated by dividing square blocks based on a binary tree.
- the merge candidates of block 0 and block 1 may be derived based on a neighboring block (ie, at least one of A 0, A 1, A 2, A 3, or A 4) determined based on a higher node block including block 0 and block 1.
- block 0 and block 1 may use the same spatial merge candidate.
- an upper node block including block 2 and block 3 and block 4 may be generated.
- blocks 2 and 3 may be generated by dividing non-square blocks based on the binary tree.
- Merge candidates of blocks 2, 3, and 4 which are in a non-square form may be derived based on higher node blocks including them. That is, a merge candidate based on a neighboring block (eg, at least one of B0, B1, B2, B3, or B4) determined based on the position, width / height, or size of the square block including blocks 2, 3, and 4; Can be derived.
- blocks 2, 3, and 4 may use the same spatial merge candidate.
- a temporal merge candidate for non-square type blows may be derived based on the higher node block.
- temporal merge candidates for block 0 and block 1 may be derived based on a square block including block 0 and block 1.
- Temporal merge candidates for block 2, block 3, and block 4 may be derived based on square blocks including block 2, block 3, and block 4.
- the same temporal merge candidate derived from the temporal neighboring block determined based on the quad tree block unit may be used.
- Lower node blocks included in the upper node block may share at least one of a spatial merge candidate or a temporal merge candidate.
- lower node blocks included in an upper node block may use the same merge candidate list.
- the spatial merge candidate and the temporal merge candidate may be derived based on the lower node block, and the other may be derived based on the higher node block.
- the spatial merge candidates for block 0 and block 1 may be derived based on the higher node block.
- the temporal merge candidate for block 0 may be derived based on block 0
- the temporal merge candidate for block 1 may be derived based on block 1.
- the merge candidate may be derived based on the upper node block including the predetermined number of samples or more.
- the merge candidate may be derived based on the upper node block including the predetermined number of samples or more.
- the merge candidate may be derived based on the upper node block including the predetermined number of samples or more.
- at least one of the lower node blocks generated based on at least one of quad tree splitting, binary tree splitting, or triple tree splitting is smaller than a predetermined size
- at least one of the lower node blocks is a non-forward block.
- the upper node block does not leave the picture boundary, or if the width or height of the upper node block is greater than or equal to a predefined value, or more than a predefined number of samples (eg, 64, 128 or Merge candidates may be derived based on the upper node block in the form of square or non-square.
- a predefined value e.g, 64, 128 or Merge candidates may be derived based on the upper node block in the form of square or non-square.
- Lower node blocks included in the upper node block may share merge candidates derived based on the upper node block.
- the merge candidate may be derived based on any one of the lower node blocks, and the other lower node blocks may be configured to use the merge candidate.
- the lower node blocks may be included in blocks of a predetermined shape or a predetermined size or more.
- the lower node blocks may share a merge candidate list derived based on any one of the lower node blocks.
- Information about a lower node block, which is a derivation criterion of a merge candidate may be signaled through a bitstream.
- the information may be index information indicating any one of the lower node blocks.
- the lower node block serving as the derivation criterion of the merge candidate may be determined based on at least one of the position, size, shape, or scan order of the lower node blocks.
- Information indicating whether the lower node blocks share the merge candidate list derived based on the upper node block may be signaled through the bitstream. Based on the information, it may be determined whether a merge candidate of a non-predetermined block or a block of a size smaller than a predetermined size is derived based on an upper node block including the block. Alternatively, whether to derive a merge candidate based on a higher node block may be determined based on a rule defined in the encoder and the decoder.
- the neighboring block When a neighboring block adjacent to the current block exists in a predefined region, the neighboring block may be determined to be unavailable as a spatial merge candidate.
- the predefined area may be a parallel processing area defined for inter-block parallel processing.
- the parallel processing region may be referred to as a merge estimation region (MER).
- MER merge estimation region
- the neighboring block adjacent to the current block is included in the same merge determination region as the current block, the neighboring block may be determined to be unavailable.
- a shift operation may be performed.
- the current block and the neighboring block are included in the same merge determination region based on whether the shifted position of the upper left reference sample of the current block and the shifted position of the upper left reference sample of the neighboring block are the same. Whether or not it can be determined.
- FIG. 23 is a diagram for describing an example in which availability of spatial neighboring blocks is determined based on a merge determination region.
- the merge determination region is illustrated as having an N ⁇ 2N form.
- Merge candidates of block 1 may be derived based on spatial neighboring blocks neighboring to block 1.
- the spatial neighboring block may include B0, B1, B2, B3, and B4.
- the spatial neighboring blocks B0 and B3 included in the same merge determination region as the block 1 may be determined to be unavailable as merge candidates.
- the merge candidate of block 1 may be derived from at least one of spatial neighboring blocks B1, B2, and B4 except for spatial neighboring blocks B0 and B3.
- the merge candidate of block 3 may derive the merge candidate of block 3 based on the spatial neighboring block neighboring block 3.
- Spatial neighboring blocks may include C0, C1, C2, C3, and C4.
- the spatial neighboring block C0 included in the same merge determination region as that of block 3 may be determined to be unavailable as a merge candidate.
- the merge candidate of block 3 may be derived from at least one of the spatial neighboring blocks C1, C2, C3 and C4 excluding the spatial neighboring block C0.
- a merge candidate of a block included in the merge determination region may be derived based on at least one of the position, size, width, or height of the merge determination region.
- a merge candidate of a plurality of blocks included in the merge determination region may be derived from at least one of a spatial neighboring block or a temporal neighboring block determined based on at least one of the position, size, width, or height of the merge determination region.
- Blocks included in the merge determination region may share the same merge candidate.
- FIG. 24 is a diagram illustrating an example in which a merge candidate is derived based on a merge determination region.
- the merge candidates of the plurality of coding units may be derived based on the merge determination region. That is, the merge determination region may be treated as a coding unit to derive a merge candidate based on the position, size, or width / height of the merge determination region.
- a merge candidate of coding unit 0 (CU0) and coding unit 1 (CU1) having a size of (n / 2) xN (where n is N / 2) included in the merge determination region having a size of (N / 2) xN. May be derived based on the merge determination region. That is, the merge candidates of the coding unit 0 and the coding unit 1 may be derived from at least one of the neighboring blocks C0, C1, C2 C3, or C4 adjacent to the merge determination region.
- the merge candidates of nxn-sized coding unit 2 (CU2), coding unit 3 (CU3), coding unit 4 (CU4), and coding unit 5 (CU5) included in the merge determination region of size NxN are based on the merge determination region. May be induced. That is, the merge candidates of the coding unit 2, the coding unit 3, the coding unit 4, and the coding unit 5 may be derived from at least one of the neighboring blocks C0, C1, C2, C3, or C4 adjacent to the merge determination region.
- the shape of the merge determination region may be square or non-square.
- a square coding unit (or prediction unit) or a non-square coding unit (or prediction unit) may be determined as a merge determination region.
- the width and height ratio of the merge determination region may be limited so as not to exceed the predetermined range.
- the merge determination region may not have a non-square shape having a width and height ratio of more than 2 or a non-square shape having a width and height ratio of less than 1/2. That is, the non-square merge determination region may be in the form of 2NxN or Nx2N.
- Information regarding the width and height ratio limitation may be signaled through the bitstream. Alternatively, the limitation of the width and height ratio may be predefined in the encoder and the decoder.
- At least one of the information indicating the shape of the merge determination region or the information indicating the size of the merge determination region may be signaled through the bitstream.
- at least one of information indicating the shape of the merge determination region or information indicating the size of the merge determination region may be signaled through a slice header, a tile group header, a picture parameter, or a sequence parameter.
- the shape of the merge determination region or the size of the merge determination region may be updated in units of sequence, picture, slice, tile group, tile, or block (CTU).
- CTU block
- the merge determination region may include at least one block. Blocks included in the merge determination region may be square or non-square. The maximum number or the minimum number of blocks that the merge determination region may include may be determined. For example, the merge determination region may include two, three, four, or more CUs. The determination may be determined based on information signaled through the bitstream. Alternatively, the maximum number or the minimum number of blocks that may be included in the merge determination region may be predefined in the encoder and the decoder.
- the parallel processing of the blocks may be allowed.
- the merge candidates of the blocks may be derived based on the merge determination region. Can be. If the number of blocks included in the merge determination region is greater than the maximum number, or if the number of blocks included in the merge determination region is smaller than the minimum number, the merge candidate of each block may be the size, position, width, or It can be derived based on height.
- the information indicating the shape of the merge determination region may include a 1-bit flag.
- the syntax 'isrectagular_mer_flag' may indicate that the merge candidate region is square or non-square.
- a value of 1 isrectagular_mer_flag indicates that the merge determination region is non-square, and a value of 0 isrectagular_mer_flag may indicate that the merge determination region is square.
- information representing at least one of the width, the height, or the width and the height ratio of the merge determination region may be signaled through the bitstream. Based on this, the size and / or shape of the merge determination region may be determined. There may be a plurality of merge determination regions having different sizes in the sequence.
- L0 prediction based on L0 motion information or L1 prediction based on L1 motion information may be performed.
- the L0 motion information includes an L0 reference picture index and / or an L0 motion vector
- the L1 motion information includes an L1 reference picture index and / or an L1 motion vector.
- the L0 reference picture index may be used to specify the L0 reference picture in the L0 reference picture list
- the L1 reference picture index may be used to specify the L1 reference picture in the L1 reference picture list.
- L0 motion information or L1 motion information of the current block may be derived based on a predefined inter prediction mode.
- the inter mode may include at least one of a merge mode, a skip mode, or an AMVP mode.
- additional motion information may be obtained from the merge candidate, and bidirectional prediction may be applied to the current block based on the obtained additional motion information.
- An inter prediction method for performing bidirectional prediction based on additional motion information may be referred to as a multi inter prediction method.
- unidirectional motion information derived based on information signaled from a merge candidate, a motion vector candidate, or a bitstream will be referred to as basic motion information.
- motion information in a direction different from the basic motion information obtained from the merge candidate will be referred to as additional motion information.
- 25 is a diagram illustrating an embodiment of a multiple inter prediction method.
- Basic motion information for the current block may be obtained based on information signaled from the merge candidate, the motion vector candidate, or the bitstream.
- the L0 prediction may be performed based on the obtained L0 basic motion information.
- the L0 prediction may be performed based on the basic motion vector mvL0 in the L0 direction.
- the L1 motion information may be additionally obtained from the merge candidate of the current block, and the L1 prediction may be performed based on the L1 additional motion information.
- the L1 motion vector of the merge candidate may be set as the L1 motion vector of the current block
- the L1 reference picture index of the merge candidate may be set as the L1 reference picture index of the current block. That is, when unidirectional prediction using L0 basic motion information is applied to the current block or when the current block has only basic motion information for the L0 direction, L1 additional motion information may be derived from motion information of the merge candidate. Prediction on the L1 direction may be performed based on the L1 additional motion information. For example, L1 prediction may be performed based on the additional motion vector mvL1 in the L1 direction.
- basic motion information for the current block may be obtained based on information signaled from a merge candidate, a motion vector candidate, or a bitstream.
- L1 prediction may be performed based on the obtained L1 basic motion information.
- the L0 motion information may be additionally obtained from the merge candidate of the current block, and the L0 prediction may be performed based on the L0 additional motion information.
- the L0 motion vector of the merge candidate may be set as the L0 motion vector of the current block
- the L0 reference picture index of the merge candidate may be set as the L0 reference picture index of the current block. That is, when unidirectional prediction using L1 basic motion information is applied to the current block, or when the current block has only basic motion information for the L1 direction, additional L0 motion information may be derived from motion information of the merge candidate.
- bidirectional prediction may be applied to the current block.
- the bidirectional prediction may be performed by performing a weighted sum operation or an average operation of the prediction sample obtained by performing the L1 prediction and the prediction sample obtained as a result of performing the L0 prediction.
- the current block may be divided into two partitions, L1 prediction may be performed on the first partition, and L0 prediction may be performed on the second partition.
- a prediction value of a sample located at the boundary between the first partition and the second partition may be obtained based on a weighted sum operation or an average operation of the prediction sample obtained by the L1 prediction and the prediction sample obtained by the L0 prediction.
- the first partition and the second partition may be rectangular or triangular.
- additional motion information may be obtained from a merge candidate different from the merge candidate. That is, the merge candidate from which the basic motion information is derived may be different from the merge candidate from which the additional motion information is derived.
- a first merge index for specifying a merge candidate used for deriving basic motion information and a second merge index for specifying a merge candidate used for deriving additional motion information may be signaled through the bitstream.
- the second merge index may indicate any one of the remaining merge candidates except the merge candidate indicated by the first merge index.
- a merge candidate having a value obtained by adding 1 to the second merge index may be selected as a merge candidate for deriving additional motion information.
- the merge candidate has bidirectional motion information
- only motion information in a direction different from the basic motion information among bidirectional motion information of the merge candidate may be set as additional motion information.
- the basic motion information is for the L0 direction
- the motion information in the L1 direction of the merge candidate may be set as additional motion information.
- the final motion vector of the current block may be derived by using the base motion vector and the bidirectional motion vector of the merge candidate.
- FIG. 26 illustrates an example in which a multi-inter prediction method is performed when a merge candidate has bidirectional information.
- a merge candidate may be specified in order to obtain the motion information in the L1 direction.
- the L0 motion vector of the current block may be derived using the L0 basic motion information and the L0 motion information of the merge candidate.
- the L0 motion vector of the current block may be derived as shown in Equation 1 or 2 below based on the L0 basic motion vector mvL0 and the L0 additional motion vector mvL2 (or merge_mvL0).
- k is an integer including 0. k may be determined based on at least one of an output order of the reference picture specified by the basic motion information or an output order of the reference picture specified by the motion information of the merge candidate.
- the L0 reference picture of the current block may be determined based on the L0 basic motion information or the L0 motion information of the merge candidate.
- the L0 basic motion information among the L0 basic motion information and the L0 motion information of the merge candidate may be selected and used for L0 prediction, or the L0 motion information of the merge candidate may be selected and used for L0 prediction.
- the selection may be determined based on information signaled through the bitstream. Alternatively, the selection may be performed based on a comparison result of a reference picture index or a comparison result of a motion vector.
- the first prediction is performed on the L0 direction based on the L0 basic motion information
- the second prediction is performed on the L0 direction by using the merge candidate L0 motion information
- the first prediction result and the second prediction are performed.
- the final prediction result for the L0 direction may be derived based on the performance result.
- the L0 prediction image may be obtained based on a weighted sum operation or an average operation of the first L0 prediction image generated as the result of the first prediction and the second L0 prediction image generated as the result of the second prediction.
- Prediction on the L1 direction may be performed based on the L1 motion information of the merge candidate. That is, the L1 motion information of the merge candidate may be set as the L1 additional motion information of the current block. For example, prediction of the L1 direction may be performed based on the merge candidate L1 motion vector mvL1.
- the motion vector of the merge candidate may be scaled based on the scaling factor.
- the L0 motion vector of the merge candidate is scaled
- the L0 motion vector of the current block is derived based on the scaled L0 motion vector
- the second L0 prediction for the L0 direction is performed based on the scaled L0 motion vector.
- the scaling factor may be derived based on at least one of a distance between the L0 reference picture and the current picture or a distance between the reference picture and the current picture specified by the M0 reference picture index of the merge candidate. Scaling may be performed only when the reference picture specified by the L0 basic motion information and the reference picture specified by the M0 motion information of the merge candidate are different.
- one of L0 motion information and L1 motion information may be selected and used. The selection may be performed based on whether the basic motion information relates to the L0 direction or the L1 direction. For example, motion information in a direction opposite to the basic motion information may be selected from among the M0 motion information and the L1 motion information of the merge candidate. Accordingly, when the basic motion information relates to the L0 direction, only L1 motion information of the merge candidate L0 motion information and L1 motion information may be used for the current block. That is, the L0 prediction for the current block may be performed based on the L0 basic motion information, and the L1 prediction for the current block may be performed based on the L1 motion information of the merge candidate.
- the motion information in the same direction as the basic motion information may be selected from the merge candidate L0 motion information and the L1 motion information. Accordingly, when the basic motion information relates to the L0 direction, only L0 motion information of the merge candidate L0 motion information and L1 motion information may be used for the current block. That is, the L0 prediction for the current block may be performed based on the L0 basic motion information and the merge candidate L0 motion information.
- basic motion information is illustrated as being about the L0 direction.
- the above-described embodiments are performed by changing L0 motion information of the merge candidate to L1 motion information of the merge candidate, or changing L1 motion information of the merge candidate by changing the L0 motion information of the merge candidate.
- the L1 motion vector of the current block may be derived using the L1 basic motion information and the merge candidate L1 motion information.
- the L1 motion vector of the current block may be derived as in Equation 3 or 4 based on the L1 basic motion vector mvL1 and the L0 additional motion vector merge_mvL1.
- a merge candidate for deriving additional motion information may be specified based on index information for specifying any one of merge candidates included in the merge candidate list.
- the merge candidate list may include only merge candidates having L0 motion information or L1 motion information.
- merge candidates having L0 motion information or L1 motion information may be extracted, and an index may be reallocated for the extracted merge candidates. have. Whether the merge candidate list is composed of only merge candidates having L0 motion information or only merge candidates having L1 motion information may be determined according to the prediction direction of the basic motion information.
- the merge candidate list may be configured only with merge candidates having L1 (or L0) motion information.
- the merge candidate list may be configured only with merge candidates having L0 (or L1) motion information.
- merge candidates in the merge candidate list may be rearranged based on whether the merge candidates have L0 motion information or L1 motion information.
- merge candidates with L0 motion information are placed in the merge candidate list before merge candidates without L0 motion information, or merge candidates with L1 motion information are merge candidates before merge candidates without L1 motion information. Can be placed in the list.
- Whether to rearrange based on the L0 motion information or to rearrange based on the L1 motion information may be determined according to the prediction direction of the basic motion information. For example, when the basic motion information relates to the L0 direction, merge candidates may be rearranged based on whether the L1 (or L0) motion information is included. On the other hand, when the basic motion information relates to the L1 direction, merge candidates may be rearranged based on whether the L0 (or L1) motion information is included.
- the merge mode is used to acquire additional motion information.
- the additional motion information may be acquired based on the predefined inter prediction mode.
- the inter prediction module may include at least one of a skip mode, a merge mode, or an AMVP mode.
- the inter prediction mode used to derive the additional motion information may be determined based on the inter prediction mode used to derive the basic motion information. For example, additional motion information may be derived by using the same inter prediction mode as the inter prediction mode used to derive the basic motion vector. Alternatively, additional motion information may be derived using an inter prediction mode different from the inter prediction mode used to derive the basic motion vector.
- information for specifying the inter prediction mode used to derive additional motion information may be signaled through the bitstream.
- n direction prediction can be extended to the m direction prediction.
- n and m are integers of 1, 2, 3 or more, and n may be less than m.
- Information indicating whether to extend the number of prediction directions may be encoded and signaled through a bitstream. The information may be signaled at the video sequence, picture parameter, slice header or block level.
- the block level represents a coding block, a prediction block or a transform block.
- each component for example, a unit, a module, etc. constituting the block diagram may be implemented as a hardware device or software, and a plurality of components are combined into one hardware device or software. It may be implemented.
- the above-described embodiments may be implemented in the form of program instructions that may be executed by various computer components, and may be recorded in a computer-readable recording medium.
- the computer-readable recording medium may include program instructions, data files, data structures, etc. alone or in combination.
- Examples of computer readable recording media include magnetic media such as hard disks, floppy disks and magnetic tape, optical recording media such as CD-ROMs, DVDs, and magneto-optical media such as floptical disks. media) and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, flash memory, and the like.
- the hardware device may be configured to operate as one or more software modules to perform the process according to the invention, and vice versa.
- the present invention can be applied to an electronic device capable of encoding / decoding an image.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims (15)
- 현재 블록의 공간적 이웃 블록 또는 시간적 이웃 블록 중 적어도 하나를 기초로 적어도 하나의 머지 후보를 유도하는 단계;상기 머지 후보를 포함하는 머지 후보 리스트를 생성하는 단계;상기 머지 후보 리스트에 포함된 제1 머지 후보로부터 상기 현재 블록의 LX 방향 움직임 정보를 획득하는 단계;상기 제1 머지 후보와 상이한 제2 머지 후보로부터 상기 현재 블록의 L(1-X) 방향 움직임 정보를 획득하는 단계; 및상기 LX 방향 움직임 정보 및 상기 L(1-X) 방향 움직임 정보를 기초로 인터 예측을 수행하는 단계를 포함하는, 영상 복호화 방법.
- 제1 항에 있어서,상기 제2 머지 후보를 특정하기 위한 제2 머지 인덱스가 상기 제1 머지 후보를 특정하기 위한 제1 머지 인덱스보다 작은 경우, 상기 제2 머지 인덱스에 1을 더한 값에 대응하는 머지 후보가 상기 제2 머지 후보로 결정되는 것을 특징으로 하는, 영상 복호화 방법.
- 제1 항에 있어서,상기 제2 머지 후보는 상기 머지 후보 리스트로부터 L(X-1) 움직임 정보를 갖는 머지 후보들만을 추출하여 생성된 추가 머지 후보 리스트로부터 선택되는 것을 특징으로 하는, 영상 복호화 방법.
- 제1 항에 있어서,상기 제2 머지 후보가 양방향 움직임 정보를 갖는 경우, 상기 현재 블록의 LX 방향 예측은 상기 LX 움직임 정보와 상기 제2 머지 후보의 LX 움직임 정보를 기초로 수행되는 것을 특징으로 하는, 영상 복호화 방법.
- 제4 항에 있어서,상기 LX 방향 예측은, 상기 LX 움직임 정보에 기초한 제1 LX 예측 및 상기 제2 머지 후보의 LX 움직임 정보에 기초한 제2 LX 예측을 포함하는, 영상 복호화 방법.
- 제4 항에 있어서,상기 LX 방향 예측은, 상기 LX 움직임 정보에 대한 제1 LX 움직임 벡터 및 상기 제2 머지 후보의 LX 움직임 정보에 대한 제2 LX 움직임 벡터를 기초로 유도되는 제3 LX 움직임 벡터를 기초로 수행되는 것을 특징으로 하는, 영상 복호화 방법.
- 제1 항에 있어서,상기 현재 블록의 제1 파티션에는 상기 LX 움직임 정보에 기초한 인터 예측이 수행되고, 제2 파티션에는 상기 L(1-X) 움직임 정보에 기초한 인터 예측이 수행되는 것을 특징으로 하는, 영상 복호화 방법.
- 현재 블록의 공간적 이웃 블록 또는 시간적 이웃 블록 중 적어도 하나를 기초로 적어도 하나의 머지 후보를 유도하는 단계;상기 머지 후보를 포함하는 머지 후보 리스트를 생성하는 단계;상기 머지 후보 리스트에 포함된 제1 머지 후보로부터 상기 현재 블록의 LX 방향 움직임 정보를 획득하는 단계;상기 제1 머지 후보와 상이한 제2 머지 후보로부터 상기 현재 블록의 L(1-X) 방향 움직임 정보를 획득하는 단계; 및상기 LX 방향 움직임 정보 및 상기 L(1-X) 방향 움직임 정보를 기초로 인터 예측을 수행하는 단계를 포함하는, 영상 부호화 방법.
- 제8 항에 있어서,상기 제2 머지 후보를 특정하기 위한 제2 머지 인덱스가 상기 제1 머지 후보를 특정하기 위한 제1 머지 인덱스보다 작은 경우, 상기 제2 머지 인덱스에 1을 더한 값에 대응하는 머지 후보가 상기 제2 머지 후보로 결정되는 것을 특징으로 하는, 영상 부호화 방법.
- 제8 항에 있어서,상기 제2 머지 후보는 상기 머지 후보 리스트로부터 L(X-1) 움직임 정보를 갖는 머지 후보들만을 추출하여 생성된 추가 머지 후보 리스트로부터 선택되는 것을 특징으로 하는, 영상 부호화 방법.
- 제8 항에 있어서,상기 제2 머지 후보가 양방향 움직임 정보를 갖는 경우, 상기 현재 블록의 LX 방향 예측은 상기 LX 움직임 정보와 상기 제2 머지 후보의 LX 움직임 정보를 기초로 수행되는 것을 특징으로 하는, 영상 부호화 방법.
- 제11 항에 있어서,상기 LX 방향 예측은, 상기 LX 움직임 정보에 기초한 제1 LX 예측 및 상기 제2 머지 후보의 LX 움직임 정보에 기초한 제2 LX 예측을 포함하는, 영상 부호화 방법.
- 제11 항에 있어서,상기 LX 방향 예측은, 상기 LX 움직임 정보에 대한 제1 LX 움직임 벡터 및 상기 제2 머지 후보의 LX 움직임 정보에 대한 제2 LX 움직임 벡터를 기초로 유도되는 제3 LX 움직임 벡터를 기초로 수행되는 것을 특징으로 하는, 영상 부호화 방법.
- 제8 항에 있어서,상기 현재 블록의 제1 파티션에는 상기 LX 움직임 정보에 기초한 인터 예측이 수행되고, 제2 파티션에는 상기 L(1-X) 움직임 정보에 기초한 인터 예측이 수행되는 것을 특징으로 하는, 영상 부호화 방법.
- 현재 블록의 공간적 이웃 블록 또는 시간적 이웃 블록 중 적어도 하나를 기초로 적어도 하나의 머지 후보를 유도하고, 상기 머지 후보를 포함하는 머지 후보 리스트를 생성하고, 상기 머지 후보 리스트에 포함된 제1 머지 후보로부터 상기 현재 블록의 LX 방향 움직임 정보를 획득하고, 상기 제1 머지 후보와 상이한 제2 머지 후보로부터 상기 현재 블록의 L(1-X) 방향 움직임 정보를 획득하고, 상기 LX 방향 움직임 정보 및 상기 L(1-X) 방향 움직임 정보를 기초로 인터 예측을 수행하는 인터 예측부를 포함하는 영상 복호화 장치.
Priority Applications (13)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202411175169.0A CN118828015A (zh) | 2018-06-29 | 2019-06-28 | 解码图像和编码图像的方法以及用于传送压缩视频数据的设备 |
CN202411175098.4A CN118828014A (zh) | 2018-06-29 | 2019-06-28 | 解码图像和编码图像的方法以及用于传送压缩视频数据的设备 |
GB2018086.5A GB2587984B (en) | 2018-06-29 | 2019-06-28 | Method and apparatus for processing a video signal |
US17/057,347 US11394959B2 (en) | 2018-06-29 | 2019-06-28 | Method and apparatus for processing video signal |
CN202411175045.2A CN118803268A (zh) | 2018-06-29 | 2019-06-28 | 解码图像和编码图像的方法以及用于传送压缩视频数据的设备 |
CN202411175203.4A CN118803270A (zh) | 2018-06-29 | 2019-06-28 | 解码图像和编码图像的方法以及用于传送压缩视频数据的设备 |
MX2020012663A MX2020012663A (es) | 2018-06-29 | 2019-06-28 | Método y aparato para procesar una señal de video. |
CA3100986A CA3100986A1 (en) | 2018-06-29 | 2019-06-28 | Method and apparatus for processing a video signal |
CN202411175134.7A CN118803269A (zh) | 2018-06-29 | 2019-06-28 | 解码图像和编码图像的方法以及用于传送压缩视频数据的设备 |
CN201980035235.5A CN112204982B (zh) | 2018-06-29 | 2019-06-28 | 用于处理视频信号的方法和设备 |
CN202411175008.1A CN118870022A (zh) | 2018-06-29 | 2019-06-28 | 解码视频和编码视频的方法以及用于传送压缩视频数据的设备 |
US17/836,522 US12010294B2 (en) | 2018-06-29 | 2022-06-09 | Method and apparatus for processing video signal |
US18/604,929 US20240223751A1 (en) | 2018-06-29 | 2024-03-14 | Method and apparatus for processing video signal |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR10-2018-0075988 | 2018-06-29 | ||
KR20180075988 | 2018-06-29 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/057,347 A-371-Of-International US11394959B2 (en) | 2018-06-29 | 2019-06-28 | Method and apparatus for processing video signal |
US17/836,522 Division US12010294B2 (en) | 2018-06-29 | 2022-06-09 | Method and apparatus for processing video signal |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020005007A1 true WO2020005007A1 (ko) | 2020-01-02 |
Family
ID=68987470
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2019/007881 WO2020005007A1 (ko) | 2018-06-29 | 2019-06-28 | 비디오 신호 처리 방법 및 장치 |
Country Status (6)
Country | Link |
---|---|
US (3) | US11394959B2 (ko) |
CN (7) | CN118828015A (ko) |
CA (1) | CA3100986A1 (ko) |
GB (1) | GB2587984B (ko) |
MX (2) | MX2020012663A (ko) |
WO (1) | WO2020005007A1 (ko) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11671619B2 (en) * | 2018-07-02 | 2023-06-06 | Intellectual Discovery Co., Ltd. | Video coding method and device using merge candidate |
CN110809161B (zh) * | 2019-03-11 | 2020-12-29 | 杭州海康威视数字技术股份有限公司 | 运动信息候选者列表构建方法及装置 |
US11616966B2 (en) * | 2019-04-03 | 2023-03-28 | Mediatek Inc. | Interaction between core transform and secondary transform |
US11375222B2 (en) * | 2019-09-22 | 2022-06-28 | Tencent America LLC | Method and device for video encoding and decoding with interpolation filter flag being consistent with MMVD distances |
US20220405263A1 (en) * | 2021-06-21 | 2022-12-22 | International Business Machines Corporation | Increasing Index Availability in Databases |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20130095295A (ko) * | 2010-11-23 | 2013-08-27 | 미디어텍 인크. | 공간 움직임 벡터 예측 방법 및 장치 |
WO2017003063A1 (ko) * | 2015-06-28 | 2017-01-05 | 엘지전자(주) | 인터 예측 모드 기반 영상 처리 방법 및 이를 위한 장치 |
KR20170073681A (ko) * | 2014-11-18 | 2017-06-28 | 미디어텍 인크. | 단방향 예측 및 병합 후보로부터의 모션 벡터에 기초한 양방향 예측 비디오 코딩 방법 |
WO2017188509A1 (ko) * | 2016-04-28 | 2017-11-02 | 엘지전자(주) | 인터 예측 모드 기반 영상 처리 방법 및 이를 위한 장치 |
Family Cites Families (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9124898B2 (en) | 2010-07-12 | 2015-09-01 | Mediatek Inc. | Method and apparatus of temporal motion vector prediction |
US8711940B2 (en) | 2010-11-29 | 2014-04-29 | Mediatek Inc. | Method and apparatus of motion vector prediction with extended motion vector predictor |
US9137544B2 (en) | 2010-11-29 | 2015-09-15 | Mediatek Inc. | Method and apparatus for derivation of mv/mvp candidate for inter/skip/merge modes |
SG189843A1 (en) | 2011-01-19 | 2013-06-28 | Mediatek Inc | Method and apparatus for parsing error robustness of temporal motion vector prediction |
US8755437B2 (en) | 2011-03-17 | 2014-06-17 | Mediatek Inc. | Method and apparatus for derivation of spatial motion vector candidate and motion vector prediction candidate |
EP3139611A1 (en) | 2011-03-14 | 2017-03-08 | HFI Innovation Inc. | Method and apparatus for deriving temporal motion vector prediction |
EP2687014B1 (en) | 2011-03-14 | 2021-03-10 | HFI Innovation Inc. | Method and apparatus for derivation of motion vector candidate and motion vector prediction candidate |
US9485518B2 (en) * | 2011-05-27 | 2016-11-01 | Sun Patent Trust | Decoding method and apparatus with candidate motion vectors |
PL4007276T3 (pl) * | 2011-05-27 | 2023-12-11 | Sun Patent Trust | Sposób kodowania obrazów, urządzenie do kodowania obrazów, sposób dekodowania obrazów, urządzenie do dekodowania obrazów, i urządzenie do kodowania i dekodowania obrazów |
US9282338B2 (en) | 2011-06-20 | 2016-03-08 | Qualcomm Incorporated | Unified merge mode and adaptive motion vector prediction mode candidates selection |
KR20140043242A (ko) * | 2011-06-30 | 2014-04-08 | 가부시키가이샤 제이브이씨 켄우드 | 화상 부호화 장치, 화상 부호화 방법, 화상 부호화 프로그램, 화상 복호 장치, 화상 복호 방법 및 화상 복호 프로그램 |
WO2013009104A2 (ko) * | 2011-07-12 | 2013-01-17 | 한국전자통신연구원 | 인터 예측 방법 및 그 장치 |
EP4009641B1 (en) * | 2011-09-09 | 2023-08-09 | LG Electronics Inc. | Picture decoding method, picture encoding method, method for transmitting data for picture information and computer-readable storage medium storing bitstream including encoded picture information |
US9736489B2 (en) | 2011-09-17 | 2017-08-15 | Qualcomm Incorporated | Motion vector determination for video coding |
CN110446039B (zh) | 2011-11-08 | 2022-01-11 | 韩国电子通信研究院 | 用于共享候选者列表的方法和装置 |
CN107682704B (zh) | 2011-12-23 | 2020-04-17 | 韩国电子通信研究院 | 图像解码方法、图像编码方法和记录介质 |
RU2629359C1 (ru) | 2013-01-18 | 2017-08-29 | ДжейВиСи КЕНВУД КОРПОРЕЙШН | Устройство и способ декодирования движущегося изображения, долговременный считываемый компьютером носитель записи для хранения программы декодирования движущегося изображения |
WO2015142057A1 (ko) * | 2014-03-21 | 2015-09-24 | 주식회사 케이티 | 다시점 비디오 신호 처리 방법 및 장치 |
KR102378459B1 (ko) * | 2014-06-30 | 2022-03-24 | 한국전자통신연구원 | 움직임 병합 모드에서 시점 합성 예측 후보의 중복성 제거 장치 및 방법 |
KR20170058838A (ko) * | 2015-11-19 | 2017-05-29 | 한국전자통신연구원 | 화면간 예측 향상을 위한 부호화/복호화 방법 및 장치 |
CN108293131B (zh) | 2015-11-20 | 2021-08-31 | 联发科技股份有限公司 | 基于优先级运动矢量预测子推导的方法及装置 |
KR20180136967A (ko) * | 2016-04-22 | 2018-12-26 | 엘지전자 주식회사 | 인터 예측 모드 기반 영상 처리 방법 및 이를 위한 장치 |
US10560718B2 (en) | 2016-05-13 | 2020-02-11 | Qualcomm Incorporated | Merge candidates for motion vector prediction for video coding |
CN114513657B (zh) * | 2016-07-05 | 2024-06-04 | 株式会社Kt | 对视频进行解码的方法和设备以及对视频进行编码的方法 |
EP3496400A4 (en) | 2016-08-03 | 2020-02-19 | KT Corporation | VIDEO SIGNAL PROCESSING METHOD AND DEVICE |
KR20190110041A (ko) * | 2018-03-19 | 2019-09-27 | 주식회사 케이티 | 비디오 신호 처리 방법 및 장치 |
KR20200034639A (ko) * | 2018-09-21 | 2020-03-31 | 한국전자통신연구원 | 영상 부호화/복호화 방법, 장치 및 비트스트림을 저장한 기록 매체 |
CN113170130A (zh) * | 2019-05-02 | 2021-07-23 | 株式会社 Xris | 图像信号编码/解码方法及其装置 |
-
2019
- 2019-06-28 CN CN202411175169.0A patent/CN118828015A/zh active Pending
- 2019-06-28 US US17/057,347 patent/US11394959B2/en active Active
- 2019-06-28 WO PCT/KR2019/007881 patent/WO2020005007A1/ko active Application Filing
- 2019-06-28 CN CN202411175008.1A patent/CN118870022A/zh active Pending
- 2019-06-28 CN CN202411175134.7A patent/CN118803269A/zh active Pending
- 2019-06-28 CN CN202411175045.2A patent/CN118803268A/zh active Pending
- 2019-06-28 CN CN202411175098.4A patent/CN118828014A/zh active Pending
- 2019-06-28 CN CN202411175203.4A patent/CN118803270A/zh active Pending
- 2019-06-28 CA CA3100986A patent/CA3100986A1/en active Pending
- 2019-06-28 GB GB2018086.5A patent/GB2587984B/en active Active
- 2019-06-28 CN CN201980035235.5A patent/CN112204982B/zh active Active
- 2019-06-28 MX MX2020012663A patent/MX2020012663A/es unknown
-
2020
- 2020-11-24 MX MX2024005877A patent/MX2024005877A/es unknown
-
2022
- 2022-06-09 US US17/836,522 patent/US12010294B2/en active Active
-
2024
- 2024-03-14 US US18/604,929 patent/US20240223751A1/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20130095295A (ko) * | 2010-11-23 | 2013-08-27 | 미디어텍 인크. | 공간 움직임 벡터 예측 방법 및 장치 |
KR20170073681A (ko) * | 2014-11-18 | 2017-06-28 | 미디어텍 인크. | 단방향 예측 및 병합 후보로부터의 모션 벡터에 기초한 양방향 예측 비디오 코딩 방법 |
WO2017003063A1 (ko) * | 2015-06-28 | 2017-01-05 | 엘지전자(주) | 인터 예측 모드 기반 영상 처리 방법 및 이를 위한 장치 |
WO2017188509A1 (ko) * | 2016-04-28 | 2017-11-02 | 엘지전자(주) | 인터 예측 모드 기반 영상 처리 방법 및 이를 위한 장치 |
Non-Patent Citations (1)
Title |
---|
JICHENG AN: "Enhanced Merge Mode based on JEM7.0", JOINT VIDEO EXPERTS TEAM (JVET) OF ITU-T SG 16 WP 3, 20 April 2018 (2018-04-20), San Diego, US, pages 1 - 14 * |
Also Published As
Publication number | Publication date |
---|---|
GB2587984A (en) | 2021-04-14 |
MX2024005877A (es) | 2024-05-29 |
CN112204982B (zh) | 2024-09-17 |
MX2020012663A (es) | 2021-02-09 |
CN118803269A (zh) | 2024-10-18 |
GB202018086D0 (en) | 2020-12-30 |
CN118828014A (zh) | 2024-10-22 |
US20240223751A1 (en) | 2024-07-04 |
CN118828015A (zh) | 2024-10-22 |
CA3100986A1 (en) | 2020-01-02 |
US20220321876A1 (en) | 2022-10-06 |
US11394959B2 (en) | 2022-07-19 |
CN112204982A (zh) | 2021-01-08 |
US20210218955A1 (en) | 2021-07-15 |
CN118803270A (zh) | 2024-10-18 |
CN118803268A (zh) | 2024-10-18 |
GB2587984B (en) | 2022-12-14 |
CN118870022A (zh) | 2024-10-29 |
US12010294B2 (en) | 2024-06-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019225993A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018155986A2 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2017222326A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018026219A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2017176030A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018097626A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2017204532A1 (ko) | 영상 부호화/복호화 방법 및 이를 위한 기록 매체 | |
WO2017171370A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018026118A1 (ko) | 영상 부호화/복호화 방법 | |
WO2018008906A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019182295A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019190199A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019050292A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2020050685A1 (ko) | 인트라 예측을 이용한 영상 부호화/복호화 방법 및 장치 | |
WO2019182292A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018212579A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019190201A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2020096427A1 (ko) | 영상 신호 부호화/복호화 방법 및 이를 위한 장치 | |
WO2018056701A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2020096428A1 (ko) | 영상 신호 부호화/복호화 방법 및 이를 위한 장치 | |
WO2019066524A1 (ko) | 영상 부호화/복호화 방법, 장치 및 비트스트림을 저장한 기록 매체 | |
WO2020005007A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2018066958A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019225994A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
WO2019235893A1 (ko) | 비디오 신호 처리 방법 및 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19824457 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 202018086 Country of ref document: GB Kind code of ref document: A Free format text: PCT FILING DATE = 20190628 |
|
ENP | Entry into the national phase |
Ref document number: 3100986 Country of ref document: CA |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19824457 Country of ref document: EP Kind code of ref document: A1 |