US20120213293A1 - Multi-metric filtering - Google Patents
- Publication number
- US20120213293A1 (US 13/401,685)
- Authority
- US
- United States
- Prior art keywords
- pixels
- filter
- block
- metric
- video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/14—Coding unit complexity, e.g. amount of activity or edge presence estimation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/182—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
- H04N19/82—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/90—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
- H04N19/96—Tree coding, e.g. quad-tree coding
Definitions
- This disclosure relates to block-based digital video coding used to compress video data and, more particularly, to techniques for the filtering of video blocks.
- Digital video capabilities can be incorporated into a wide range of devices, including digital televisions, digital direct broadcast systems, wireless communication devices such as radio telephone handsets, wireless broadcast systems, personal digital assistants (PDAs), laptop computers, desktop computers, tablet computers, digital cameras, digital recording devices, video gaming devices, video game consoles, and the like.
- Digital video devices implement video compression techniques, such as MPEG-2, MPEG-4, or ITU-T H.264/MPEG-4, Part 10, Advanced Video Coding (AVC), to transmit and receive digital video more efficiently.
- Video compression techniques perform spatial and temporal prediction to reduce or remove redundancy inherent in video sequences.
- HEVC: High Efficiency Video Coding
- JCTVC: Joint Collaborative Team on Video Coding
- Block-based video compression techniques may perform spatial prediction and/or temporal prediction.
- Intra-coding relies on spatial prediction to reduce or remove spatial redundancy between video blocks within a given unit of coded video, which may comprise a video frame, a slice of a video frame, or the like.
- Inter-coding relies on temporal prediction to reduce or remove temporal redundancy between video blocks of successive coding units of a video sequence.
- For intra-coding, a video encoder performs spatial prediction to compress data based on other data within the same unit of coded video.
- For inter-coding, the video encoder performs motion estimation and motion compensation to track the movement of corresponding video blocks of two or more adjacent units of coded video.
- a coded video block may be represented by prediction information that can be used to create or identify a predictive block, and a residual block of data indicative of differences between the block being coded and the predictive block.
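As a minimal, non-limiting sketch of the relationship between a block being coded, its predictive block, and the residual block, assume that blocks are plain lists of rows of integer samples. The function names are hypothetical and are used only for illustration.

```python
def residual_block(current, predictive):
    """Per-pixel difference between the block being coded and its prediction."""
    return [[c - p for c, p in zip(crow, prow)]
            for crow, prow in zip(current, predictive)]


def reconstruct_block(predictive, residual):
    """Inverse step: add the residual back onto the predictive block."""
    return [[p + r for p, r in zip(prow, rrow)]
            for prow, rrow in zip(predictive, residual)]


current = [[52, 55], [61, 59]]      # block being coded (illustrative values)
predictive = [[50, 54], [60, 60]]   # block identified by the prediction information
residual = residual_block(current, predictive)
print(residual)                                  # [[2, 1], [1, -1]]
print(reconstruct_block(predictive, residual))   # equals `current`
```

In this lossless sketch the reconstruction is exact; in practice the residual is typically transformed and quantized, so the reconstructed block may differ from the original.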
- In the case of inter-coding, one or more motion vectors are used to identify the predictive block of data from a previous or subsequent coding unit.
- In the case of intra-coding, the prediction mode can be used to generate the predictive block based on data within the CU associated with the video block being coded.
- Both intra-coding and inter-coding may define several different prediction modes, which may define different block sizes and/or prediction techniques used in the coding. Additional types of syntax elements may also be included as part of encoded video data in order to control or define the coding techniques or parameters used in the coding process.
- the video encoder may apply transform, quantization and entropy coding processes to further reduce the bit rate associated with communication of a residual block.
- Transform techniques may comprise discrete cosine transforms (DCTs) or conceptually similar processes, such as wavelet transforms, integer transforms, or other types of transforms.
- the transform process converts a set of pixel difference values into transform coefficients, which may represent the energy of the pixel values in the frequency domain.
- Quantization is applied to the transform coefficients, and generally involves a process that limits the number of bits associated with any given transform coefficient.
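One way to picture how quantization limits the number of bits per coefficient is uniform scalar quantization, sketched below. The step size of 10 and the coefficient values are assumptions chosen only for illustration; the passage above does not specify a particular quantizer.

```python
def quantize(coeffs, step):
    """Map each transform coefficient to a small integer level (lossy)."""
    return [int(round(c / step)) for c in coeffs]


def dequantize(levels, step):
    """Approximate the original coefficients from the integer levels."""
    return [level * step for level in levels]


coeffs = [103.0, -47.0, 12.0, 3.0]   # illustrative transform coefficients
levels = quantize(coeffs, step=10)
print(levels)                        # [10, -5, 1, 0]
print(dequantize(levels, step=10))   # [100, -50, 10, 0]
```

A larger step yields smaller levels, which need fewer bits, at the cost of a larger reconstruction error.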
- Entropy coding comprises one or more processes that collectively compress a sequence of quantized transform coefficients.
- Filtering of video blocks may be applied as part of the encoding and decoding loops, or as part of a post-filtering process on reconstructed video blocks. Filtering is commonly used, for example, to reduce blockiness or other artifacts common to block-based video coding. Filter coefficients (sometimes called filter taps) may be defined or selected in order to promote desirable levels of video block filtering that can reduce blockiness and/or improve the video quality in other ways.
- a set of filter coefficients may define how filtering is applied along edges of video blocks or other locations within video blocks. Different filter coefficients may cause different levels of filtering with respect to different pixels of the video blocks. Filtering, for example, may smooth or sharpen differences in intensity of adjacent pixel values in order to help eliminate unwanted artifacts.
- This disclosure describes techniques associated with filtering of video data in a video encoding and/or video decoding process.
- filtering is applied at an encoder, and filter information is encoded in the bitstream to enable a decoder to identify the filtering that was applied at the encoder.
- the decoder receives encoded video data that includes the filter information, decodes the video data, and applies filtering based on the filtering information. In this way, the decoder applies the same filtering that was applied at the encoder.
- an encoder may select one or more sets of filters, and on a coded-unit-by-coded-unit basis, the encoder may determine whether or not to apply filtering.
- The encoder can perform filtering on a pixel-by-pixel or group-by-group basis, where a group might, for example, be a 2×2 block of pixels or a 4×4 block of pixels.
- a method of video coding includes determining a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; based on the first metric, determining a filter for the block of pixels; and, generating a filtered image by applying the filter to the block of pixels.
- a video coding device includes a filter unit configured to determine a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block, determine a filter for the block of pixels based on the first metric, and generate a filtered image by applying the filter to the block of pixels; and a memory configured to store a filtered result of the filter unit.
- a video coding apparatus includes means for determining a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; means for determining a filter for the block of pixels based on the first metric; and, means for generating a filtered image by applying the filter to the block of pixels.
- a computer-readable storage medium stores instructions that when executed cause one or more processors to determine a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; determine a filter for the block of pixels based on the first metric; and, generate a filtered image by applying the filter to the block of pixels.
- FIG. 1 is a block diagram illustrating an exemplary video encoding and decoding system.
- FIGS. 2A and 2B are conceptual diagrams illustrating an example of quadtree partitioning applied to a largest coding unit (LCU).
- FIGS. 2C and 2D are conceptual diagrams illustrating an example of a filter map for a series of video blocks corresponding to the example quadtree partitioning of FIGS. 2A and 2B .
- FIG. 3 is a block diagram illustrating an exemplary video encoder consistent with this disclosure.
- FIG. 4A is a conceptual diagram illustrating a mapping of ranges for two metrics to filters.
- FIG. 4B is a conceptual diagram illustrating a mapping of ranges for an activity metric and a direction metric to filters.
- FIG. 5 is a block diagram illustrating an exemplary video decoder consistent with this disclosure.
- FIG. 7 is a flow diagram illustrating coding techniques consistent with this disclosure.
- FIGS. 8A and 8B are flow diagrams illustrating coding techniques consistent with this disclosure.
- FIGS. 9A and 9B are flow diagrams illustrating coding techniques consistent with this disclosure.
- FIG. 10 is a flow diagram illustrating coding techniques consistent with this disclosure.
- FIG. 11 is a flow diagram illustrating coding techniques consistent with this disclosure.
- video data can be coded in units referred to as coded units (CUs).
- CUs can be partitioned into smaller CUs, or sub-units, using a quadtree partitioning scheme.
- Syntax identifying the quadtree partitioning scheme for a particular CU can be transmitted from an encoder to a decoder.
- Multiple inputs associated with each sub-unit of a given CU can be filtered during the process of decoding and reconstructing the encoded video data.
- filter description syntax can describe a set of filters, such as how many filters are in the set or what shape the filters take. Additional syntax in the bitstream received by the decoder can identify the filters (i.e.
- The filter used for a particular input can be selected based on two or more metrics, where certain combinations of values for the two or more metrics are indexed to specific filters within a set of filters. In other instances, two or more metrics may be combined to form a single metric.
- the mapping of filters to metrics can also be signaled in the bitstream
- Different types of filtering may be applied to pixels or blocks of pixels based on two or more metrics determined for the video data.
- the filter used for a particular pixel can be selected based on two or more metrics, such as some combination of an activity metric and a direction metric.
- An activity metric may quantify activity associated with one or more blocks of pixels within the video data.
- the activity metric may comprise a variance metric indicative of pixel variance within a set of pixels.
- An activity metric may be either direction-specific or non-direction-specific.
- a non-direction-specific activity metric may include a sum-modified Laplacian value, as explained in greater detail below.
- direction-specific activity metrics include a horizontal activity metric, a vertical activity metric, a 45-degree activity metric, and a 135-degree activity metric.
- For a block of pixels, a direction metric may quantify any of the horizontal activity, vertical activity, or diagonal activity of a pixel or group of pixels, or a direction metric may include a comparison of horizontal activity, vertical activity, and/or diagonal activity. Horizontal activity generally refers to changes in pixel values in a horizontal direction, vertical activity generally refers to changes in pixel values in a vertical direction, and diagonal activity generally refers to changes in pixel values in a diagonal direction.
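The following sketch shows one possible way to compute an activity metric and a direction metric of the kind described above. The second-difference formulas, the 3×3 window, and the comparison factor `k` are assumptions made for illustration; the passage above does not fix these details.

```python
def horizontal_activity(img, i, j):
    """Change in pixel value along the row through (i, j)."""
    return abs(2 * img[i][j] - img[i][j - 1] - img[i][j + 1])


def vertical_activity(img, i, j):
    """Change in pixel value along the column through (i, j)."""
    return abs(2 * img[i][j] - img[i - 1][j] - img[i + 1][j])


def sum_modified_laplacian(img, i, j, radius=1):
    """Non-direction-specific activity: sum of both activities over a window."""
    total = 0
    for y in range(i - radius, i + radius + 1):
        for x in range(j - radius, j + radius + 1):
            total += horizontal_activity(img, y, x) + vertical_activity(img, y, x)
    return total


def direction(img, i, j, k=2):
    """Compare activities: 1 = mostly horizontal, 2 = mostly vertical, 0 = neither."""
    h = horizontal_activity(img, i, j)
    v = vertical_activity(img, i, j)
    if h > k * v:
        return 1
    if v > k * h:
        return 2
    return 0


# Vertical stripes: values change only in the horizontal direction.
img = [[10, 50, 10, 50, 10] for _ in range(5)]
print(direction(img, 2, 2))               # 1
print(sum_modified_laplacian(img, 2, 2))  # 720
```

Diagonal (45-degree and 135-degree) activities could be sketched the same way by differencing along the diagonals.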
- A subset of pixels within the block may be used to reduce encoding and decoding complexity. For example, when determining a filter for a 4×4 block of pixels, it may not be necessary to use all sixteen pixels of the 4×4 block.
- the subset of pixels from within a current block being coded can be selected such that the metrics are calculated only using pixel values of the current block and not pixel values of neighboring blocks. For instance, the metric for a pixel being evaluated might be calculated based on comparing the pixel to nearby pixels. In some instances, one or more of the nearby pixels for the pixel being evaluated might be in a different block than the pixel being evaluated.
- the subset of pixels can be selected to include pixels that do not have nearby pixels in neighboring blocks. Additionally or alternatively, the subset of pixels may include pixels that have nearby pixels in neighboring blocks, but those nearby pixels in neighboring blocks may not be used when determining the metric. By basing the determination of a particular metric on pixels within a current block and not on pixels of neighboring blocks, the need for buffers at the encoder and/or decoder may, in some instances, be reduced or even eliminated.
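As a hedged illustration of restricting a metric to pixels whose nearby pixels all lie inside the current block, the sketch below uses only the four interior pixels of a 4×4 block. The Laplacian-style activity measure and the function names are assumptions for illustration only.

```python
def interior_positions(size=4):
    """Positions whose left, right, upper and lower neighbors are inside the block."""
    return [(r, c) for r in range(1, size - 1) for c in range(1, size - 1)]


def block_activity(block):
    """Activity computed from the block alone; no neighboring-block pixels are read."""
    total = 0
    for r, c in interior_positions(len(block)):
        total += abs(2 * block[r][c] - block[r][c - 1] - block[r][c + 1])
        total += abs(2 * block[r][c] - block[r - 1][c] - block[r + 1][c])
    return total


block = [
    [0, 0, 0, 0],
    [0, 8, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
print(interior_positions())   # [(1, 1), (1, 2), (2, 1), (2, 2)]
print(block_activity(block))  # 48
```

Because every pixel read by `block_activity` belongs to the current block, no line buffer holding rows of neighboring blocks is needed in this sketch.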
- the subset of pixels from within a current block being coded can be selected such that the metrics are calculated only using pixel values of the current block and left and right neighboring blocks but not pixel values of upper neighboring blocks or lower neighboring blocks.
- line buffers for upper and lower neighboring blocks tend to need to store far more pixel values than line buffers for storing pixel values of left and right neighboring blocks.
- a filter unit, such as an adaptive in-loop filter
- the multiple filters may be used in conjunction with a single input or multiple inputs.
- the multiple inputs described in this disclosure generally refer to intermediate video block data or image data that is produced during the encoding and decoding processes.
- Multiple inputs associated with a given video block can include, for example, a reconstructed block or image (RI), a pre-deblocked reconstructed block or image (pRI), a prediction block or image (PI), and/or a quantized prediction error image (EI).
- a filter may only be applied to one of the inputs above, such as RI.
- the filtering techniques of this disclosure can be applied to CUs of various sizes using a quadtree partitioning scheme.
- Video coding performance, as measured by one or both of compression rate and reconstructed video quality, might be improved.
- an encoder maintains, by generating, updating, storing, or other means, a mapping of combinations of ranges to filters.
- the combination of a first range for a first metric and a first range for a second metric may map to a first filter.
- the combination of the first range for the first metric and a second range for the second metric may also map to the first filter or may map to a second filter. If a first metric has eight ranges and a second metric has four ranges, for example, then the first and second metric can have thirty-two combinations of ranges, and each of the thirty-two combinations can be mapped to a filter. Each combination, however, is not necessarily mapped to a unique filter. Thus, the thirty-two combinations might map to four filters, eight filters, ten filters, or some other number of filters. In order to apply the same filters as an encoder, a decoder may also maintain the same mappings of range combinations to filters.
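The eight-by-four example above can be sketched as a lookup table. The threshold values and the particular assignment of filter IDs are hypothetical; only the structure (thirty-two range combinations mapped to a smaller number of filters) follows the text.

```python
import bisect

# Hypothetical range boundaries: 7 thresholds give 8 ranges, 3 thresholds give 4.
METRIC1_THRESHOLDS = [10, 20, 40, 80, 160, 320, 640]
METRIC2_THRESHOLDS = [1, 2, 3]

# 8 x 4 table of filter IDs; here the 32 combinations share only 4 filters.
FILTER_MAP = [[(r1 // 4) * 2 + (r2 // 2) for r2 in range(4)] for r1 in range(8)]


def range_index(value, thresholds):
    """Index of the range that contains `value`."""
    return bisect.bisect_right(thresholds, value)


def select_filter(metric1, metric2):
    """Look up the filter ID for the combination of the two metric ranges."""
    r1 = range_index(metric1, METRIC1_THRESHOLDS)
    r2 = range_index(metric2, METRIC2_THRESHOLDS)
    return FILTER_MAP[r1][r2]


combinations = len(FILTER_MAP) * len(FILTER_MAP[0])
distinct_filters = {fid for row in FILTER_MAP for fid in row}
print(combinations, len(distinct_filters))  # 32 4
print(select_filter(5, 0))                  # 0
print(select_filter(100, 3))                # 3
```

A decoder holding the same thresholds and the same table would select the same filter for the same metric values.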
- This disclosure describes techniques for signaling from an encoder to a decoder, in an encoded bitstream, a mapping of range combinations to filters.
- the mapping may, for example, associate each range combination with a filter identification (ID).
- One simple way to signal this mapping is to use one codeword for each filter ID, and then for each combination of ranges, send the codeword of the corresponding filter ID. This technique, however, is typically inefficient.
- Techniques of the present disclosure may exploit correlations within the mapping by using differential coding methods. Combinations of ranges that share a common range sometimes use the same filter.
- The combination of a first range for a first metric and a first range for a second metric, and the combination of the first range for the first metric and a second range for the second metric, share a common range (the first range of the first metric).
- these two combinations might, in some instances, map to the same filter ID.
- the techniques of this disclosure may reduce the number of bits needed to signal the mapping of range combinations to filter IDs from an encoder to a decoder.
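To illustrate how correlations between range combinations that share a common range might be exploited, the sketch below replaces an explicit filter ID with a short "same as neighbor" symbol whenever possible. The symbol names and the scan order are hypothetical and do not represent the bitstream syntax of this disclosure.

```python
def encode_map(table):
    """Emit 'same_left'/'same_up' when a neighbor sharing a range has the same ID."""
    symbols = []
    for r, row in enumerate(table):
        for c, fid in enumerate(row):
            if c > 0 and row[c - 1] == fid:
                symbols.append(("same_left",))   # shares the first-metric range
            elif r > 0 and table[r - 1][c] == fid:
                symbols.append(("same_up",))     # shares the second-metric range
            else:
                symbols.append(("id", fid))      # explicit filter ID
    return symbols


def decode_map(symbols, rows, cols):
    """Rebuild the table of filter IDs from the symbol stream."""
    table = [[None] * cols for _ in range(rows)]
    stream = iter(symbols)
    for r in range(rows):
        for c in range(cols):
            sym = next(stream)
            if sym[0] == "same_left":
                table[r][c] = table[r][c - 1]
            elif sym[0] == "same_up":
                table[r][c] = table[r - 1][c]
            else:
                table[r][c] = sym[1]
    return table


table = [[0, 0, 1, 1], [0, 0, 1, 2], [3, 3, 3, 2]]
symbols = encode_map(table)
explicit = sum(1 for s in symbols if s[0] == "id")
print(explicit, "explicit IDs for", len(symbols), "combinations")
```

Here only four of the twelve combinations need an explicit filter ID, whereas the one-codeword-per-combination approach would send twelve.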
- this disclosure also describes techniques for signaling, in an encoded bitstream, filter coefficients for filters.
- Techniques of the present disclosure include using differential coding methods to signal filter coefficients from an encoder to a decoder. In this manner, the filter coefficients for a second filter might be communicated to a decoder as difference information, where the difference information describes how to modify the filter coefficients of a first filter in a manner that produces the filter coefficients of the second filter.
- Differential coding techniques may be more effective (i.e., may result in a greater savings of bits) when the filter coefficients of the first and second filters are more similar than when they are less similar.
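Differential coding of filter coefficients can be sketched as follows, assuming each filter is a flat list of integer coefficients and that each filter is predicted from the one signaled immediately before it. Both assumptions, and the function names, are illustrative only.

```python
def encode_differential(filters):
    """Send the first filter directly, then each later filter as differences."""
    encoded = [list(filters[0])]
    for prev, cur in zip(filters, filters[1:]):
        encoded.append([c - p for c, p in zip(cur, prev)])
    return encoded


def decode_differential(encoded):
    """Rebuild each filter by adding its differences to the previous filter."""
    filters = [list(encoded[0])]
    for diff in encoded[1:]:
        filters.append([p + d for p, d in zip(filters[-1], diff)])
    return filters


first = [1, 2, 1, 2, 4, 2, 1, 2, 1]    # illustrative 3x3 filter
second = [1, 2, 1, 2, 5, 2, 1, 2, 1]   # similar filter, one coefficient differs
encoded = encode_differential([first, second])
print(encoded[1])   # [0, 0, 0, 0, 1, 0, 0, 0, 0]
```

The difference information is mostly zeros when the two filters are similar, which is why similar filters tend to cost fewer bits to signal.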
- the techniques of this disclosure include determining a sequential order in which to signal filter coefficients for filters.
- the orderings determined using the techniques described in this disclosure may result in improved differential coding of filter coefficients, and thus, may in some instances result in a savings of bits when signaling the filter coefficients.
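The effect of signaling order on differential coding can be illustrated with a simple greedy heuristic that always signals next the filter most similar to the one just signaled. This nearest-neighbor rule is a stand-in chosen for illustration and is not the ordering technique of this disclosure; the sum of absolute differences is likewise an assumed proxy for bit cost.

```python
def l1_distance(a, b):
    """Sum of absolute coefficient differences between two filters."""
    return sum(abs(x - y) for x, y in zip(a, b))


def greedy_order(filters):
    """Start with filter 0, then repeatedly pick the closest remaining filter."""
    remaining = list(range(len(filters)))
    order = [remaining.pop(0)]
    while remaining:
        last = filters[order[-1]]
        nxt = min(remaining, key=lambda i: l1_distance(last, filters[i]))
        remaining.remove(nxt)
        order.append(nxt)
    return order


def differential_cost(filters, order):
    """Total magnitude of the differences sent when using the given order."""
    return sum(l1_distance(filters[a], filters[b])
               for a, b in zip(order, order[1:]))


filters = [[0, 0, 0], [9, 9, 9], [1, 0, 0], [8, 9, 9]]
order = greedy_order(filters)
print(order)                                     # [0, 2, 3, 1]
print(differential_cost(filters, [0, 1, 2, 3]))  # 78
print(differential_cost(filters, order))         # 27
```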
- In-loop filtering generally refers to filtering in which the filtered data is part of the encoding and decoding loops such that filtered data is used for predictive intra- or inter-coding.
- Post-loop filtering refers to filtering that is applied to reconstructed video data after the encoding loop. With post-loop filtering, the unfiltered data, as opposed to the filtered data, is used for predictive intra- or inter-coding.
- the type of filtering may switch between post-loop filtering and in-loop filtering on, for example, a frame-by-frame, slice-by-slice, or other such basis, and the decision of whether to use post-loop filtering or in-loop filtering can be signaled from encoder to decoder for each frame, slice, etc.
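The distinction between in-loop and post-loop filtering can be sketched by tracking which version of a frame is kept as the reference for later prediction. The per-frame flag and the toy filter below are assumptions for illustration.

```python
def decode_frame(reconstructed, loop_filter, in_loop):
    """Return (frame for output, frame kept as the prediction reference)."""
    filtered = loop_filter(reconstructed)
    # In-loop: the filtered frame feeds prediction.
    # Post-loop: the unfiltered frame feeds prediction; filtering affects output only.
    reference = filtered if in_loop else reconstructed
    return filtered, reference


def halve(frame):
    """Toy stand-in for a real filter."""
    return [p // 2 for p in frame]


out_in, ref_in = decode_frame([4, 8], halve, in_loop=True)
out_post, ref_post = decode_frame([4, 8], halve, in_loop=False)
print(out_in, ref_in)      # [2, 4] [2, 4]
print(out_post, ref_post)  # [2, 4] [4, 8]
```

An encoder and decoder would need to agree on the flag for each frame or slice so that both keep the same reference data.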
- the techniques of this disclosure are not limited to in-loop filtering or post filtering, and may apply to a wide range of filtering applied during video coding.
- The term “coder” refers to any video encoder, video decoder, or combined encoder/decoder (codec). Accordingly, the term “coder” is used herein to refer to a specialized computer device or apparatus that performs video encoding or video decoding.
- The term “filter” generally refers to a set of filter coefficients.
- a 3×3 filter may be defined by a set of 9 filter coefficients
- a 5×5 filter may be defined by a set of 25 filter coefficients
- a 9×5 filter may be defined by a set of 45 filter coefficients, and so on.
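As a sketch of how a filter defined by rows × columns coefficients might be applied, the function below computes a weighted sum of the pixels under the filter support, centered on the pixel being filtered. The row-major coefficient layout and the absence of normalization or boundary handling are simplifying assumptions.

```python
def apply_filter(img, i, j, coeffs, rows, cols):
    """Weighted sum of the rows x cols neighborhood centered on pixel (i, j)."""
    assert len(coeffs) == rows * cols   # e.g. 9 for 3x3, 25 for 5x5, 45 for 9x5
    total = 0
    for r in range(rows):
        for c in range(cols):
            y = i + r - rows // 2
            x = j + c - cols // 2
            total += coeffs[r * cols + c] * img[y][x]
    return total


img = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
identity = [0, 0, 0, 0, 1, 0, 0, 0, 0]   # passes the center pixel through
box = [1] * 9                            # sums the whole 3x3 neighborhood
print(apply_filter(img, 1, 1, identity, 3, 3))  # 5
print(apply_filter(img, 1, 1, box, 3, 3))       # 45
```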
- The term “set of filters” generally refers to a group of more than one filter. For example, a set of two 3×3 filters could include a first set of 9 filter coefficients and a second set of 9 filter coefficients.
- The term “shape,” sometimes called the “filter support,” generally refers to the number of rows of filter coefficients and number of columns of filter coefficients for a particular filter.
- 9×9 is an example of a first shape
- 9×5 is an example of a second shape
- 5×9 is an example of a third shape.
- Filters may take non-rectangular shapes including diamond shapes, diamond-like shapes, circular shapes, circular-like shapes, hexagonal shapes, octagonal shapes, cross shapes, X-shapes, T-shapes, other geometric shapes, or numerous other shapes or configurations.
- FIG. 1 is a block diagram illustrating an exemplary video encoding and decoding system 110 that may implement techniques of this disclosure.
- system 110 includes a source device 112 that transmits encoded video data to a destination device 116 via a communication channel 115 .
- Source device 112 and destination device 116 may comprise any of a wide range of devices.
- source device 112 and destination device 116 may comprise wireless communication device handsets, such as so-called cellular or satellite radiotelephones.
- The techniques of this disclosure, which apply more generally to filtering of video data, are not necessarily limited to wireless applications or settings, and may be applied to non-wireless devices including video encoding and/or decoding capabilities.
- source device 112 includes a video source 120 , a video encoder 122 , a modulator/demodulator (modem) 123 and a transmitter 124 .
- Destination device 116 includes a receiver 126 , a modem 127 , a video decoder 128 , and a display device 130 .
- video encoder 122 of source device 112 may be configured to select one or more sets of filter coefficients for multiple inputs in a video block filtering process and then encode the selected one or more sets of filter coefficients.
- Specific filters from the one or more sets of filter coefficients may be selected based on one or more metrics for one or more inputs, and the filter coefficients may be used to filter the one or more inputs.
- the filtering techniques of this disclosure are generally compatible with any techniques for coding or signaling filter coefficients in an encoded bitstream.
- Each video block or CU within the series of video blocks can then contain additional syntax to identify which filter or filters of the set of the filters is to be used for each input of that video block, or in accordance with the techniques of this disclosure, which filter or filters of the set of the filters is to be used can be determined based on two or more metrics associated with one or more of the inputs.
- video encoder 122 of source device 112 may select one or more sets of filters for a series of video blocks, apply filters from the set(s) to pixels or groups of pixels of inputs associated with CUs of the series of video blocks during the encoding process, and then encode the sets of filters (i.e. sets of filter coefficients) for communication to video decoder 128 of destination device 116 .
- Video encoder 122 may determine one or more metrics associated with inputs of CUs coded in order to select which filter(s) from the set(s) of filters to use with pixels or groups of pixels for that particular CU.
- Video encoder 122 may also signal to video decoder 128 , as part of the coded bitstream, a mapping of combinations of ranges to filters within a set of filters.
- video decoder 128 may determine the filter coefficients based on filter information received in the bitstream syntax. Video decoder 128 may decode the filter coefficients based on direct decoding or predictive decoding depending upon how the filter coefficients were encoded, which may be signaled as part of the bitstream syntax. Additionally, the bitstream may include filter description syntax information to describe the filters for a set of filters. Based on the filter description syntax, decoder 128 can reconstruct the filter coefficients based on additional information received from encoder 122 .
- the illustrated system 110 of FIG. 1 is merely exemplary. The filtering techniques of this disclosure may be performed by any encoding or decoding devices. Source device 112 and destination device 116 are merely examples of coding devices that can support such techniques. Video decoder 128 may also determine the mapping of combinations of ranges to filters based on filter information received in the bitstream syntax.
- Video encoder 122 of source device 112 may encode video data received from video source 120 using the techniques of this disclosure.
- Video source 120 may comprise a video capture device, such as a video camera, a video archive containing previously captured video, or a video feed from a video content provider.
- video source 120 may generate computer graphics-based data as the source video, or a combination of live video, archived video, and computer-generated video.
- source device 112 and destination device 116 may form so-called camera phones or video phones.
- the captured, pre-captured or computer-generated video may be encoded by video encoder 122 .
- the encoded video information may then be modulated by modem 123 according to a communication standard, such as code division multiple access (CDMA), frequency division multiple access (FDMA), orthogonal frequency division multiplexing (OFDM), or any other communication standard or technique, and transmitted to destination device 116 via transmitter 124.
- modem 123 may include various mixers, filters, amplifiers or other components designed for signal modulation.
- Transmitter 124 may include circuits designed for transmitting data, including amplifiers, filters, and one or more antennas.
- Receiver 126 of destination device 116 receives information over channel 115 , and modem 127 demodulates the information.
- the video decoding process performed by video decoder 128 may include filtering, e.g., as part of the in-loop decoding or as a post filtering step following the decoding loop. Either way, the set of filters applied by video decoder 128 for a particular slice or frame may be decoded using the techniques of this disclosure.
- Decoding the filter information may include identifying filter description syntax in the coded bitstream. If, for example, predictive coding is used for the filter coefficients, similarities between different filter coefficients may be exploited to reduce the amount of information conveyed over channel 115.
- a set of the filter coefficients can be predictively coded as difference values relative to another set of the filter coefficients associated with a different filter.
- the different filter may, for example, be associated with a different slice or frame.
- video decoder 128 might receive an encoded bitstream comprising video blocks and filter information that identifies the different frame or slice with which the different filter is associated.
- the filter information also includes difference values that define the current filter relative to the filter of the different CU.
- the difference values may comprise filter coefficient difference values that define filter coefficients for the current filter relative to filter coefficients of a different filter used for a different CU.
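- As a non-limiting illustration, the predictive coding of filter coefficients described above can be sketched as follows. The function names and the coefficient values are hypothetical and are not taken from this disclosure; the sketch assumes both filters have the same number of integer coefficients.

```python
def encode_differences(current, reference):
    """Encoder side: express the current filter as differences from a reference filter."""
    if len(current) != len(reference):
        raise ValueError("filters must have the same number of coefficients")
    return [c - r for c, r in zip(current, reference)]


def decode_from_differences(differences, reference):
    """Decoder side: rebuild the current filter from the reference filter and the differences."""
    if len(differences) != len(reference):
        raise ValueError("difference list must match the reference filter length")
    return [r + d for r, d in zip(reference, differences)]


# Hypothetical coefficients of a filter used for a different slice or frame.
reference_filter = [1, -5, 20, 20, -5, 1]
# Hypothetical coefficients of the current filter.
current_filter = [1, -4, 19, 21, -5, 0]

differences = encode_differences(current_filter, reference_filter)
print(differences)  # [0, 1, -1, 1, 0, -1]
print(decode_from_differences(differences, reference_filter) == current_filter)  # True
```

When the two filters are similar, the difference values are small in magnitude, which is the property the predictive coding relies on to reduce the information conveyed.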
- Video decoder 128 decodes the video blocks, generates the filter coefficients, and filters the decoded video blocks based on the generated filter coefficients. Video decoder 128 can generate the filter coefficients based on filter description syntax retrieved from the bitstream. The decoded and filtered video blocks can be assembled into video frames to form decoded video data.
- Display device 128 displays the decoded video data to a user, and may comprise any of a variety of display devices such as a cathode ray tube (CRT), a liquid crystal display (LCD), a plasma display, an organic light emitting diode (OLED) display, or another type of display device.
- Communication channel 115 may comprise any wireless or wired communication medium, such as a radio frequency (RF) spectrum or one or more physical transmission lines, or any combination of wireless and wired media.
- Communication channel 115 may form part of a packet-based network, such as a local area network, a wide-area network, or a global network such as the Internet.
- Communication channel 115 generally represents any suitable communication medium, or collection of different communication media, for transmitting video data from source device 112 to destination device 116 .
- FIG. 1 is merely exemplary and the techniques of this disclosure may apply to video coding settings (e.g., video encoding or video decoding) that do not necessarily include any data communication between the encoding and decoding devices. In other examples, data could be retrieved from a local memory, streamed over a network, or the like.
- encoded data may be output from video encoder 122 to a storage device 132 .
- encoded data may be accessed from storage device 132 by video decoder 128 .
- Storage device 132 may include any of a variety of distributed or locally accessed data storage media such as a hard drive, Blu-ray discs, DVDs, CD-ROMs, flash memory, volatile or non-volatile memory, or any other suitable digital storage media for storing encoded video data.
- storage device 132 may correspond to a file server or another intermediate storage device that may hold the encoded video generated by source device 112 .
- Destination device 116 may access stored video data from storage device 132 via streaming or download.
- the file server may be any type of server capable of storing encoded video data and transmitting that encoded video data to the destination device 116 .
- Example file servers include a web server (e.g., for a website), an FTP server, network attached storage (NAS) devices, or a local disk drive.
- Destination device 116 may access the encoded video data through any standard data connection, including an Internet connection. This may include a wireless channel (e.g., a Wi-Fi connection), a wired connection (e.g., DSL, cable modem, etc.), or a combination of both that is suitable for accessing encoded video data stored on a file server.
- the transmission of encoded video data from storage device 132 may be a streaming transmission, a download transmission, or a combination of both.
- system 110 may be configured to support one-way or two-way video transmission to support applications such as video streaming, video playback, video broadcasting, and/or video telephony.
- Video encoder 122 and video decoder 128 may operate according to a video compression standard such as the ITU-T H.264 standard, alternatively referred to as MPEG-4, Part 10, Advanced Video Coding (AVC), which will be used in parts of this disclosure for purposes of explanation.
- many of the techniques of this disclosure may be readily applied to any of a variety of other video coding standards, including the newly emerging HEVC standard.
- any standard that allows for filtering at the encoder and decoder may benefit from various aspects of the teaching of this disclosure.
- video encoder 122 and video decoder 128 may each be integrated with an audio encoder and decoder, and may include appropriate MUX-DEMUX units, or other hardware and software, to handle encoding of both audio and video in a common data stream or separate data streams. If applicable, MUX-DEMUX units may conform to the ITU H.223 multiplexer protocol, or other protocols such as the user datagram protocol (UDP).
- devices 112 , 116 may operate in a substantially symmetrical manner.
- each of devices 112 , 116 may include video encoding and decoding components.
- system 110 may support one-way or two-way video transmission between video devices 112 , 116 , e.g., for video streaming, video playback, video broadcasting, or video telephony.
- video encoder 122 may execute a number of coding techniques or steps.
- video encoder 122 operates on video blocks within individual video frames in order to encode the video data.
- a video block may correspond to a macroblock or a partition of a macroblock.
- Macroblocks are one type of video block defined by the ITU H.264 standard and other standards. Macroblocks typically refer to 16×16 blocks of data, although the term is also sometimes used generically to refer to any video block of N×N or N×M size.
- the ITU-T H.264 standard supports intra prediction in various block sizes, such as 16×16, 8×8, or 4×4 for luma components, and 8×8 for chroma components, as well as inter prediction in various block sizes, such as 16×16, 16×8, 8×16, 8×8, 8×4, 4×8 and 4×4 for luma components and corresponding scaled sizes for chroma components.
- N×N refers to the pixel dimensions of the block in terms of vertical and horizontal dimensions, e.g., 16×16 pixels.
- a 16×16 block will have 16 pixels in a vertical direction and 16 pixels in a horizontal direction.
- an N×N block generally has N pixels in a vertical direction and N pixels in a horizontal direction, where N represents a positive integer value.
- the pixels in a block may be arranged in rows and columns.
- video blocks may be referred to as “coding units” (or CUs).
- other types of video blocks include largest coded units (LCUs) and prediction units (PUs).
- the LCUs, CUs, and PUs are all video blocks within the meaning of this disclosure.
- Other types of video blocks may also be used, consistent with the HEVC standard or other video coding standards.
- the phrase “video blocks” refers to any size of video block.
- Separate CUs may be included for luma components and scaled sizes for chroma components for a given pixel, although other color spaces could also be used.
- Video blocks may have fixed or varying sizes, and may differ in size according to a specified coding standard.
- Each video frame may include a plurality of slices.
- Each slice may include a plurality of video blocks, which may be arranged into partitions, also referred to as sub-blocks.
- an N/2×N/2 first CU may comprise a sub-block of an N×N LCU.
- an N/4×N/4 second CU may also comprise a sub-block of the first CU.
- An N/8×N/8 PU may comprise a sub-block of the second CU.
- Video blocks may comprise blocks of pixel data in the pixel domain, or blocks of transform coefficients in the transform domain, e.g., following application of a transform such as a discrete cosine transform (DCT), an integer transform, a wavelet transform, or a conceptually similar transform to the residual video block data representing pixel differences between coded video blocks and predictive video blocks.
- a video block may comprise blocks of quantized transform coefficients in the transform domain.
- Syntax data within a bitstream may define an LCU for a frame or a slice, which is a largest coding unit in terms of the number of pixels for that frame or slice.
- an LCU or CU has a similar purpose to a macroblock coded according to H.264, except that LCUs and CUs do not have a specific size distinction.
- an LCU size can be defined on a frame-by-frame or slice-by-slice basis, and an LCU may be split into CUs.
- references in this disclosure to a CU may refer to an LCU of a picture or a sub-CU of an LCU.
- An LCU may be split into sub-CUs, and each sub-CU may be split into sub-CUs.
- Syntax data for a bitstream may define a maximum number of times an LCU may be split, referred to as CU depth. Accordingly, a bitstream may also define a smallest coding unit (SCU).
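- As a non-limiting illustration, the relationship between LCU size, CU depth, and SCU size can be sketched as follows. The sketch assumes that each split halves both the width and the height of a square CU, consistent with the quadtree splitting described in this disclosure; the function names and the 64-pixel LCU size are hypothetical.

```python
def scu_size(lcu_size, max_cu_depth):
    """Return the side length of the smallest CU reachable after max_cu_depth splits."""
    if lcu_size <= 0 or max_cu_depth < 0:
        raise ValueError("lcu_size must be positive and max_cu_depth non-negative")
    size = lcu_size >> max_cu_depth  # each split halves the side length
    if size == 0 or (size << max_cu_depth) != lcu_size:
        raise ValueError("lcu_size is not divisible by 2 ** max_cu_depth")
    return size


def cu_sizes(lcu_size, max_cu_depth):
    """List the CU side length available at each depth, from the LCU down to the SCU."""
    return [scu_size(lcu_size, depth) for depth in range(max_cu_depth + 1)]


print(cu_sizes(64, 3))  # [64, 32, 16, 8]
print(scu_size(64, 3))  # 8
```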
- This disclosure also uses the terms “block” and “video block” to refer to any of an LCU, CU, PU, SCU, or TU.
- an LCU may be associated with a quadtree data structure.
- a quadtree data structure includes one node per CU, where a root node corresponds to the LCU. If a CU is split into four sub-CUs, the node corresponding to the CU includes four leaf nodes, each of which corresponds to one of the sub-CUs.
- Each node of the quadtree data structure may provide syntax data for the corresponding CU.
- a node in the quadtree may include a split flag, indicating whether the CU corresponding to the node is split into sub-CUs. Syntax elements for a CU may be defined recursively, and may depend on whether the CU is split into sub-CUs.
- a CU that is not split may include one or more prediction units (PUs).
- a PU represents all or a portion of the corresponding CU, and includes data for retrieving a reference sample for the PU.
- the PU may include data describing an intra-prediction mode for the PU.
- the PU may include data defining a motion vector for the PU.
- the data defining the motion vector may describe, for example, a horizontal component of the motion vector, a vertical component of the motion vector, a resolution for the motion vector (e.g., one-quarter pixel precision or one-eighth pixel precision), a reference frame to which the motion vector points, and/or a reference list (e.g., list 0 or list 1) for the motion vector.
- Data for the CU defining the PU(s) may also describe, for example, partitioning of the CU into one or more PUs. Partitioning modes may differ between whether the CU is uncoded, intra-prediction mode encoded, or inter-prediction mode encoded.
- a CU having one or more PUs may also include one or more transform units (TUs).
- the TUs comprise the data structure that includes residual transform coefficients, which are typically quantized.
- a video encoder may calculate residual values for the portion of the CU corresponding to the PU.
- the residual values may be transformed, quantized, scanned and stored in a TU, which may have variable sizes corresponding to the size of the transform that was performed. Accordingly, a TU is not necessarily limited to the size of a PU.
- TUs may be larger or smaller than corresponding PUs for the same CU.
- the maximum size of a TU may be the size of the corresponding CU.
- the TUs may comprise the data structures that include the residual transform coefficients associated with a given CU.
- FIGS. 2A and 2B are conceptual diagrams illustrating an example quadtree 250 and a corresponding LCU 272 .
- FIG. 2A depicts an example quadtree 250 , which includes nodes arranged in a hierarchical fashion. Each node in a quadtree, such as quadtree 250 , may be a leaf node with no children, or have four child nodes.
- quadtree 250 includes root node 252 . Root node 252 has four child nodes, including leaf nodes 256 A- 256 C (leaf nodes 256 ) and node 254 . Because node 254 is not a leaf node, node 254 includes four child nodes, which in this example, are leaf nodes 258 A- 258 D (leaf nodes 258 ).
- Quadtree 250 may include data describing characteristics of a corresponding LCU, such as LCU 272 in this example.
- quadtree 250 by its structure, may describe splitting of the LCU into sub-CUs.
- LCU 272 has a size of 2N×2N.
- LCU 272, in this example, has four sub-CUs 276A-276C (sub-CUs 276) and 274, each of size N×N.
- Sub-CU 274 is further split into four sub-CUs 278A-278D (sub-CUs 278), each of size N/2×N/2.
- the structure of quadtree 250 corresponds to the splitting of LCU 272 , in this example. That is, root node 252 corresponds to LCU 272 , leaf nodes 256 correspond to sub-CUs 276 , node 254 corresponds to sub-CU 274 , and leaf nodes 258 correspond to sub-CUs 278 .
- Data for nodes of quadtree 250 may describe whether the CU corresponding to the node is split. If the CU is split, four additional nodes may be present in quadtree 250 .
- a node of a quadtree may be implemented similar to the following pseudocode:
- quadtree_node {
      boolean split_flag(1); // signaling data
      if (split_flag) {
          quadtree_node child1;
          quadtree_node child2;
          quadtree_node child3;
          quadtree_node child4;
      }
  }
- the split_flag value may be a one-bit value representative of whether the CU corresponding to the current node is split. If the CU is not split, the split_flag value may be ‘0’, while if the CU is split, the split_flag value may be ‘1’. With respect to the example of quadtree 250 , an array of split flag values may be 101000000.
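- As a non-limiting illustration, the array of split flag values for quadtree 250 can be produced by a depth-first traversal, sketched below. The class and function names are hypothetical, and the sketch assumes that node 254 is visited as the second child of root node 252, which is the ordering implied by the example array.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class QuadtreeNode:
    # A node is either a leaf (no children) or has exactly four children.
    children: List["QuadtreeNode"] = field(default_factory=list)

    @property
    def split_flag(self) -> int:
        return 1 if self.children else 0


def serialize_split_flags(node):
    """Emit the node's split flag, then the flags of its children, depth first."""
    flags = str(node.split_flag)
    for child in node.children:
        flags += serialize_split_flags(child)
    return flags


def parse_split_flags(flags):
    """Rebuild the quadtree from a string of split flags."""
    def parse(pos):
        if pos >= len(flags):
            raise ValueError("flag string ended early")
        bit = flags[pos]
        if bit not in "01":
            raise ValueError("flags must be '0' or '1'")
        pos += 1
        node = QuadtreeNode()
        if bit == "1":
            for _ in range(4):
                child, pos = parse(pos)
                node.children.append(child)
        return node, pos

    root, end = parse(0)
    if end != len(flags):
        raise ValueError("unexpected trailing flags")
    return root


# Quadtree 250: root node 252 with leaf nodes 256A-256C and split node 254.
node_254 = QuadtreeNode(children=[QuadtreeNode() for _ in range(4)])
root_252 = QuadtreeNode(children=[QuadtreeNode(), node_254, QuadtreeNode(), QuadtreeNode()])

print(serialize_split_flags(root_252))  # 101000000
```

Parsing the same string with parse_split_flags yields a tree equal to root_252, so a decoder can recover the partitioning from the flags alone.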
- each of sub-CUs 276 and sub-CUs 278 may be intra-prediction encoded using the same intra-prediction mode. Accordingly, video encoder 122 may provide an indication of the intra-prediction mode in root node 252. Moreover, certain sizes of sub-CUs may have multiple possible transforms for a particular intra-prediction mode. Video encoder 122 may provide an indication of the transform to use for such sub-CUs in root node 252. For example, sub-CUs of size N/2×N/2 may have multiple possible transforms available. Video encoder 122 may signal the transform to use in root node 252. Accordingly, video decoder 128 may determine the transform to apply to sub-CUs 278 based on the intra-prediction mode signaled in root node 252 and the transform signaled in root node 252.
- video encoder 122 need not signal transforms to apply to sub-CUs 276 and sub-CUs 278 in leaf nodes 256 and leaf nodes 258 , but may instead simply signal an intra-prediction mode and, in some examples, a transform to apply to certain sizes of sub-CUs, in root node 252 , in accordance with the techniques of this disclosure. In this manner, these techniques may reduce the overhead cost of signaling transform functions for each sub-CU of an LCU, such as LCU 272 .
- intra-prediction modes for sub-CUs 276 and/or sub-CUs 278 may be different than intra-prediction modes for LCU 272 .
- Video encoder 122 and video decoder 128 may be configured with functions that map an intra-prediction mode signaled at root node 252 to an available intra-prediction mode for sub-CUs 276 and/or sub-CUs 278.
- the function may provide a many-to-one mapping of intra-prediction modes available for LCU 272 to intra-prediction modes for sub-CUs 276 and/or sub-CUs 278 .
- a slice may be divided into video blocks (or LCUs) and each video block may be partitioned according to the quadtree structure described in relation to FIGS. 2A-B .
- the quadtree sub-blocks indicated by “ON” may be filtered by loop filters described herein, while quadtree sub-blocks indicated by “OFF” may not be filtered.
- the decision of whether or not to filter a given block or sub-block may be determined at the encoder by comparing the filtered result and the non-filtered result relative to the original block being coded.
- FIG. 2D is a decision tree representing partitioning decisions that result in the quadtree partitioning shown in FIG. 2C.
- the actual filtering applied to any pixels for “ON” blocks, may be determined based on the metrics discussed herein.
- FIG. 2C may represent a relatively large video block that is partitioned according to a quadtree partitioning scheme into smaller video blocks of varying sizes.
- Each video block is labelled (on or off) in FIG. 2C , to illustrate whether filtering should be applied or avoided for that video block.
- the video encoder may define this filter map by comparing filtered and unfiltered versions of each video block to the original video block being coded.
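- As a non-limiting illustration, the ON/OFF decision for one video block can be sketched as follows. This disclosure does not name a distortion measure, so the sum of squared errors used here is an assumption, as are the function names and pixel values.

```python
def sum_squared_error(block, original):
    """Distortion between two equally sized blocks (lists of rows of pixel values)."""
    return sum(
        (b - o) ** 2
        for row_b, row_o in zip(block, original)
        for b, o in zip(row_b, row_o)
    )


def filter_flag(original, unfiltered, filtered):
    """Return 1 (filtering ON) only when the filtered block is strictly closer to the original."""
    filtered_error = sum_squared_error(filtered, original)
    unfiltered_error = sum_squared_error(unfiltered, original)
    return 1 if filtered_error < unfiltered_error else 0


# Hypothetical 2x2 blocks.
original = [[10, 10], [10, 10]]
unfiltered = [[8, 12], [9, 13]]   # squared error: 4 + 4 + 1 + 9 = 18
filtered = [[9, 11], [10, 11]]    # squared error: 1 + 1 + 0 + 1 = 3

print(filter_flag(original, unfiltered, filtered))  # 1
```

Repeating this comparison for each block of the quadtree yields the per-block flags that make up the filter map.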
- FIG. 2D is a decision tree corresponding to partitioning decisions that result in the quadtree partitioning shown in FIG. 2C .
- each circle may correspond to a CU. If the circle includes a “1” flag, then that CU is further partitioned into four more CUs, but if the circle includes a “0” flag, then that CU is not partitioned any further.
- Each circle (e.g., corresponding to CUs) also includes an associated diamond. If the flag in the diamond for a given CU is set to 1, then filtering is turned “ON” for that CU, but if the flag in the diamond for a given CU is set to 0, then filtering is turned off. In this manner, FIGS. 2C and 2D may be individually or collectively viewed as a filter map that can be generated at an encoder and communicated to a decoder at least once per slice of encoded video data in order to communicate the level of quadtree partitioning for a given video block (e.g., an LCU) and whether or not to apply filtering to each partitioned video block (e.g., each CU within the LCU).
- a slice may be considered to be a plurality of video blocks and/or sub-blocks. Each slice may be an independently decodable series of video blocks of a video frame. Alternatively, frames themselves may be decodable series of video blocks, or other portions of a frame may be defined as decodable series of video blocks.
- series of video blocks may refer to any independently decodable portion of a video frame such as an entire frame, a slice of a frame, a group of pictures (GOP) also referred to as a sequence, or another independently decodable unit defined according to applicable coding techniques. Aspects of this disclosure might be described in reference to frames or slices, but such references are merely exemplary. It should be understood that generally any series of video blocks may be used instead of a frame or a slice.
- Syntax data may be defined on a per-coded-unit basis such that each CU includes associated syntax data.
- the filter information described herein may be part of such syntax for a CU, but might more likely be part of syntax for a series of video blocks, such as a frame, a slice, a GOP, LCU, or a sequence of video frames, instead of for a CU.
- the syntax data can indicate the set or sets of filters to be used with CUs of the slice or frame. Additionally, not all filter information necessarily has to be included in the header of a common series of video blocks. For example, filter description syntax might be transmitted in a frame header, while other filter information is signaled in a header for an LCU.
- Video encoder 122 may perform predictive coding in which a video block being coded is compared to a predictive frame (or other CU) in order to identify a predictive block.
- the differences between the current video block being coded and the predictive block are coded as a residual block, and prediction syntax is used to identify the predictive block.
- the residual block may be transformed and quantized.
- Transform techniques may comprise a DCT process or conceptually similar process, integer transforms, wavelet transforms, or other types of transforms.
- the transform process converts a set of pixel values into transform coefficients, which may represent the energy of the pixel values in the frequency domain.
- Quantization is typically applied to the transform coefficients, and generally involves a process that limits the number of bits associated with any given transform coefficient.
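- As a non-limiting illustration, the bit-limiting effect of quantization can be sketched with a uniform scalar quantizer. The quantizer design (division by a step size with truncation toward zero), the step size, and the coefficient values are assumptions made for this sketch and are not taken from this disclosure.

```python
def quantize(coefficients, step):
    """Map each transform coefficient to a quantization level, truncating toward zero."""
    if step <= 0:
        raise ValueError("step must be positive")
    return [int(c / step) for c in coefficients]


def dequantize(levels, step):
    """Approximate the original coefficients from the quantization levels."""
    return [level * step for level in levels]


def magnitude_bits(values):
    """Number of bits needed for the largest magnitude in the list (sign excluded)."""
    return max(abs(v).bit_length() for v in values)


coefficients = [-37, 12, 5, 0]           # hypothetical transform coefficients
levels = quantize(coefficients, step=8)

print(levels)                             # [-4, 1, 0, 0]
print(dequantize(levels, step=8))         # [-32, 8, 0, 0]
print(magnitude_bits(coefficients))       # 6
print(magnitude_bits(levels))             # 3
```

The levels need fewer bits than the coefficients, at the cost of the reconstruction error visible in the dequantized values.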
- entropy coding may be performed on the quantized and transformed residual video blocks. Syntax elements, such as the filter information and prediction vectors defined during the encoding, may also be included in the entropy coded bitstream for each CU.
- entropy coding comprises one or more processes that collectively compress a sequence of quantized transform coefficients and/or other syntax information. Scanning techniques, such as zig-zag scanning techniques, are performed on the quantized transform coefficients, e.g., as part of the entropy coding process, in order to define one or more serialized one-dimensional vectors of coefficients from two-dimensional video blocks.
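- As a non-limiting illustration, a zig-zag scan that serializes a square two-dimensional block into a one-dimensional vector can be sketched as follows. The function name is hypothetical, and the example block simply holds each position's raster index so that the scan order is visible in the output.

```python
def zigzag_scan(block):
    """Serialize a square 2-D block into a 1-D list in zig-zag order."""
    n = len(block)
    if any(len(row) != n for row in block):
        raise ValueError("block must be square")
    # Positions are grouped by anti-diagonal (row + column). Odd diagonals are
    # walked with the row increasing, even diagonals with the column increasing.
    order = sorted(
        ((r, c) for r in range(n) for c in range(n)),
        key=lambda rc: (rc[0] + rc[1], rc[0] if (rc[0] + rc[1]) % 2 else rc[1]),
    )
    return [block[r][c] for r, c in order]


# 4x4 block whose entries are their own raster-scan indices.
block = [[4 * r + c for c in range(4)] for r in range(4)]

print(zigzag_scan(block))
# [0, 1, 4, 8, 5, 2, 3, 6, 9, 12, 13, 10, 7, 11, 14, 15]
```

Because low-frequency coefficients sit near the top-left corner of a transformed block, this ordering tends to place the nonzero quantized coefficients first and the zeros in a trailing run.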
- examples of entropy coding techniques include content adaptive variable length coding (CAVLC) and context adaptive binary arithmetic coding (CABAC).
- encoded video blocks may be decoded in order to generate the video data used for subsequent prediction-based coding of subsequent video blocks.
- filtering may be performed in order to improve video quality, e.g., to remove blockiness artifacts from decoded video.
- the filtered data may be used for prediction of other video blocks, in which case the filtering is referred to as “in-loop” filtering.
- prediction of other video blocks may be based on unfiltered data, in which case the filtering is referred to as “post filtering.”
- video encoder 122 may select one or more sets of filters, and on a coded-unit-by-coded-unit basis, the encoder may determine whether or not to apply filtering. For the CUs that are to be filtered, the encoder can perform filtering on a pixel-by-pixel or group-by-group basis, where a group might, for example, be a 2×2 block of pixels or a 4×4 block of pixels. These selections can be made in a manner that promotes the video quality.
- Such sets of filters may be selected from pre-defined sets of filters, or may be adaptively defined to promote video quality.
- video encoder 122 may select or define several sets of filters for a given frame or slice such that different filters are used for different pixels or groups of pixels of CUs of that frame or slice.
- several sets of filter coefficients may be defined, and the two or more metrics associated with the pixels of the CU may be used to determine which filter from the set of filters to use with such pixels or groups of pixels.
- video encoder 122 may apply several sets of filter coefficients and select one or more sets that produce the best quality video in terms of amount of distortion between a coded block and an original block, and/or the highest levels of compression.
- the set of filter coefficients applied by video encoder 122 for each CU may be encoded and communicated to video decoder 128 of destination device 116 so that video decoder 128 can apply the same filtering that was applied during the encoding process for each given CU.
- video decoder 128 can also calculate the two or more metrics, and based on filter information previously provided by video encoder 122 , match the combination of two or more metrics to a particular filter.
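- As a non-limiting illustration, matching a combination of two metrics to a filter can be sketched as a lookup in a two-dimensional table indexed by the range into which each metric falls. The class name, thresholds, table contents, and filter identifiers below are hypothetical; the sketch only assumes that each metric is divided into contiguous ranges and that the mapping of range combinations to filters is known to both encoder and decoder.

```python
from bisect import bisect_right


class FilterMap:
    """Maps a combination of two metric ranges to a filter identifier."""

    def __init__(self, thresholds_1, thresholds_2, table):
        # N sorted thresholds split a metric into N + 1 ranges.
        rows, cols = len(thresholds_1) + 1, len(thresholds_2) + 1
        if len(table) != rows or any(len(row) != cols for row in table):
            raise ValueError("table must have one entry per combination of ranges")
        self.thresholds_1 = thresholds_1
        self.thresholds_2 = thresholds_2
        self.table = table

    def select(self, metric_1, metric_2):
        """Return the filter identifier for the ranges containing the two metric values."""
        range_1 = bisect_right(self.thresholds_1, metric_1)
        range_2 = bisect_right(self.thresholds_2, metric_2)
        return self.table[range_1][range_2]


# Three ranges for the first metric, two for the second, four filters in the set.
filter_map = FilterMap(
    thresholds_1=[10, 50],
    thresholds_2=[0.5],
    table=[[0, 0], [1, 2], [3, 3]],
)

print(filter_map.select(5, 0.9))    # 0
print(filter_map.select(30, 0.2))   # 1
print(filter_map.select(30, 0.7))   # 2
print(filter_map.select(50, 0.5))   # 3
```

Because several combinations of ranges may share one filter, the table can be larger than the set of filters it refers to.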
- FIG. 3 is a block diagram illustrating a video encoder 350 consistent with this disclosure.
- Video encoder 350 may correspond to video encoder 122 of source device 112, or a video encoder of a different device. As shown in FIG. 3, video encoder 350 includes a prediction module 332, adders 348 and 351, and a memory 334. Video encoder 350 also includes a transform unit 338 and a quantization unit 340, as well as an inverse quantization unit 342 and an inverse transform unit 344. Video encoder 350 also includes a deblocking filter 347 and an adaptive filter unit 349. Video encoder 350 also includes an entropy encoding unit 346.
- Filter unit 349 of video encoder 350 may perform filtering operations and also may include a filter selection unit (FSU) 353 for identifying a desirable or preferred filter or set of filters to be used for decoding. Filter unit 349 may also generate filter information identifying the selected filters so that the selected filters can be efficiently communicated as filter information to another device to be used during a decoding operation.
- video encoder 350 receives a video block, such as an LCU, to be coded, and prediction module 332 performs predictive coding techniques on the video block.
- prediction module 332 can partition the video block and perform predictive coding techniques on CUs of different sizes.
- prediction module 332 compares the video block to be encoded, including sub-blocks of the video block, to various blocks in one or more video reference frames or slices in order to define a predictive block.
- for intra coding, prediction module 332 generates a predictive block based on neighboring data within the same CU. Prediction module 332 outputs the prediction block and adder 348 subtracts the prediction block from the video block being coded in order to generate a residual block.
- prediction module 332 may comprise motion estimation and motion compensation units that identify a motion vector that points to a prediction block and generates the prediction block based on the motion vector.
- motion estimation is considered the process of generating the motion vector, which estimates motion.
- the motion vector may indicate the displacement of a predictive block within a predictive frame relative to the current block being coded within the current frame.
- Motion compensation is typically considered the process of fetching or generating the predictive block based on the motion vector determined by motion estimation.
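- As a non-limiting illustration, motion estimation and motion compensation can be sketched as a full search over a small window. The sum of absolute differences cost, the search strategy, the function names, and the frame contents are all assumptions made for this sketch; this disclosure does not prescribe a particular search method.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(block_a, block_b)
        for a, b in zip(row_a, row_b)
    )


def motion_compensate(ref_frame, top, left, size, motion_vector):
    """Fetch the predictive block that the motion vector (dx, dy) points to."""
    dx, dy = motion_vector
    rows = ref_frame[top + dy:top + dy + size]
    return [row[left + dx:left + dx + size] for row in rows]


def motion_estimate(current_block, ref_frame, top, left, search_range):
    """Return the (dx, dy) displacement with the lowest SAD inside the search window."""
    size = len(current_block)
    height, width = len(ref_frame), len(ref_frame[0])
    best_cost, best_vector = None, None
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            y, x = top + dy, left + dx
            if y < 0 or x < 0 or y + size > height or x + size > width:
                continue  # candidate block would fall outside the reference frame
            cost = sad(current_block, motion_compensate(ref_frame, top, left, size, (dx, dy)))
            if best_cost is None or cost < best_cost:
                best_cost, best_vector = cost, (dx, dy)
    if best_vector is None:
        raise ValueError("no candidate block fits inside the reference frame")
    return best_vector


# Hypothetical 6x6 reference frame with a distinct value at every position.
reference_frame = [[6 * r + c for c in range(6)] for r in range(6)]
# Current block located at row 1, column 1; its content sits at row 2, column 3 of the reference.
current_block = [[15, 16], [21, 22]]

vector = motion_estimate(current_block, reference_frame, top=1, left=1, search_range=2)
print(vector)  # (2, 1)
print(motion_compensate(reference_frame, 1, 1, 2, vector))  # [[15, 16], [21, 22]]
```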
- for intra coding, prediction module 332 generates a predictive block based on neighboring data within the same CU.
- One or more intra-prediction modes may define how an intra prediction block can be defined.
- transform unit 338 applies a transform to the residual block.
- the transform may comprise a discrete cosine transform (DCT) or a conceptually similar transform such as that defined by a coding standard such as the HEVC standard. Wavelet transforms, integer transforms, sub-band transforms or other types of transforms could also be used.
- transform unit 338 applies the transform to the residual block, producing a block of residual transform coefficients.
- the transform may convert the residual information from a pixel domain to a frequency domain.
- Quantization unit 340 then quantizes the residual transform coefficients to further reduce bit rate.
- Quantization unit 340 may limit the number of bits used to code each of the coefficients.
- entropy encoding unit 346 scans the quantized coefficient block from a two-dimensional representation to one or more serialized one-dimensional vectors. The scan order may be pre-programmed to occur in a defined order (such as zig-zag scanning, horizontal scanning, vertical scanning, combinations, or another pre-defined order), or possibly adaptively defined based on previous coding statistics.
- entropy encoding unit 346 encodes the quantized transform coefficients (along with any syntax elements) according to an entropy coding methodology, such as CAVLC or CABAC, to further compress the data.
- Syntax elements included in the entropy coded bitstream may include prediction syntax from prediction module 332 , such as motion vectors for inter coding or prediction modes for intra coding.
- Syntax elements included in the entropy coded bitstream may also include filter information from filter unit 349 , which can be encoded in the manner described herein.
- CAVLC is one type of entropy encoding technique supported by the ITU H.264/MPEG4, AVC standard, which may be applied on a vectorized basis by entropy encoding unit 346 .
- CAVLC uses variable length coding (VLC) tables in a manner that effectively compresses serialized “runs” of transform coefficients and/or syntax elements.
- CABAC is another type of entropy coding technique supported by the ITU H.264/MPEG4, AVC standard, which may be applied on a vectorized basis by entropy encoding unit 346 .
- CABAC involves several stages, including binarization, context model selection, and binary arithmetic coding.
- entropy encoding unit 346 codes transform coefficients and syntax elements according to CABAC.
- the emerging HEVC standard may also support both CAVLC and CABAC entropy coding.
- many other types of entropy coding techniques also exist, and new entropy coding techniques will likely emerge in the future. This disclosure is not limited to any specific entropy coding technique.
- the encoded video may be transmitted to another device or archived for later transmission or retrieval.
- the encoded video may comprise the entropy coded vectors and various syntax, which can be used by the decoder to properly configure the decoding process.
- Inverse quantization unit 342 and inverse transform unit 344 apply inverse quantization and inverse transform, respectively, to reconstruct the residual block in the pixel domain.
- Summer 351 adds the reconstructed residual block to the prediction block produced by prediction module 332 to produce a pre-deblocked reconstructed video block, sometimes referred to as pre-deblocked reconstructed image.
- De-blocking filter 347 may apply filtering to the pre-deblocked reconstructed video block to improve video quality by removing blockiness or other artifacts.
- the output of the de-blocking filter 347 can be referred to as a post-deblocked video block, reconstructed video block, or reconstructed image.
- Filter unit 349 can be configured to receive a single input or multiple inputs. In the example of FIG. 3 , filter unit 349 receives as input the post-deblocked reconstructed image (RI), pre-deblocked reconstructed image (pRI), the prediction image (PI), and the reconstructed residual block (EI). Filter unit 349 can use any of these inputs either individually or in combination to produce a reconstructed image to store in memory 334 . Additionally, as will be discussed in more detail below, one or more filters can be selected to be applied to the input(s). In one example, the output of filter unit 349 may be one additional filter applied to RI. In another example, the output of filter unit 349 may be one additional filter applied to pRI.
- the output of filter unit 349 may be based on multiple inputs. For example, filter unit 349 may apply a first filter to pRI and then use the filtered version of pRI in conjunction with filtered versions of EI and PI to create a reconstructed image. In instances where the output of filter unit 349 is the product of one additional filter being applied to a single input, filter unit 349 may in fact apply filters to the other inputs, but those filters might have all zero coefficients. Similarly, if the output of filter unit 349 is the product of applying three filters to three inputs, filter unit 349 may in fact apply a filter to the fourth input, but that filter might have all zero coefficients.
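The zero-coefficient idea above can be illustrated with a minimal sketch. It assumes one-dimensional rows of samples, border clamping, and made-up sample values; the function names `apply_filter_1d` and `combine_inputs` are hypothetical and do not come from the disclosure.

```python
def apply_filter_1d(samples, taps):
    """Apply a symmetric-support FIR filter to one row, clamping at the borders."""
    half = len(taps) // 2
    n = len(samples)
    out = []
    for i in range(n):
        acc = 0.0
        for k, weight in enumerate(taps):
            j = min(max(i + k - half, 0), n - 1)  # clamp index (assumed border rule)
            acc += weight * samples[j]
        out.append(acc)
    return out


def combine_inputs(inputs, filters):
    """Sum the filtered versions of every input; all-zero filters contribute nothing."""
    n = len(next(iter(inputs.values())))
    total = [0.0] * n
    for name, samples in inputs.items():
        filtered = apply_filter_1d(samples, filters[name])
        total = [t + f for t, f in zip(total, filtered)]
    return total


# Hypothetical sample rows for the four inputs named in the text.
inputs = {"RI": [10, 20, 30], "pRI": [1, 1, 1], "PI": [7, 7, 7], "EI": [2, 0, 2]}
# Only RI gets a real filter; the other inputs get all-zero coefficients.
filters = {"RI": [0.25, 0.5, 0.25], "pRI": [0, 0, 0], "PI": [0, 0, 0], "EI": [0, 0, 0]}
result = combine_inputs(inputs, filters)
```

With these filters the combined output equals the filtered RI row alone, which is the single-input case expressed through the multi-input structure.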
- Filter unit 349 may also be configured to receive a single input.
- Although FIG. 3 shows PI, EI, pRI, and RI being input into filter unit 349, RI might be the only input received by filter unit 349.
- filter unit 349 might apply a filter to RI so that a filtered version of RI is more similar to the original image than the unfiltered version of RI.
- filter unit 349 and de-blocking filter 347 may be combined into a single filtering unit that applies filtering to pRI.
- the techniques of this disclosure, which generally relate to multi-metric-based filter mapping, are compatible with both single-input and multi-input filtering schemes that utilize multiple filters.
- Filtering by filter unit 349 may improve compression by generating predictive video blocks that more closely match video blocks being coded than unfiltered predictive video blocks. After filtering, the reconstructed video block may be used by prediction module 332 as a reference block to inter-code a block in a subsequent video frame or other CU.
- Although filter unit 349 is shown “in-loop,” the techniques of this disclosure could also be used with post filters, in which case non-filtered data (rather than filtered data) would be used for purposes of predicting data in subsequent CUs.
- filter unit 349 may select sets of filters for each input in a manner that promotes the video quality. For example, filter unit 349 may select sets of filters from pre-defined sets of coefficients, or may adaptively define filters in order to promote video quality or improved compression. Filter unit 349 may select or define one or more sets of filters for a given CU such that the same set(s) of filters are used for pixels of different video blocks of that CU. For a particular frame, slice, or LCU, filter unit 349 may apply several sets of filters to multiple inputs, and FSU 353 may select the set that produces the best quality video or the highest levels of compression.
- FSU 353 may train a new filter by analyzing the auto-correlations and cross-correlations between multiple inputs and an original image.
- a new set of filters may, for example, be determined by solving Wiener-Hopf equations based on the auto- and cross-correlations.
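As a rough illustration of training by auto- and cross-correlation, the sketch below accumulates an autocorrelation matrix R and a cross-correlation vector p for a one-dimensional filter and solves R w = p. The single-input, one-dimensional setting, the function names, and the sample data are all assumptions made for illustration; the disclosure does not specify this procedure.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def train_wiener_taps(degraded, original, num_taps):
    """Least-squares taps that make the filtered degraded signal match the original."""
    half = num_taps // 2
    R = [[0.0] * num_taps for _ in range(num_taps)]  # autocorrelation of the input
    p = [0.0] * num_taps                             # cross-correlation with the original
    for i in range(half, len(degraded) - half):
        window = degraded[i - half:i + half + 1]
        for a in range(num_taps):
            p[a] += window[a] * original[i]
            for b in range(num_taps):
                R[a][b] += window[a] * window[b]
    return solve_linear(R, p)


# Hypothetical data: the "original" is an exactly filtered copy of the input,
# so training should recover the taps that produced it.
degraded = [3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5, 8]
true_taps = [0.25, 0.5, 0.25]
original = list(degraded)
for i in range(1, len(degraded) - 1):
    original[i] = sum(t * degraded[i + k - 1] for k, t in enumerate(true_taps))
taps = train_wiener_taps(degraded, original, 3)
```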
- Regardless of whether a new set of filters is trained or an existing set of filters is selected, filter unit 349 generates syntax for inclusion in the bitstream that enables a decoder to also identify the set or sets of filters to be used for the particular frame or slice.
- filter unit 349 may select which filter from the set of filters is to be used based on two or more metrics that quantify properties associated with one or more sets of pixels within the CU.
- FSU 353 may determine sets of filters for a higher level coded unit such as a frame or slice, while filter unit 349 determines which filter(s) from the set(s) is to be used for a particular pixel of a lower level coded unit based on the two or more metrics associated with the pixels of that lower level coded unit.
- a set of M filters may be used for each input. Depending on design preferences, M may, for example, be as few as 2 or as great as 16, or even higher. A large number of filters per input may improve video quality, but also may increase overhead associated with signaling sets of filters from encoder to decoder.
- the set of M filters can be determined by FSU 353 as described above and signaled to the decoder for each frame or slice.
- a segmentation map can be used to indicate how a CU is segmented and whether or not a particular sub-unit of the CU is to be filtered. The segmentation map may, for example, include for a CU an array of split flags as described above as well as an additional bit signaling whether each sub-CU is to be filtered.
- a specific filter from the set of filters can be chosen based on two or more metrics. Combinations of values for two or more metrics can be indexed to particular filters from the set of M filters.
- FIG. 4A is a conceptual diagram illustrating ranges of values for two metrics indexed to filters from a set of filters.
- the particular example of FIG. 4A shows eight filters (i.e. Filter 1, Filter 2 . . . Filter 8), but more or fewer filters may similarly be used.
- FIG. 4A shows two metrics that might be used for selecting a filter in accordance with the techniques of this disclosure.
- the two metrics may, for example, quantify properties of the pixel data related to non-direction specific activity (e.g. a sum-modified Laplacian value) and direction, direction-specific activity and edge detection, a direction metric and an edge metric, a horizontal activity metric and a vertical activity metric, or two other such metrics.
- three or more metrics might be used, in which case the conceptual diagram of FIG. 4A would include a third dimension for mapping ranges of the metrics to filters from the set of filters.
- a first metric has four ranges (Ranges 1-1, 1-2, 1-3, and 1-4), and a second metric (Metric 2) also has four ranges (Ranges 2-1, 2-2, 2-3, and 2-4). Therefore, the example of FIG. 4A has sixteen combinations of ranges for Metric 1 and Metric 2. As can be seen from FIG. 4A , however, each combination is not necessarily associated with a unique filter. The combination of Range 1-1 and Range 2-1, as well as combinations 1-1 and 2-2, and 1-1 and 2-3, for instance, are all mapped to Filter 1, in the example of FIG. 4A . Filter 4, in contrast, is only mapped to one combination (1-1 and 2-4).
- Range 1-1 may encompass a greater range of values than Range 1-2.
- Although FIG. 4A shows Metric 1 and Metric 2 as having the same number of ranges, the number of ranges for a first metric and the number of ranges for a second metric do not necessarily need to be equal. If, for example, Metric 1 is a variance metric and Metric 2 is a direction metric, Metric 1 might use eight ranges while Metric 2 uses three ranges.
- the ranges of Metric 1 and Metric 2 may represent a continuous spectrum of values. For example, if Metric 1 is a sum-modified Laplacian value, Range 1-2 may correspond to more activity than Range 1-1 but less activity than Range 1-3, and Range 1-4 may correspond to more activity than Range 1-3. Within a range, the amount of activity determined for a particular pixel or group of pixels may similarly increase along the Metric 1 axis. In other examples, the ranges of Metric 1 and Metric 2 may not represent actual ranges but instead may represent discrete determinations.
- Range 2-1 may correspond to a determination of no direction, Range 2-2 to a determination of horizontal direction, Range 2-3 to a determination of vertical direction, and Range 2-4 to a determination of diagonal direction.
- no direction, horizontal direction, vertical direction, and diagonal direction can be discrete determinations, and thus, the ranges for Metric 2 might not represent a continuous spectrum of values in the same way the ranges of Metric 1 do.
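The difference between a continuous metric and a discrete one can be sketched as follows. The threshold values, the label strings, and the function names are hypothetical; the disclosure does not give numeric range boundaries.

```python
from bisect import bisect_right

# Hypothetical boundaries separating four activity ranges (not from the source).
ACTIVITY_THRESHOLDS = [10, 50, 200]

# Discrete direction determinations, indexed in the order the text lists them.
DIRECTION_INDEX = {"none": 1, "horizontal": 2, "vertical": 3, "diagonal": 4}


def activity_range(value, thresholds=ACTIVITY_THRESHOLDS):
    """Return the 1-based range index for a continuous activity value."""
    return bisect_right(thresholds, value) + 1


def classify(activity, direction):
    """Return (Metric 1 range index, Metric 2 range index) for one pixel group."""
    return activity_range(activity), DIRECTION_INDEX[direction]
```

Here the activity index grows with the measured value, while the direction index is only a label; its numeric order carries no meaning.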
- FIG. 4B is a conceptual diagram illustrating ranges of values for an activity metric and a direction metric.
- the direction metric includes three discrete determinations (No Direction, Horizontal, and Vertical). Techniques for determining no direction, horizontal, and vertical as well as techniques for determining activity will be explained in greater detail below.
- the particular example of FIG. 4B shows six filters (i.e. Filter 1, Filter 2 . . . Filter 6), but more or fewer filters may similarly be used.
- the two metrics (activity and direction) create 15 combinations, identified as combinations 421 through 435. In some instances, however, additional combinations not explicitly shown in FIG. 4B may also be used. For example, a combination corresponding to no activity may be a 16th combination that also has a corresponding filter.
- Filter unit 349 can store a mapping of filters to combinations of ranges of two or more metrics, such as the example mappings of FIGS. 4A and 4B , and use the mapping to determine which filter from a set of filters to apply to a particular pixel or group of pixels in a CU.
- the mapping of filters to combinations of ranges of two or more metrics may, for example, be determined by filter unit 349 as part of the filter selection process described above. Regardless of how the mapping is determined, filter unit 349 can generate information allowing a decoder to reconstruct the mapping. This information can be included in the coded bitstream to signal the mapping of combinations of ranges to filters.
- the signaled mapping may map range combinations to filter identifications (IDs). The actual coefficients for a particular filter might be signaled separately.
- filter unit 349 first determines a transmission order for the combinations.
- the transmission order generally refers to the order in which filters will be signaled for combinations of ranges.
- these are just a few of the many transmission orders that are possible.
- filter unit 349 can use a series of codewords to signal the mapping to a decoder. For example, filter unit 349 can generate a first codeword to indicate if a current combination being decoded maps to the same filter as the most recently decoded combination that shares the same range for the first metric. If a current combination being decoded maps to the same filter as the most recently decoded combination that shares the same range for the second metric, then filter unit 349 can generate a second codeword instead of the first codeword.
- filter unit 349 can generate a third codeword, instead of the first codeword or second codeword, that indicates the filter corresponding to the current combination being decoded.
- the first and second codeword of the current example may be relatively short compared to the third codeword.
- the first codeword and second codeword might each be two bits (e.g. 00 and 01, respectively), while the third codeword is more bits (a first bit of 1, plus additional bits).
- a current combination being decoded or a previous combination being decoded refers to the portion of the encoding and decoding processes where the mapping of filters to range combinations is being signaled by an encoder or constructed by a decoder, and not necessarily to a transmission or decoding of the combination itself.
- combination 407 is the combination currently being decoded
- combination 406 is the most recently decoded combination that shares the same range for Metric 1
- combination 403 is the most recently decoded combination that shares the same range for Metric 2.
- filter unit 349 can transmit a second codeword (e.g. 01) to indicate that the current combination being decoded (combination 407) maps to the same filter as the most recently decoded combination that shares the same range for a second metric (combination 403).
- combination 410 is the current combination being decoded
- combination 409 is the most recently decoded combination that shares the same range for Metric 1
- combination 406 is the most recently decoded combination that shares the same range for Metric 2.
- filter unit 349 can transmit a first codeword (e.g. 00) to indicate that the current combination being decoded (combination 410) maps to the same filter (Filter 2) as the most recently decoded combination that shares the same range for a first metric (combination 409).
- filter unit 349 can transmit a third codeword (e.g. 1+additional bits) to indicate that the current combination being decoded (combination 411) maps to a different filter (Filter 3) than both the most recently decoded combination that shares the same range for Metric 1 and the most recently decoded combination that shares the same range for Metric 2.
- combination 409 is the current combination to be decoded
- combination 405 is the most recently decoded combination that shares the same range for Metric 2, but no combination that shares a range for Metric 1 has yet been decoded.
- the most recently decoded combination that shares a range for Metric 1 can be assumed to not map to the same filter as the current combination being decoded.
- the first codeword will not be used for combination 409.
- the combination that shares a range for Metric 1 can be replaced by another combination, such as the most recently decoded combination or a different previously decoded combination.
- the most recently decoded combination before combination 409 would be combination 408.
- filter unit 349 can generate the first codeword. Analogous techniques can be used for those combinations where a previous combination sharing a common range for Metric 1 has not yet been decoded.
- filter unit 349 can generate a codeword indicating the filter that maps to the first combination.
- the filter may, for example, be signaled using the third codeword or may be signaled using a different technique, in which case the techniques described in this disclosure might begin with the second combination in a transmission order or a later combination.
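The three-codeword signaling described above can be sketched as an encoder and decoder pair. Several details here are assumptions: the 4x4 grid layout (each row shares a Metric 1 range, each column shares a Metric 2 range, transmitted row by row starting at combination 401), the filter numbers for combinations 405, 406, 408, and 412, the rule that the first codeword wins when both neighbors match, and the 3-bit filter index in the third codeword.

```python
# Filter number for combinations 401..416, one row per Metric 1 range.
# Entries for 405, 406, 408 and 412 are made up for illustration.
GRID = [
    [5, 6, 7, 8],  # 401-404
    [5, 6, 7, 8],  # 405-408
    [2, 2, 3, 3],  # 409-412
    [1, 1, 1, 4],  # 413-416
]


def encode_mapping(grid):
    """Emit one codeword per combination in transmission order."""
    codewords = []
    for r, row in enumerate(grid):
        for c, filt in enumerate(row):
            same_m1 = row[c - 1] if c > 0 else None      # last one sharing the Metric 1 range
            same_m2 = grid[r - 1][c] if r > 0 else None  # last one sharing the Metric 2 range
            if filt == same_m1:
                codewords.append("00")                   # first codeword
            elif filt == same_m2:
                codewords.append("01")                   # second codeword
            else:
                codewords.append("1" + format(filt - 1, "03b"))  # third codeword
    return codewords


def decode_mapping(codewords, rows, cols):
    """Rebuild the grid from the codewords produced by encode_mapping."""
    grid = [[None] * cols for _ in range(rows)]
    it = iter(codewords)
    for r in range(rows):
        for c in range(cols):
            cw = next(it)
            if cw == "00":
                grid[r][c] = grid[r][c - 1]
            elif cw == "01":
                grid[r][c] = grid[r - 1][c]
            else:
                grid[r][c] = int(cw[1:], 2) + 1
    return grid


codewords = encode_mapping(GRID)
# codewords[6] is combination 407, codewords[9] is 410, codewords[10] is 411.
```

Under these assumptions, combination 407 receives the second codeword, combination 410 the first, and combination 411 the third, matching the examples in the text.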
- filter unit 349 can use a series of codewords to signal the mapping to a decoder.
- filter unit 349 can generate a first codeword to indicate if a current combination being decoded maps to the same filter as the most recently decoded combination that shares the same range for the first metric. If a current combination being decoded does not map to the same filter as the most recently decoded combination that shares that range for the first metric, then filter unit 349 can generate a second codeword, instead of the first codeword, that indicates the filter that maps to the current combination being decoded.
- the first codeword may be relatively short compared to the second codeword.
- the first codeword might be a single bit, while the second codeword is more bits (e.g., a first bit of 1, plus additional bits).
- this technique includes only generating a short codeword if the current combination maps to the same filter as a previously decoded combination that shares the same range for Metric 1.
- filter unit 349 still generates a second codeword (e.g. 1+additional bits).
- filter unit 349 can use a different series of codewords to signal the mapping to a decoder. For example, filter unit 349 can generate a first codeword to indicate if a current combination being decoded maps to the same filter as the most recently decoded combination, regardless of which, if any, range the current combination has in common with the previously decoded combination. If the current combination being decoded does not map to the same filter as the most recently decoded combination, then filter unit 349 can generate a second codeword identifying the filter that maps to the current combination.
- the first codeword may be relatively short compared to the second codeword.
- the first codeword might be one bit (e.g. 0), while the second codeword is more bits (e.g., a first bit of 1, plus additional bits).
- filter unit 349 can generate the first codeword if combination 402 maps to the same filter as combination 401, if combination 403 maps to the same filter as combination 402, etc. Otherwise, filter unit 349 can generate the second codeword identifying the filter that maps to the current combination.
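A minimal sketch of this simpler scheme, which compares each combination only with the one decoded immediately before it, might look like the following. The fixed-width filter index after the leading 1 bit and the function name are assumptions.

```python
def encode_runs(filters, index_bits=3):
    """Emit "0" when the filter repeats, else "1" plus a fixed-width filter index."""
    codewords = []
    previous = None
    for filt in filters:
        if filt == previous:
            codewords.append("0")  # first codeword: same filter as the previous combination
        else:
            # second codeword: identifies the filter explicitly (index width is assumed)
            codewords.append("1" + format(filt - 1, "0{}b".format(index_bits)))
        previous = filt
    return codewords


# Hypothetical filter numbers for five consecutive combinations.
example = encode_runs([5, 5, 6, 6, 6])
```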
- filter unit 349 can use two codewords to signal the mapping of the filters to combinations.
- a first codeword such as a “0”
- a second codeword such as a “1”
- the second codeword does not need to identify a new filter. Instead, the new filter can be determined based on the transmission order for the classes and the order in which filter coefficients are transmitted. Using the left-to-right, bottom-to-top transmission order described above for FIG.
- combinations 421-422 would be mapped to a first filter, combinations 423-427 to a second filter, combinations 428-431 to a third filter, and combinations 432-435 to a fourth filter.
- the coefficients for the first filter, second filter, third filter, and fourth filter can correspond to the order in which sets of filter coefficients are signaled, where the first set of filter coefficients signaled correspond to the first filter, the second set of filter coefficients signaled correspond to the second filter, and so on. Determining an order for transmitting sets of filter coefficients is discussed in more detail below.
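The two-codeword variant, in which a "1" simply advances to the next filter in coefficient transmission order, can be sketched as below. The sketch assumes that the first combination is implicitly mapped to the first filter without a bit of its own; the function name and the bit string are illustrative.

```python
def decode_implicit(first_combination, bits):
    """Map combinations to filter ordinals: "0" keeps the filter, "1" moves to the next."""
    mapping = {first_combination: 1}  # assumed: first combination uses the first filter
    current = 1
    for offset, bit in enumerate(bits, start=1):
        if bit == "1":
            current += 1
        mapping[first_combination + offset] = current
    return mapping


# One bit for each of combinations 422..435, chosen to reproduce the grouping in the text.
mapping = decode_implicit(421, "01000010001000")
```

With this bit string, combinations 421-422 map to the first filter, 423-427 to the second, 428-431 to the third, and 432-435 to the fourth.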
- filter unit 349 may use a first technique. Where both a combination that shares the same range for Metric 1 and a combination that shares the same range for Metric 2 have been decoded (e.g.
- codewords used for any of the first, second, and third codewords described above may be any of fixed length codewords, variable length codewords, or context-adaptive variable length codewords.
- In addition to generating information allowing a decoder to reconstruct the mapping of filters to combinations of ranges, filter unit 349 also generates information allowing a decoder to reconstruct the filters themselves. Reconstructing the filters includes reconstructing the filter coefficients of the filters. As will be described in more detail below, filter unit 349 can use differential coding techniques to signal the filter coefficients. To use differential coding techniques, filter unit 349 determines an order in which to signal the sets of filter coefficients.
- filter unit 349 determines a combination identification (ID) that represents a sequential value for each combination of ranges.
- these are just a few of the many orders that could be used.
- any of the orders described could be either lowest to highest or highest to lowest.
- filter unit 349 can identify groupings of range combinations that are mapped to the same filter. Using FIG. 4A as an example, the groupings would be as follows.
- Filter 1 Group: combinations 413, 414, and 415
- Filter unit 349 can then assign each group a group ID, and the group ID can represent a sequential value.
- the group IDs can be assigned to the groups based on the sequential values associated with the combinations that comprise the group. For example, the group that has the combination with the lowest associated sequential value based on the combination IDs, might be assigned the group ID with the lowest sequential value. Of the remaining groups, the remaining group that has the combination with the lowest associated sequential value can be assigned the group ID with the next lowest sequential value. This process can repeat until all groups have been assigned a group ID.
- group IDs might be assigned based on the combinations with the highest associated sequential values rather than the lowest.
- the group that has the combination with the lowest associated sequential value based on the combination IDs might be assigned the group ID with the highest sequential value, or vice versa.
- filter unit 349 can assign group IDs to the filter groups, as shown below in Table 1.
- filter unit 349 assigns the Filter 5 Group the group ID with the lowest sequential value because the Filter 5 Group includes the range combination with the lowest sequential value (i.e., combination 401).
- Filter unit 349 assigns the Filter 6 Group the group ID with the second lowest sequential value because, of the remaining filter groups (i.e. all the groups excluding the Filter 5 Group), the Filter 6 Group includes the range combination with the second lowest sequential value (i.e., combination 402).
- Filter unit 349 assigns the Filter 7 Group the group ID with the third lowest sequential value because, of the remaining filter groups, the Filter 7 Group includes the range combination with the lowest sequential value (i.e., combination 403).
- Filter unit 349 assigns the Filter 8 Group the group ID with the fourth lowest sequential value because, of the remaining filter groups (i.e. all the filter groups excluding the Filter 5 Group, the Filter 6 Group, and the Filter 7 Group), the Filter 8 Group includes the range combination with the fourth lowest sequential value (combination 404).
- Filter unit 349 assigns the Filter 2 Group the group ID with the fifth lowest sequential value because, of the remaining filter groups, the Filter 2 Group includes the range combination with the lowest sequential value (combination 409).
- Filter unit 349 assigns the Filter 3 Group the group ID with the sixth lowest sequential value because, of the remaining filter groups (i.e. excluding the Filter 5 Group, the Filter 6 Group, the Filter 7 Group, the Filter 8 Group, and the Filter 2 Group), the Filter 3 Group includes the range combination with the lowest sequential value (combination 411).
- Filter unit 349 assigns the Filter 1 Group the group ID with the seventh lowest sequential value because, of the remaining filter groups, the Filter 1 Group includes the range combination with the lowest sequential value (combination 413).
- filter unit 349 assigns the Filter 4 group, the final remaining filter group, the group ID with the highest sequential value (8 in this particular example).
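The group ID assignment walked through above amounts to visiting combinations in sequential order and numbering each filter group the first time one of its combinations appears. The sketch below assumes a combination-to-filter table in which only some entries are stated in the text; the filters for combinations 405, 406, 408, and 412 are made up, and the function name is hypothetical.

```python
# Combination ID -> filter number. Entries for 405, 406, 408 and 412 are assumed.
COMBO_TO_FILTER = {
    401: 5, 402: 6, 403: 7, 404: 8,
    405: 5, 406: 6, 407: 7, 408: 8,
    409: 2, 410: 2, 411: 3, 412: 3,
    413: 1, 414: 1, 415: 1, 416: 4,
}


def assign_group_ids(combo_to_filter):
    """Give each filter group the next ID when its lowest combination is reached."""
    group_ids = {}
    for combo in sorted(combo_to_filter):
        filt = combo_to_filter[combo]
        if filt not in group_ids:
            group_ids[filt] = len(group_ids) + 1
    return group_ids


group_ids = assign_group_ids(COMBO_TO_FILTER)
# Filters listed in order of increasing group ID.
signal_order = sorted(group_ids, key=group_ids.get)
```

The resulting order (Filter 5, 6, 7, 8, 2, 3, 1, 4) agrees with the assignments described in the text.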
- filter unit 349 determines an order in which to signal the filter coefficients of a filter. Again, using the example of FIG. 4A and Table 1, filter unit 349 first signals the coefficient for Filter 5, then the coefficient for Filter 6, then the coefficient for Filter 7, then the coefficient for Filter 8, then the coefficient for Filter 2, then the coefficient for Filter 3, then the coefficient for Filter 1, and finally the coefficient for Filter 4. Using differential coding techniques, as described in this disclosure, filter unit 349 may code the coefficients for Filter 6 as difference information relative to the filter coefficients of Filter 5, code the coefficients for Filter 7 as difference information relative to the filter coefficients for Filter 6, and so on, based on the sequential ordering of Group IDs.
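Differential coding along the group ID order can be sketched as follows: the first coefficient set is sent as is, and each later set is sent as its element-wise difference from the set before it. The coefficient values and function names are hypothetical, and entropy coding of the differences is left out.

```python
def differential_encode(coeff_sets):
    """Keep the first set; replace each later set by its difference from the previous one."""
    encoded = [list(coeff_sets[0])]
    for prev, cur in zip(coeff_sets, coeff_sets[1:]):
        encoded.append([c - p for c, p in zip(cur, prev)])
    return encoded


def differential_decode(encoded):
    """Invert differential_encode by accumulating the differences."""
    decoded = [list(encoded[0])]
    for diff in encoded[1:]:
        decoded.append([p + d for p, d in zip(decoded[-1], diff)])
    return decoded


# Hypothetical coefficient sets, already arranged in group ID order
# (for example Filter 5, then Filter 6, then Filter 7).
ordered_sets = [[5, -2, 10], [6, -2, 9], [6, -1, 9]]
encoded = differential_encode(ordered_sets)
```

When neighboring sets are similar, the differences are small numbers, which is what makes this ordering worthwhile.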
- mapping of two or more metrics for inputs to filters can be implemented in multiple ways. For example, in some implementations each input might have a unique set of filters, while in some implementations inputs share a common set of filters. Additionally, in some implementations, two or more metrics for each input might be used to identify a particular filter for each input. In other implementations, however, two or more metrics for a single input might be used to identify filters for all the inputs. In yet other implementations, two or more metrics for a first input might be used to identify a filter for a second, different input.
- filter unit 349 may perform coding techniques with respect to filter information that may reduce the amount of data needed to encode and convey filter information from encoder 350 to another device. Again, for each frame or slice, filter unit 349 may define or select one or more sets of filter coefficients to be applied to the pixels of CUs for that frame or slice. Filter unit 349 applies the filter coefficients in order to filter video blocks of reconstructed video frames stored in memory 334 , which may be used for predictive coding consistent with in-loop filtering. Filter unit 349 can encode the filter coefficients as filter information, which is forwarded to entropy encoding unit 346 for inclusion in the encoded bitstream.
- filter unit 349 may predictively encode one or more filter coefficients to be used for filtering based on the filter coefficients of another CU, potentially exploiting similarities between the filter coefficients. In some cases, however, it may be more desirable to encode the filter coefficients directly, e.g., without using any prediction.
- Various techniques can be used for efficiently communicating filter coefficients to a decoder. Additionally, symmetry may be imposed so that a subset of coefficients (e.g., 5, −2, 10) known by the decoder can be used to define the full set of coefficients (e.g., 5, −2, 10, 10, −2, 5). Symmetry may be imposed in both the direct and the predictive coding scenarios.
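The symmetry example above can be written as a one-line expansion. The function name is hypothetical, and the sketch covers only the mirrored, even-length case shown in the text.

```python
def expand_symmetric(half):
    """Mirror the signaled half to obtain the full coefficient set."""
    return list(half) + list(reversed(half))


full = expand_symmetric([5, -2, 10])
```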
- video encoder 350 represents an example of a video encoder configured to determine a first metric for a group of pixels within a block of pixels, determine a second metric for the group of pixels, determine a filter based on the first metric and the second metric, and generate a filtered image by applying the filter to the group of pixels.
- Video encoder 350 also represents an example of a video encoder configured to determine a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; determine a second metric for the block of pixels; determine a filter based on the first metric and the second metric; and, generate a filtered image by applying the filter to the block of pixels.
- video encoder 350 also represents an example of a video encoder configured to determine a mapping of range combinations to filters, wherein a range combination comprises a range for a first metric and a range for a second metric, wherein each range combination has a unique range combination identification (ID), wherein each unique range combination ID corresponds to a sequential value for a range combination; assign unique group IDs to groups of range combinations based on the sequential values for the range combinations, wherein each unique group ID corresponds to a sequential value for a group; and, code sets of filter coefficients corresponding to the filters based on the unique group IDs.
- Video encoder 350 can code the sets of filter coefficients by signaling the sets of filter coefficients in a coded bitstream in an order that is selected based on the sequential values of the unique group IDs. Video encoder 350 can signal the sets of filter coefficients using differential coding techniques.
- video encoder 350 also represents an example of a video encoder configured to determine a mapping of range combinations to filters, wherein a range combination comprises a range of values for a first metric and a range of values for a second metric; generate a first codeword if a current range combination is mapped to the same filter as a previous range combination that comprises the same range of values for the first metric; generate a second codeword if a current range combination is mapped to the same filter as a previous range combination that comprises the same range of values for the second metric; and, generate a third codeword if the current range combination is mapped to a different filter than the previous range combination that comprises the same range of values for the first metric and the previous range combination that comprises the same range of values for the second metric.
- Video encoder 350 also represents an example of a video encoder configured to determine a mapping of range combinations to filters, wherein a range combination comprises a range for a first metric and a range for a second metric; generate a first codeword if a current range combination is mapped to the same filter as a previous range combination; and, generate a second codeword if the current range combination is mapped to a different filter than the previous range combination, wherein the second codeword identifies a filter mapped to the current range combination.
- FIG. 5 is a block diagram illustrating an example of a video decoder 560 , which decodes a video sequence that is encoded in the manner described herein.
- the received video sequence may comprise an encoded set of image frames, a set of frame slices, a commonly coded group of pictures (GOPs), or a wide variety of types of series of video blocks that include encoded video blocks and syntax to define how to decode such video blocks.
- Video decoder 560 includes an entropy decoding unit 552 , which performs the reciprocal decoding function of the encoding performed by entropy encoding unit 346 of FIG. 3 .
- entropy decoding unit 552 may perform CAVLC or CABAC decoding, or any other type of entropy decoding used by video encoder 350 .
- Entropy decoded video blocks in a one-dimensional serialized format may be inverse scanned to convert one or more one-dimensional vectors of coefficients back into a two-dimensional block format. The number and size of the vectors, as well as the scan order defined for the video blocks may define how the two-dimensional block is reconstructed.
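Inverse scanning can be sketched as placing each entry of the one-dimensional vector at the position given by the scan order. The 3x3 zig-zag order and the coefficient values below are assumptions for illustration; the disclosure does not fix a particular scan.

```python
# Hypothetical zig-zag scan order for a 3x3 block, as (row, column) positions.
ZIGZAG_3X3 = [
    (0, 0), (0, 1), (1, 0),
    (2, 0), (1, 1), (0, 2),
    (1, 2), (2, 1), (2, 2),
]


def inverse_scan(vector, scan_order, rows, cols):
    """Rebuild a two-dimensional block from a serialized coefficient vector."""
    block = [[0] * cols for _ in range(rows)]
    for value, (r, c) in zip(vector, scan_order):
        block[r][c] = value
    return block


block = inverse_scan([9, 8, 7, 6, 5, 4, 3, 2, 1], ZIGZAG_3X3, 3, 3)
```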
- Entropy decoded prediction syntax may be sent from entropy decoding unit 552 to prediction module 554
- entropy decoded filter information may be sent from entropy decoding unit 552 to filter unit 559 .
- Video decoder 560 also includes a prediction module 554 , an inverse quantization unit 556 , an inverse transform unit 558 , a memory and a summer 564 .
- video decoder 560 also includes a de-blocking filter 557 that filters the output of summer 564 .
- filter unit 559 may receive entropy decoded filter information that includes one or more filters to be applied to one or more inputs.
- de-blocking filter 557 may also receive entropy decoded filter information that includes one or more filters to be applied.
- the filters applied by filter unit 559 may be defined by sets of filter coefficients.
- Filter unit 559 may be configured to generate the sets of filter coefficients based on the filter information received from entropy decoding unit 552 .
- the filter information may include filter description syntax that identifies a maximum number of filters in a set of filters and/or a shape of filters in a set of filters, for example.
- the filter description syntax can be included in a header of a series of video blocks, e.g., an LCU header, a frame header, a slice header, a GOP header, a sequence header, or the like. In other examples, the filter description syntax might be included in a footer or other data structure. Based on the filter description syntax, filter unit 559 can reconstruct the set of filters used at the encoder.
- the filter information may also include additional signaling syntax that signals to the decoder the manner of encoding used for any given set of coefficients.
- the filter information may, for example, also include ranges for two or more metrics for which any given set of coefficients should be used.
- filter unit 559 can filter the pixel values of decoded video blocks based on the one or more sets of filter coefficients and the signaling syntax that includes the ranges for which the different sets of filter coefficients should be used.
- Filter unit 559 may receive in the bitstream one or more syntax elements indicating a set of filters for each frame or slice as well as a mapping of filters to the two or more metrics. For example, if an encoder uses the mapping of ranges for metrics to filters shown in FIG. 4A , then the encoder will either signal this mapping or transmit data to allow filter unit 559 to reconstruct this mapping. Regardless of whether or not this mapping is explicitly signaled, filter unit 559 can maintain the same mapping of filters to combinations of ranges as used by the encoder.
- filter unit 559 generates a mapping based on filter information signaled in the bitstream. Based on this mapping, filter unit 559 can determine groups and assign group IDs to the groups in the same manner described above in relation to filter unit 349. Using these group IDs, filter unit 559 can associate received filter coefficients with the corresponding filters. For each CU within the frame or slice, filter unit 559 can calculate one or more metrics associated with the decoded pixels of a CU for multiple inputs (i.e. PI, EI, pRI, and RI) in order to determine which filter(s) of the set(s) to apply to each input. Alternatively, filter unit 559 may calculate one or more metrics for a single input, such as pRI or RI.
- Filter unit 559 determines which filter to apply based on the metrics determined for a particular pixel or group of pixels. Using a sum-modified Laplacian value and direction as examples for Metric 1 and Metric 2 and using the mappings shown in FIG. 4A as an example, if filter unit 559 determines that a pixel or group of pixels has a sum-modified Laplacian value in Range 1-2 and a direction corresponding to Range 2-3, then filter unit 559 can apply Filter 2 to that pixel or group of pixels.
- If filter unit 559 determines that a pixel or group of pixels has a sum-modified Laplacian value in Range 1-4 and a direction corresponding to Range 2-2, then filter unit 559 can apply Filter 6 to that pixel or group of pixels, and so on.
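The lookup described above can be sketched as a table keyed by a pair of range IDs. This is a minimal illustration rather than the mapping of FIG. 4A itself: only the two combinations named in the text are filled in, and the names `FILTER_MAP` and `select_filter` are hypothetical.

```python
# Hypothetical partial mapping of (Metric 1 range, Metric 2 range) to a filter.
# Only the two combinations named in the text are included; the full mapping
# of FIG. 4A is not reproduced here.
FILTER_MAP = {
    ("Range 1-2", "Range 2-3"): "Filter 2",
    ("Range 1-4", "Range 2-2"): "Filter 6",
}


def select_filter(metric1_range, metric2_range):
    """Return the filter mapped to a range combination, or None if unknown."""
    return FILTER_MAP.get((metric1_range, metric2_range))
```

Because the encoder and decoder hold the same table, only the metrics need to be recomputed at the decoder to arrive at the same filter choice.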
- the filter may generally assume any type of filter support shape or arrangement.
- the filter support refers to the shape of the filter with respect to a given pixel being filtered, and the filter coefficients may define weighting applied to neighboring pixel values according to the filter support.
- syntax data may be included in the bitstream to signal to the decoder how the filters were encoded (e.g., how the filter coefficients were encoded), as well as the ranges of the activity metric for which the different filters should be used.
- Prediction module 554 receives prediction syntax (such as motion vectors) from entropy decoding unit 552 . Using the prediction syntax, prediction module 554 generates the prediction blocks that were used to code video blocks. Inverse quantization unit 556 performs inverse quantization, and inverse transform unit 558 performs inverse transforms to change the coefficients of the residual video blocks back to the pixel domain. Adder 564 combines each prediction block with the corresponding residual block output by inverse transform unit 558 in order to reconstruct the video block.
- Filter unit 559 generates the filter coefficients to be applied for each input of a CU, and then applies such filter coefficients in order to filter the reconstructed video blocks of that CU.
- the filtering may comprise additional deblock filtering that smoothes edges and/or eliminates artifacts associated with video blocks, denoise filtering to reduce quantization noise, or any other type of filtering that can improve coding quality.
- the filtered video blocks are accumulated in memory 562 in order to reconstruct decoded frames (or other decodable units) of video information.
- the decoded units may be output from video decoder 560 for presentation to a user, but may also be stored for use in subsequent predictive decoding.
- filtering can be applied via a post-filter, in which case the filtered frame is not used for prediction of future frames.
- filtering can be applied “in-loop,” in which case the filtered frame may be used to predict future frames.
- a desirable filter can be designed by minimizing the error between the original signal and the decoded filtered signal.
- filtering has been based on applying one or more filters to a reconstructed image. For example, a deblocking filter might be applied to a reconstructed image prior to the image being stored in memory, or a deblocking filter and one additional filter might be applied to a reconstructed image prior to the image being stored in memory.
- the coefficients of filter h(k,l) may be quantized as:
- Filter h(k,l) is intended to generically represent any filter. For example, filter h(k,l) could be applied to any one of multiple inputs. In some instances multiple inputs associated with a video block will utilize different filters, in which case multiple filters similar to h(k,l) may be quantized and de-quantized as described above.
- the quantized filter coefficients are encoded and sent from a source device associated with encoder 350 to a destination device associated with decoder 560 as part of an encoded bitstream.
- the value of normFact is usually equal to $2^n$, although other values could be used. Larger values of normFact lead to more precise quantization such that the quantized filter coefficients f(k, l) provide better performance. However, larger values of normFact may produce coefficients f(k, l) that require more bits to signal to the decoder.
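The trade-off between precision and coefficient magnitude can be illustrated with a short sketch. The quantization equation itself is not reproduced in the text above, so the sketch assumes the form f(k,l) = round(normFact · h(k,l)) with normFact = 2^n; the function names are hypothetical.

```python
def quantize_coefficients(h, n):
    """Quantize real-valued coefficients h using normFact = 2**n.

    Assumed form: f(k, l) = round(normFact * h(k, l)).
    """
    norm_fact = 2 ** n
    return [[int(round(norm_fact * c)) for c in row] for row in h]


def dequantize_coefficients(f, n):
    """Recover approximate real-valued coefficients from quantized values."""
    norm_fact = 2 ** n
    return [[c / norm_fact for c in row] for row in f]


h = [[0.1, 0.8, 0.1]]
coarse = quantize_coefficients(h, 4)  # small integers, larger error
fine = quantize_coefficients(h, 8)    # larger integers, smaller error
```

With n = 4 the quantized values are small but the reconstruction error is larger; with n = 8 the error shrinks while the integers to be signaled grow.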
- K and L may represent integers.
- K and L may define a block of pixels that spans two dimensions from −K to K and from −L to L. Filters applied to other inputs can be applied in an analogous manner.
- the techniques of this disclosure may improve the performance of a post-filter or in-loop filter, and may also reduce the number of bits needed to signal filter coefficients f(k, l).
- a number of different post-filters or in-loop filters are signaled to the decoder for each series of video blocks, e.g., for each frame, slice, portion of a frame, group of frames (GOP), or the like.
- the frames may be identified by frame number and/or frame type (e.g., I-frames, P-frames or B-frames).
- I-frames refer to intra-frames that are intra-predicted.
- P-frames refer to predictive frames that have video blocks predicted based on one list of data (e.g., one previous frame).
- B-frames refer to bidirectional predictive frames that are predicted based on two lists of data (e.g., a previous and subsequent frame).
- Macroblocks can be identified by listing macroblock types and/or a range of quantization parameter (QP) values used to reconstruct the macroblock.
- Filter coefficients f(k,l) for any input may be coded using prediction from coefficients signaled for previous CUs.
- the encoder may encode and transmit a set of M filters:
- For each filter, the bitstream may also be encoded to identify the combination of ranges for two or more metrics for which the filter should be used.
- the filter coefficients can be predicted using reconstructed filter coefficients used in a previous CU.
- the previous filter coefficients may be represented as:
- the number of the CU n may be used to identify one or more filters used for prediction of the current filters, and the number n may be sent to the decoder as part of the encoded bitstream.
- information can be encoded and transmitted to the decoder to identify combinations of ranges for two or more metrics for which predictive coding is used.
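Predictive coding of coefficients can be sketched as sending only the difference from a previously reconstructed filter. The equations for the current and previous coefficient sets are not reproduced above, so this is a simplified illustration that assumes a plain element-wise difference; the function names are hypothetical.

```python
def encode_residual(current, previous):
    """Return the element-wise difference between current and previous coefficients."""
    return [[c - p for c, p in zip(cur_row, prev_row)]
            for cur_row, prev_row in zip(current, previous)]


def decode_from_residual(residual, previous):
    """Rebuild the current coefficients from the residual and the previous filter."""
    return [[r + p for r, p in zip(res_row, prev_row)]
            for res_row, prev_row in zip(residual, previous)]


previous = [[1, -2, 1], [-2, 18, -2], [1, -2, 1]]   # filter from an earlier CU
current = [[1, -3, 1], [-3, 20, -3], [1, -3, 1]]    # filter for the current CU
residual = encode_residual(current, previous)
```

When consecutive filters are similar, the residual values are small and therefore cheaper to signal than the coefficients themselves.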
- the amplitude of the filter coefficients g(k, l) depends on the values of k and l. Usually, the coefficient with the largest amplitude is the coefficient g(0,0). The other coefficients expected to have large amplitudes are those for which the value of k or l is equal to 0. This phenomenon may be utilized to further reduce the number of bits needed to signal the coefficients.
- the index values k and l may define locations within a known filter support.
- parameterized variable length codes such as Golomb or exp-Golomb codes defined according to a parameter p.
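As an illustration of a parameterized variable length code, the sketch below implements the standard order-p exp-Golomb code for non-negative integers. How the parameter p is chosen per coefficient position, and how signed coefficients are mapped to non-negative values, is not specified here and is left out; the function names are hypothetical.

```python
def exp_golomb_encode(value, p=0):
    """Encode a non-negative integer as an order-p exp-Golomb bit string."""
    if value < 0 or p < 0:
        raise ValueError("value and p must be non-negative")
    shifted = value + (1 << p)
    bits = bin(shifted)[2:]
    # The prefix of zeros tells the decoder how many bits follow.
    prefix = "0" * (len(bits) - p - 1)
    return prefix + bits


def exp_golomb_decode(bits, p=0):
    """Decode a single order-p exp-Golomb codeword given as a bit string."""
    zeros = len(bits) - len(bits.lstrip("0"))
    length = zeros + p + 1
    return int(bits[zeros:zeros + length], 2) - (1 << p)
```

A larger p shortens the codewords for large values at the cost of longer codewords for small ones, which is why a position-dependent p can suit coefficients whose expected amplitude depends on k and l.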
- video decoder 560 represents an example of a video decoder configured to determine a first metric for a group of pixels within a block of pixels, determine a second metric for the group of pixels, determine a filter based on the first metric and the second metric, and generate a filtered image by applying the filter to the group of pixels.
- Video decoder 560 also represents an example of a video decoder configured to determine a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; determine a second metric for the block of pixels; determine a filter based on the first metric and the second metric; and generate a filtered image by applying the filter to the block of pixels.
- video decoder 560 also represents an example of a video decoder configured to determine a mapping of range combinations to filters, wherein a range combination comprises a range for a first metric and a range for a second metric, wherein each range combination has a unique range combination identification (ID), wherein each unique range combination ID corresponds to a sequential value for a range combination; assign unique group IDs to groups of range combinations based on the sequential values for the range combinations, wherein each unique group ID corresponds to a sequential value for a group; and code sets of filter coefficients corresponding to the filters based on the unique group IDs.
- Video decoder 560 can code the sets of filter coefficients by generating the sets of filter coefficients based on information received in a coded bitstream. Video decoder 560 can generate the sets of filter coefficients using differential coding techniques.
- Video decoder 560 also represents an example of a video decoder configured to map a first range combination to a first filter, wherein the first range combination comprises a first range of values for a first metric and a first range of values for a second metric; map a second range combination to a second filter, wherein the second range combination comprises a second range of values for the first metric and a second range of values for the second metric; map a current range combination to a filter, wherein the current range combination comprises the first range of values of the first metric and the second range of values for the second metric.
- Mapping the current range combination to the filter can include mapping the current range combination to the first filter in response to receiving a first codeword, wherein the first codeword indicates the current range combination is mapped to the same filter as the first range combination; mapping the current range combination to the second filter in response to receiving a second codeword, wherein the second codeword indicates the current range combination is mapped to the same filter as the second range combination; and mapping the current range combination to a third filter in response to receiving a third codeword, wherein the third codeword identifies that third filter.
- Video decoder 560 also represents an example of a video decoder configured to generate a mapping of range combinations to filters, wherein a range combination comprises a range for a first metric and a range for a second metric; map a current range combination to a same filter as a previous range combination in response to receiving a first codeword signaling the current range combination is mapped to the same filter as the previous range combination; and, map the current range combination to a filter identified by a second codeword in response to receiving the second codeword signaling the current range combination is mapped to a different filter than the previous range combination.
- activity metrics that quantify activity associated with one or more blocks of pixels within the video data.
- Activity metrics can comprise variance metrics indicative of pixel variance within a set of pixels.
- some of these activity metrics are direction-specific. For example, a horizontal activity metric quantifies activity along a horizontal axis, a vertical activity metric quantifies activity along a vertical axis, a diagonal activity metric quantifies activity along a diagonal axis, and so on.
- k represents a value of a summation of pixel values from −K to K and l represents a value of a summation from −L to L for a two-dimensional window that spans from −K to K and −L to L
- i and j represent pixel coordinates of the pixel data
- RI(i,j) represents a given pixel value at coordinates i and j
- var(i,j) is the activity metric (i.e. the sum-modified Laplacian value).
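The symbols defined above can be tied together in a short sketch. Equation 1 itself is not reproduced in the text above, so the sketch assumes the commonly used form of the sum-modified Laplacian: for each pixel in the window, the absolute second difference along each axis is accumulated. The function name, the `ri[i][j]` indexing, and the default window size are assumptions.

```python
def sum_modified_laplacian(ri, i, j, K=1, L=1):
    """Sum-modified Laplacian var(i, j) over a (2K+1) x (2L+1) window.

    Assumes every sample touched (the window plus a one-pixel border) lies
    inside `ri`, a 2-D list indexed as ri[i][j]. No bounds checking is done.
    """
    total = 0
    for k in range(-K, K + 1):
        for l in range(-L, L + 1):
            a, b = i + k, j + l
            center = 2 * ri[a][b]
            # Absolute second difference along the first axis.
            total += abs(center - ri[a - 1][b] - ri[a + 1][b])
            # Absolute second difference along the second axis.
            total += abs(center - ri[a][b - 1] - ri[a][b + 1])
    return total


flat = [[10] * 5 for _ in range(5)]
spike = [[0] * 5 for _ in range(5)]
spike[2][2] = 4
```

A flat region yields zero activity, while an isolated spike produces a large value, which is the behaviour expected of a variance-like metric.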
- Equations 2 and 3 show examples of how horizontal activity and vertical activity can be computed for a current pixel (x, y) by comparing a pixel value (Rec), such as intensity, of the current pixel to a pixel value of neighboring pixels.
- $\mathrm{Hor\_act}(x,y) = R\big(2 \cdot \mathrm{Rec}[x][y] - \mathrm{Rec}[x+1][y] - \mathrm{Rec}[x-1][y]\big)$ (2)
- $\mathrm{Ver\_act}(x,y) = R\big(2 \cdot \mathrm{Rec}[x][y] - \mathrm{Rec}[x][y+1] - \mathrm{Rec}[x][y-1]\big)$ (3)
- As shown by equation 2, when determining horizontal activity, the current pixel (x, y) can be compared to a left neighbor (x−1, y) and a right neighbor (x+1, y). As shown by equation 3, when determining vertical activity, the current pixel can be compared to an upper neighbor (x, y+1) and a lower neighbor (x, y−1).
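The per-pixel comparisons of equations 2 and 3 can be sketched as follows. The text does not define the function R, so this sketch assumes it is an absolute value; `rec` is a 2-D list indexed as `rec[x][y]`, and the caller is assumed to keep all four neighbors in bounds.

```python
def hor_act(rec, x, y):
    """Horizontal activity: current pixel versus its left and right neighbors."""
    return abs(2 * rec[x][y] - rec[x + 1][y] - rec[x - 1][y])


def ver_act(rec, x, y):
    """Vertical activity: current pixel versus its upper and lower neighbors."""
    return abs(2 * rec[x][y] - rec[x][y + 1] - rec[x][y - 1])


# Pixel values that change only along x (rec[x][y] = x * x).
rec = [[0, 0, 0], [1, 1, 1], [4, 4, 4]]
```

For this example the horizontal activity at (1, 1) is non-zero while the vertical activity is zero, since the values vary only along x.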
- Equations 4 and 5 show examples of how diagonal activity can be computed for a current pixel (x, y) by comparing a pixel value (Rec) of the current pixel to pixel values of neighboring pixels.
- Equations 2-5 illustrate how horizontal activity, vertical activity, and diagonal activity can be determined on a pixel-by-pixel basis, but in some implementations, horizontal activity, vertical activity, and diagonal activity may be determined on a group-by-group basis, where a group of pixels is a 2×2, 4×4, or M×N block of pixels.
- horizontal activity, for example, can be determined by comparing pixel values of a current group to pixel values of a left group and a right group, in an analogous manner to equation 2; and the vertical activity can be determined by comparing a current group to an upper group and a lower group, in an analogous manner to equation 3.
- 45-degree diagonal activity can be determined by comparing a current group of pixels to an upper-right neighboring group and a lower-left neighboring group, in an analogous manner to equation 4.
- 135-degree diagonal activity can be determined by comparing a current group of pixels to an upper-left neighboring group and a lower-right neighboring group, in an analogous manner to equation 5.
- horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity can be determined by comparing a current pixel or group of pixels to neighboring pixels or groups of pixels in only one direction. For example, instead of determining horizontal activity based on comparing a current pixel to a left neighbor and a right neighbor, horizontal activity might be determined based on only a left neighbor or only a right neighbor. Additionally, in some implementations, horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity may be determined using averages or weighted averages of areas of neighboring pixels instead of single neighboring pixels or single groups of pixels.
- the values resulting from equations 2-5 can be divided into a finite number of ranges, such as 2, 4, 8, or any other finite number, and each range can be assigned a range identification.
- Range 1-1, Range 1-2, Range 2-1, etc. are all examples of range identifications.
- horizontal activity values can be divided into four ranges, and the ranges might be assigned IDs Range 1-1, Range 1-2, Range 1-3, and Range 1-4.
- Horizontal threshold values (i.e., ThH_1, ..., ThH_{P−1})
- Table 2 below shows the generic case of how horizontal IDs might be assigned to P ranges.
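Assigning a range ID from a list of thresholds can be sketched with a binary search. This is an illustration under stated assumptions, not a reproduction of Table 2: the threshold values are made up, the function name is hypothetical, and treating each threshold as the inclusive lower bound of the next range is a choice made for this sketch.

```python
from bisect import bisect_right


def range_id(value, thresholds, metric_index=1):
    """Return a label such as 'Range 1-3' for `value`.

    `thresholds` is the ascending list ThH_1 .. ThH_{P-1}, which yields P ranges.
    """
    return f"Range {metric_index}-{bisect_right(thresholds, value) + 1}"


# Three hypothetical thresholds divide the activity values into four ranges.
thresholds = [10, 20, 30]
```

With P − 1 thresholds there are P possible labels, matching the four-range example of Range 1-1 through Range 1-4.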
- any of horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity can be used as a metric in accordance with the multi-metric filtering techniques described in this disclosure.
- Metric 1 might be a measure of vertical activity
- Metric 2 might be a measure of horizontal activity.
- a filter unit, such as filter unit 349 of FIG. 4A or filter unit 559 of FIG. 5, can determine a filter for a pixel or group of pixels based on the horizontal activity of the pixel or group of pixels and the vertical activity of the pixel or group of pixels.
- if, for example, a current pixel has a horizontal activity metric that falls in Range 2-3 and a vertical activity metric that falls in Range 1-3, then the filter unit filters the pixel using Filter 4.
- combinations of 45-degree diagonal activity and 135-degree diagonal activity, 45-degree diagonal activity and horizontal activity, 45-degree diagonal activity and vertical activity, 135-degree diagonal activity and horizontal activity, or 135-degree diagonal activity and vertical activity may also be used by a filter unit for selecting a filter for a pixel or group of pixels.
- three or all four of horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity may be used by a filter unit for selecting a filter of a pixel or group of pixels.
- horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity can all be used as metrics, as Metric 1 and/or Metric 2 in FIG. 4A , for example.
- horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity might not be metrics themselves, but instead can be used as intermediate determinations for determining an overall direction metric.
- the direction metric generally describes in which direction (e.g. no direction, horizontal, vertical, 45-degree diagonal, or 135-degree diagonal) the pixels are changing the most.
- a direction for a pixel might be determined based on the following conditions:
- Constants, k1 and k2 can be selected such that the direction is only deemed to be direction 1 or direction 2 if horizontal activity is substantially greater than vertical activity or vertical activity is substantially greater than horizontal activity. If horizontal activity and vertical activity are equal or approximately equal, then the direction is direction 0.
- Direction 1 generally indicates that the pixel values are changing more in the horizontal direction than in the vertical direction
- direction 2 indicates that pixel values are changing more in the vertical direction than in the horizontal direction
- Direction 0 indicates that the change in pixel values in the horizontal direction is approximately equal to the change in pixel values in the vertical direction.
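The three-way direction decision can be sketched as below. The conditions themselves are not reproduced in the text above, so the sketch assumes the comparison implied by the description: direction 1 when horizontal activity exceeds k1 times vertical activity, direction 2 when vertical activity exceeds k2 times horizontal activity, and direction 0 otherwise. The default values of k1 and k2 are hypothetical.

```python
def direction(hor_activity, ver_activity, k1=2.0, k2=2.0):
    """Return 1 (horizontal), 2 (vertical), or 0 (no dominant direction)."""
    if hor_activity > k1 * ver_activity:
        return 1
    if ver_activity > k2 * hor_activity:
        return 2
    return 0
```

Choosing k1 and k2 greater than 1 means that roughly equal activities fall through to direction 0 instead of flipping between directions 1 and 2 on small differences.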
- the determined direction metric (e.g. direction 0, direction 1, direction 2) can be used as a metric in the multi-metric filtering techniques described in this disclosure.
- Metric 1 might be a variance metric, such as a sum-modified Laplacian value
- Metric 2 might be a direction determination as described above.
- each of direction 1, direction 2, and direction 0 can be associated with a range of Metric 2 even though direction 1, direction 2, and direction 0 represent finite determinations instead of a spectrum of values.
- techniques of this disclosure also include using 45-degree diagonal activity and 135-degree diagonal activity, as described in equations 4 and 5, to determine directions, based on the following conditions:
- Direction determinations based on 45-degree diagonal activity and 135-degree diagonal activity can be used as a metric with another metric, such as a sum-modified Laplacian value, as described above.
- a direction metric may also be determined, based on the following conditions:
- k1 through k12 are constants selected to determine how much greater one of horizontal activity, vertical activity, 45-degree activity, and 135-degree activity needs to be compared to the others in order for a certain direction to be selected.
- Direction determinations based on horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity can be used as a metric with another metric, such as a sum-modified Laplacian value, as described above.
- An edge metric generally quantifies activity that might be indicative of the presence of an edge in a block of pixels.
- An edge may occur, for example, in a block of pixels if that block of pixels contains the boundary of an object within an image.
- One example of edge detection includes using a current pixel's four neighboring pixels (e.g., left, right, top, bottom) or using the current pixel's eight neighboring pixels (left, right, top, bottom, top right, top left, bottom right, bottom left).
- edge type detection may include using two neighboring pixels, such as top and bottom, left and right, top left and bottom right, or top right and bottom left.
- the pseudo code below shows examples of how edge information can be computed for a current pixel (x, y) by comparing a pixel value (Rec), such as intensity, of the current pixel to the pixel values of those neighboring pixels (i.e., 4/8 pixels).
- An EdgeType variable is initialized to 0. Each time a statement is true, the EdgeType variable is either incremented by 1 (as shown in the pseudo code by EdgeType++) or decremented by 1 (as shown in the pseudo code by EdgeType--).
- Rec[x][y] refers to a pixel value, such as the pixel intensity, of the pixel located at (x, y).
- the first grouping of “if” statements is for comparing the current pixel to top, bottom, left, and right neighbors.
- the second grouping of “if” statements is for comparing the current pixel to the top-left, top-right, bottom-left, and bottom-right neighbors.
- the techniques of this disclosure can be implemented using either group or both groups.
- If a current pixel is a local maximum, then the pixel value of the pixel will be greater than all its neighbors and will have an edge type of 4 if using four neighbors or an edge type of 8 if using eight neighbors. If a current pixel is a local minimum, then the pixel value of the pixel will be less than all its neighbors and will have an edge type of −4 if using four neighbors or an edge type of −8 if using eight neighbors.
- an edge type between −4 and 4 or between −8 and 8 can be used in determining a filter. The values determined for the edge type (i.e., values of −4 to 4 or values of −8 to 8) can be mapped to ranges of a metric, such as Metric 1 or Metric 2 of FIG. 4A.
- absolute values of the edge type determination might be mapped to ranges, such that edge types of −3 and 3, for example, would map to the same filter.
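The edge type computation described above can be sketched as follows. The pseudo code referred to in the text is not reproduced here, so this is a reconstruction from the prose: the counter starts at 0, is incremented when the current pixel is greater than a neighbor, and is decremented when it is less. The function name and the `use_diagonals` flag are hypothetical, and `rec` is indexed as `rec[x][y]` with all neighbors assumed in bounds.

```python
def edge_type(rec, x, y, use_diagonals=False):
    """Return the edge type of pixel (x, y) using four or eight neighbors."""
    offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    if use_diagonals:
        offsets += [(-1, -1), (1, -1), (-1, 1), (1, 1)]
    current = rec[x][y]
    edge = 0
    for dx, dy in offsets:
        neighbor = rec[x + dx][y + dy]
        if current > neighbor:
            edge += 1
        elif current < neighbor:
            edge -= 1
    return edge


local_max = [[0, 0, 0], [0, 9, 0], [0, 0, 0]]
local_min = [[9, 9, 9], [9, 0, 9], [9, 9, 9]]
```

Taking `abs(edge_type(...))` before the range lookup gives the variant in which edge types of −3 and 3 select the same filter.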
- the calculations of the various metrics described in this disclosure are only intended to be examples and are not exhaustive.
- the metrics can be determined using windows or lines of pixels that include more neighboring pixels than described in this disclosure.
- the metrics described in this disclosure may be calculated using sub-sampling of the pixels in a particular line or window. For example, to calculate a block activity metric for a 4×4 block of pixels, metrics for activity and direction can be calculated as follows:
- $\mathrm{Ver\_act}(i,j) = \mathrm{abs}\big((X(i,j) \mathbin{<<} 1) - X(i,j-1) - X(i,j+1)\big)$
- $\mathrm{Hor\_act}(i,j) = \mathrm{abs}\big((X(i,j) \mathbin{<<} 1) - X(i-1,j) - X(i+1,j)\big)$
- Hor_act (i, j) generally refers to the horizontal activity of current pixel (i, j)
- Ver_act(i, j) generally refers to the vertical activity of current pixel (i, j).
- X(i, j) generally refers to a pixel value of pixel (i, j).
- H_B refers to the horizontal activity of the 4×4 block, which in this example is determined based on a sum of horizontal activity for pixels (0, 0), (0, 2), (2, 0), and (2, 2).
- V_B refers to the vertical activity of the 4×4 block, which in this example is determined based on a sum of vertical activity for pixels (0, 0), (0, 2), (2, 0), and (2, 2).
- <<1 represents a multiply by two operation.
- a direction can be determined. Using the example above, if the value of H_B is more than k times the value of V_B, then the direction can be determined to be direction 1 (i.e. horizontal), which might correspond to more horizontal activity than vertical activity. If the value of V_B is more than k times the value of H_B, then the direction can be determined to be direction 2 (i.e. vertical), which might correspond to more vertical activity than horizontal activity. Otherwise, the direction can be determined to be direction 0 (i.e. no direction), meaning neither horizontal nor vertical activity is dominant.
- the labels for the various directions and the ratios used to determine the directions merely constitute one example, as other labels and ratios can also be used.
- Activity (L_B) for the 4×4 block can be determined as a sum of the horizontal and vertical activity.
- the value of L_B can be classified into a range, as described above. This particular example shows five ranges although more or fewer ranges may similarly be used.
- a filter for the 4×4 block of pixels can be selected. As described above, a filter may be selected based on a two-dimensional mapping of activity and direction to filters, as described in reference to FIGS. 4A and 4B, or activity and direction may be combined into a single metric, and that single metric may be used to select a filter.
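The sub-sampled block metrics described above can be put together in one sketch. It assumes `X` is a 2-D list indexed as `X[i][j]` with i horizontal, that the block origin `(i0, j0)` leaves at least one pixel of margin on every side, and that k = 2; the value of k, the function names, and the tuple return format are hypothetical. The five-range classification of the block activity is omitted because its thresholds are not given here.

```python
SUBSET = [(0, 0), (0, 2), (2, 0), (2, 2)]  # four of the sixteen pixels


def ver_act(X, i, j):
    return abs((X[i][j] << 1) - X[i][j - 1] - X[i][j + 1])


def hor_act(X, i, j):
    return abs((X[i][j] << 1) - X[i - 1][j] - X[i + 1][j])


def block_metrics(X, i0, j0, k=2):
    """Return (H_B, V_B, L_B, direction) for the 4x4 block at origin (i0, j0)."""
    h_b = sum(hor_act(X, i0 + di, j0 + dj) for di, dj in SUBSET)
    v_b = sum(ver_act(X, i0 + di, j0 + dj) for di, dj in SUBSET)
    if h_b > k * v_b:
        d = 1  # horizontal
    elif v_b > k * h_b:
        d = 2  # vertical
    else:
        d = 0  # no direction
    return h_b, v_b, h_b + v_b, d


# 6x6 picture whose values vary only along i (X[i][j] = i * i).
picture = [[i * i for _ in range(6)] for i in range(6)]
```

Only four horizontal and four vertical activity values are computed per block instead of sixteen of each, which is the complexity saving that sub-sampling is meant to provide.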
- FIG. 6A represents a 4×4 block of pixels. Using the sub-sampling techniques described above, only four of the sixteen pixels are used. The four pixels are pixel (0, 0), which is labeled as pixel 601, pixel (2, 0), which is labeled as pixel 602, pixel (0, 2), which is labeled as pixel 603, and pixel (2, 2), which is labeled as pixel 604.
- the horizontal activity of pixel 601 (i.e., Hor_act(0, 0)) is determined based on a left neighboring pixel and a right neighboring pixel.
- the right neighboring pixel is labeled as pixel 605.
- the left neighboring pixel is located in a different block than the 4×4 block and is not shown in FIG. 6A.
- the vertical activity of pixel 602 (i.e., Ver_act(2, 0)), for example, is determined based on an upper neighboring pixel and a lower neighboring pixel.
- the lower neighboring pixel is labeled as pixel 606.
- the upper neighboring pixel is located in a different block than the 4×4 block and is not shown in FIG. 6A.
- a block activity metric may also be calculated using a different subset of pixels as follows:
- $\mathrm{Ver\_act}(i,j) = \mathrm{abs}\big((X(i,j) \mathbin{<<} 1) - X(i,j-1) - X(i,j+1)\big)$
- $\mathrm{Hor\_act}(i,j) = \mathrm{abs}\big((X(i,j) \mathbin{<<} 1) - X(i-1,j) - X(i+1,j)\big)$
- This different subset of pixels for calculating H_B and V_B includes pixels (1, 1), (2, 1), (1, 2), and (2, 2), shown in FIG. 6B as pixels 611, 612, 613, and 614, respectively.
- pixels 611, 612, 613, and 614 are located within the 4×4 block.
- pixels 611, 612, 613, and 614 are all located in the interior of the block as opposed to being located on the block boundary. Pixels 601, 602, 603, and 605 in FIG. 6A and pixels 621, 624, 625, and 628 in FIG. 6C are examples of pixels located on the block boundary.
- additional different subsets of pixels may be chosen. For example, subsets may be selected such that upper and lower neighboring pixels for the pixels of the subset are within the 4×4 block, but some left and right neighboring pixels are in neighboring blocks. Subsets may also be selected such that left and right neighboring pixels for the pixels of the subset are within the 4×4 block, but some upper and lower neighboring pixels are in neighboring blocks.
- a block activity metric may also be calculated using a subset of eight pixels as follows:
- $\mathrm{Ver\_act}(i,j) = \mathrm{abs}\big((X(i,j) \mathbin{<<} 1) - X(i,j-1) - X(i,j+1)\big)$
- $\mathrm{Hor\_act}(i,j) = \mathrm{abs}\big((X(i,j) \mathbin{<<} 1) - X(i-1,j) - X(i+1,j)\big)$
- This different subset of eight pixels for calculating H_B and V_B includes pixels (0, 1), (1, 1), (2, 1), (3, 1), (0, 2), (1, 2), (2, 2), and (3, 2), shown in FIG. 6C as pixels 621, 622, 623, 624, 625, 626, 627, and 628, respectively.
- As can be seen in FIG. 6C, pixels 621, 622, 623, 624, 625, 626, 627, and 628 are located within the 4×4 block, although pixels 621 and 625 each have left neighboring pixels in a left neighboring block and pixels 624 and 628 each have right neighboring pixels in a right neighboring block.
- This particular selection of pixels may reduce encoder and/or decoder complexity by avoiding the need for a line buffer for storing pixel values of upper and/or lower neighboring blocks.
- line buffers for pixel values of upper and lower neighboring blocks often need to store pixel values for the entire upper or lower line, which in the case of 1080p video, for example, might be 1920 pixels.
- Line buffers for left and right neighboring blocks often only need to store pixel values for one LCU or a couple of LCUs, which might only be 64 or 128 pixels, for example.
- line buffers for pixel values of upper and lower neighboring blocks may need to be significantly larger than line buffers used for pixel values of left and right neighboring blocks.
- the selection of pixels shown in FIG. 6C may be able to avoid the use of line buffers for pixel values of upper and lower neighboring blocks, thus reducing coding complexity.
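The line buffer argument can be checked by counting, for each subset, how many neighbors fall outside the 4×4 block. This is an illustrative helper, not part of the described techniques; pixels are (i, j) pairs with i horizontal and j vertical, and j − 1 is taken as the upper neighbor to match the description of FIG. 6A.

```python
def outside_neighbors(subset, size=4):
    """Count neighbors of `subset` pixels that lie outside a size x size block."""
    counts = {"left": 0, "right": 0, "upper": 0, "lower": 0}
    for i, j in subset:
        counts["left"] += int(i - 1 < 0)
        counts["right"] += int(i + 1 >= size)
        counts["upper"] += int(j - 1 < 0)
        counts["lower"] += int(j + 1 >= size)
    return counts


FIG_6A = [(0, 0), (2, 0), (0, 2), (2, 2)]
FIG_6B = [(1, 1), (2, 1), (1, 2), (2, 2)]
FIG_6C = [(0, 1), (1, 1), (2, 1), (3, 1), (0, 2), (1, 2), (2, 2), (3, 2)]
```

For the FIG. 6C subset the upper and lower counts are both zero, so no pixel values from the blocks above or below are needed, while the FIG. 6A subset reaches into both the left and the upper neighboring blocks.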
- FIGS. 6A-6C merely introduce techniques of this disclosure. It is contemplated that these techniques can be extended to blocks other than just 4×4 and that different subsets of pixels may be selected.
- quantized pixels (i.e., X(i,j)>>N)
- calculations can be absolute difference based instead of Laplacian based.
- absolute differences can be used instead of Laplacian values, as follows:
- $\mathrm{Ver\_act}(i,j) = \mathrm{abs}\big(X(i,j) - X(i,j-1)\big)$
- $\mathrm{Hor\_act}(i,j) = \mathrm{abs}\big(X(i,j) - X(i-1,j)\big)$
- The sub-sampling techniques above have been described with reference to a limited group of specific metrics. It is contemplated, however, that these sub-sampling techniques are generally applicable to other metrics, such as the other metrics discussed in this disclosure, that may be used for purposes of determining a filter. Additionally, although the sub-sampling techniques of this disclosure have been described with reference to 4×4 blocks of pixels, the techniques may also be applicable to blocks of other sizes.
- FIG. 7 is a flow diagram illustrating a video coding technique consistent with this disclosure.
- the techniques described in FIG. 7 can be performed by the filter unit of a video encoder or a video decoder, such as filter unit 349 of video encoder 350 or filter unit 559 of video decoder 560 .
- the filter unit determines a first metric for a group of pixels within a block of pixels ( 710 ).
- the first metric may, for example, be an activity metric such as a sum-modified Laplacian value, or the first metric may be a direction metric.
- the first metric may be determined, for example, based on a comparison of the set of pixels in the block, or based on a subset of the pixels in the block, to other pixels in the block.
- the filter unit further determines a second metric for the block ( 720 ).
- the second metric may, for example, be a direction metric that is determined based on comparing a measure of horizontal activity to a measure of vertical activity.
- the filter unit determines a filter ( 730 ).
- the filter unit generates a filtered image by applying the filter to the block ( 740 ).
- the block may be a 2×2, 4×4, or M×N block of pixels, used for determining the first metric or the second metric.
- the first metric may be a horizontal activity metric while the second metric is a vertical activity metric, or the first metric may be an edge metric while the second metric is a direction metric.
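- As a non-limiting illustration, steps 710 through 730 of FIG. 7 might be sketched in Python as follows. The sketch assumes a sum-modified Laplacian of the form |2X − left − right| + |2X − up − down| evaluated over the four interior pixels of a 4×4 block; the threshold values, the `ratio` parameter, and the filter table are hypothetical. Applying the selected filter (740) is not shown.

```python
def laplacian_sums(block, pixels):
    """Sum 1-D Laplacian magnitudes over `pixels`, separately per direction."""
    hor = ver = 0
    for r, c in pixels:
        hor += abs(2 * block[r][c] - block[r][c - 1] - block[r][c + 1])
        ver += abs(2 * block[r][c] - block[r - 1][c] - block[r + 1][c])
    return hor, ver


def direction_metric(hor, ver, ratio=2):
    """0: no dominant direction, 1: horizontal activity dominates, 2: vertical."""
    if hor > ratio * ver:
        return 1
    if ver > ratio * hor:
        return 2
    return 0


def determine_filter(block, filters, activity_edges=(8, 32)):
    interior = [(1, 1), (1, 2), (2, 1), (2, 2)]  # subset of a 4x4 block
    hor, ver = laplacian_sums(block, interior)
    activity = hor + ver                          # first metric (710)
    direction = direction_metric(hor, ver)        # second metric (720)
    activity_range = sum(activity >= e for e in activity_edges)
    return filters[(activity_range, direction)]   # filter choice (730)


# Hypothetical table: 3 activity ranges x 3 direction values -> filter IDs 0..8.
filters = {(a, d): 3 * a + d for a in range(3) for d in range(3)}
edge_block = [[0, 0, 10, 10] for _ in range(4)]  # pixel values change left to right
flat_block = [[7] * 4 for _ in range(4)]         # no activity at all
print(determine_filter(edge_block, filters), determine_filter(flat_block, filters))
```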
- FIG. 8A is a flow diagram illustrating video coding techniques consistent with this disclosure.
- the techniques described in FIG. 8A can be performed by the filter unit of a video decoder, such as filter unit 559 of video decoder 560 .
- Filter unit 559 maps a first range combination to a first filter ( 810 A).
- the first range combination is a combination of a first range of values for a first metric and a first range of values for a second metric.
- the first metric may, for example, be a sum-modified Laplacian value and the second metric may be a direction metric, although other metrics may also be used.
- Filter unit 559 maps a second range combination to a second filter ( 820 A).
- the second range combination is a combination of a second range of values for the first metric and a second range of values for the second metric.
- Filter unit 559 then maps a current range combination to a filter based on a received codeword.
- the current range combination includes the first range of values of the first metric and the second range of values for the second metric. If the codeword is a first codeword ( 830 A, yes), then filter unit 559 maps the current range combination to the first filter ( 840 A). The first codeword indicates the current range combination is mapped to the same filter as the first range combination. If the codeword is a second codeword ( 850 A, yes), the filter unit 559 maps the current range combination to the second filter ( 860 A).
- the second codeword indicates the current range combination is mapped to the same filter as the second range combination. If the codeword is neither a first codeword nor a second codeword ( 850 A, no), then filter unit 559 maps the current range combination to a third filter ( 870 A), for example in response to receiving a third codeword, wherein the third codeword identifies that third filter.
- the first codeword and the second codeword may each include fewer bits than the third codeword.
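- As a non-limiting illustration, the decoder-side decision of FIG. 8A might be sketched in Python as follows. The bit patterns ("0" for the first codeword, "10" for the second, and "11" followed by a fixed-length filter ID for the third) and the three-bit ID length are assumptions made for this sketch only; this disclosure does not fix a particular binarization.

```python
def read_codeword(bits, pos, id_bits=3):
    """Parse one codeword starting at bits[pos].

    Returns (kind, filter_id, next_pos); filter_id is None unless kind == "third".
    """
    if bits[pos] == "0":
        return "first", None, pos + 1
    if bits[pos:pos + 2] == "10":
        return "second", None, pos + 2
    filter_id = int(bits[pos + 2:pos + 2 + id_bits], 2)
    return "third", filter_id, pos + 2 + id_bits


def map_current_combination(bits, pos, first_filter, second_filter):
    """Return (filter for the current range combination, next bit position)."""
    kind, filter_id, next_pos = read_codeword(bits, pos)
    if kind == "first":         # same filter as the first range combination (840A)
        return first_filter, next_pos
    if kind == "second":        # same filter as the second range combination (860A)
        return second_filter, next_pos
    return filter_id, next_pos  # explicitly identified third filter (870A)


print(map_current_combination("11101", 0, first_filter=4, second_filter=6))
```

- Under these assumed bit patterns, the first and second codewords cost one and two bits, while the third costs five, consistent with the shorter codewords being reserved for the reuse cases.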
- FIG. 8B is a flow diagram illustrating video coding techniques consistent with this disclosure.
- the techniques described in FIG. 8B can be performed by the filter unit of a video decoder, such as filter unit 559 of video decoder 560 .
- Filter unit 559 generates a mapping of range combinations to filters ( 810 B).
- Each range combination, for example, can include a range for a first metric and a range for a second metric.
- filter unit 559 maps the current range combination to the same filter as the previous range combination ( 830 B).
- filter unit 559 maps the current range combination to a new filter ( 840 B).
- the current range combination can be determined based on a known transmission order.
- the new filter can be identified based on the second codeword, while in other examples, the new filter might be determined based on the order in which filter coefficients are signaled.
- FIG. 9A is a flow diagram illustrating video coding techniques consistent with this disclosure.
- the techniques described in FIG. 9A can be performed by the filter unit of a video encoder, such as filter unit 349 of video encoder 350 .
- Filter unit 349 determines a mapping of range combinations to filters ( 910 A). Each range combination includes a range of values for a first metric and a range of values for a second metric. For a current range combination, if the current range combination is mapped to the same filter as a previous range combination that comprises the same range of values for the first metric ( 920 A, yes), then filter unit 349 generates a first codeword ( 930 A).
- If the current range combination is mapped to the same filter as a previous range combination that comprises the same range of values for the second metric ( 940 A, yes), then filter unit 349 generates a second codeword ( 950 A). If the current range combination is not mapped to either the previous range combination that comprises the same range of values for the first metric or the previous range combination that comprises the same range of values for the second metric ( 950 A, no), then filter unit 349 generates a third codeword ( 960 A). The third codeword can identify a filter mapped to the current range combination.
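- As a non-limiting illustration, the encoder-side decisions of FIG. 9A might be sketched in Python as follows. The sketch assumes the mapping is held as a grid `mapping[i][j]`, where `i` indexes the range of the first metric and `j` indexes the range of the second metric, and that combinations are visited row by row; the symbol names are hypothetical placeholders for the codewords.

```python
def encode_mapping(mapping):
    """Turn a grid of filter IDs into a list of codeword symbols."""
    symbols = []
    for i, row in enumerate(mapping):
        for j, filt in enumerate(row):
            if j > 0 and filt == row[j - 1]:
                # Previous combination with the same first-metric range (930A).
                symbols.append(("FIRST",))
            elif i > 0 and filt == mapping[i - 1][j]:
                # Previous combination with the same second-metric range (950A).
                symbols.append(("SECOND",))
            else:
                # No reuse possible: identify the filter explicitly (960A).
                symbols.append(("THIRD", filt))
    return symbols


mapping = [
    [0, 0, 1],
    [0, 2, 1],
]
for symbol in encode_mapping(mapping):
    print(symbol)
```

- In this example only three of the six range combinations need an explicit filter identification; the remaining three are signaled by reference to a neighboring combination.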
- FIG. 9B is a flow diagram illustrating video coding techniques consistent with this disclosure.
- the techniques described in FIG. 9B can be performed by the filter unit of a video encoder, such as filter unit 349 of video encoder 350 .
- Filter unit 349 determines a mapping of range combinations to filters ( 910 B). Each range combination can, for example, include a range for a first metric and a range for a second metric.
- filter unit 349 can generate a first codeword to signal that the current range combination is mapped to the same filter as a previous range combination ( 930 B).
- filter unit 349 can generate a second codeword ( 940 B).
- the second codeword can identify the filter mapped to the current range combination.
- the current range combination can be determined based on a known transmission order.
- the first codeword may include fewer bits than the second codeword.
- the terms “first codeword,” “second codeword,” and “third codeword” are used to differentiate between different codewords and are not meant to imply a sequential ordering of codewords.
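- As a non-limiting illustration, the simpler scheme of FIGS. 8B and 9B, in which a first codeword signals reuse of the filter of the previous range combination in a known transmission order, might be sketched in Python as follows. The symbol names "same" and "new" and both function names are hypothetical.

```python
def encode_same_as_previous(filter_ids):
    """Encoder side (FIG. 9B): one symbol per range combination, in order."""
    symbols = []
    previous = None
    for filt in filter_ids:
        if filt == previous:
            symbols.append(("same",))      # first codeword (930B)
        else:
            symbols.append(("new", filt))  # second codeword identifies the filter (940B)
        previous = filt
    return symbols


def decode_same_as_previous(symbols):
    """Decoder side (FIG. 8B): rebuild the filter ID for each range combination."""
    filter_ids = []
    for symbol in symbols:
        if symbol[0] == "same":
            filter_ids.append(filter_ids[-1])  # reuse previous filter (830B)
        else:
            filter_ids.append(symbol[1])       # map to a new filter (840B)
    return filter_ids


ids = [3, 3, 3, 1, 1, 4]
symbols = encode_same_as_previous(ids)
print(symbols)
print(decode_same_as_previous(symbols) == ids)
```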
- FIG. 10 is a flow diagram illustrating video coding techniques consistent with this disclosure.
- the techniques described in FIG. 10 can be performed by the filter unit of a video encoder, such as filter unit 349 of video encoder 350 , or the filter unit of a video decoder, such as filter unit 559 .
- the filter unit determines a mapping of range combinations to filters ( 1010 ).
- the range combinations include a range for a first metric and a range for a second metric.
- the filter unit determines a unique range combination identification (ID) for each range combination ( 1020 ).
- the unique range combination IDs correspond to sequential values.
- the filter unit assigns a first unique group ID to a first group of range combinations based on the sequential value of a range combination ID of at least one range combination in the first group of range combinations ( 1030 ).
- the groups of range combinations include range combinations mapped to the same filter, and the unique group IDs correspond to a set of sequential values.
- the filter unit codes a first set of filter coefficients corresponding to the same filter based on the sequential value of the first unique group ID ( 1040 ).
- coding the first set of filter coefficients can include, for example, signaling the filter coefficients in an encoded bitstream using differential coding techniques.
- coding the first set of filter coefficients can include reconstructing the filter coefficients based on information received in an encoded bitstream.
- FIG. 11 is a flow diagram illustrating video coding techniques consistent with this disclosure.
- the techniques described in FIG. 11 can be performed by the filter unit of a video encoder, such as filter unit 349 of video encoder 350 , or the filter unit of a video decoder, such as filter unit 559 .
- the filter unit determines a mapping of range combinations to filters ( 1110 ).
- the range combinations can include a range for a first metric and a range for a second metric.
- Each range combination can have a unique range combination identification (ID), and each unique range combination ID can correspond to a sequential value for the range combination.
- the filter unit can assign a unique group ID to each group of range combinations ( 1120 ).
- the filter unit can assign the unique group IDs, for example, based on the sequential values of the range combinations.
- a group of range combinations can include range combinations mapped to a common filter, and the unique group IDs can correspond to a set of sequential values.
- the filter unit can code sets of filter coefficients for the filters based on the unique group IDs ( 1140 ).
- the filter unit can assign the unique group IDs by, for example, assigning a unique group ID corresponding to a lowest sequential value of the unique group IDs to a group of range combinations that comprises a range combination with a range combination ID corresponding to a lowest sequential value of the range combination IDs.
- the filter unit can assign the unique group ID corresponding to a highest sequential value of the unique group IDs to a group of range combinations that comprises a range combination with a range combination ID corresponding to a highest sequential value of the range combination IDs.
- the filter unit can code the sets of filter coefficients by generating the sets of filter coefficients based on information received in a coded bitstream.
- the filter unit can, for example, generate the sets of filter coefficients using differential coding techniques.
- the filter unit can code the sets of filter coefficients by signaling the sets of filter coefficients in a coded bitstream in an order selected based on the sequential values of the unique group IDs.
- the filter unit can, for example, signal the sets of filter coefficients using differential coding techniques.
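- As a non-limiting illustration, one way of assigning group IDs in the manner of FIGS. 10 and 11 might be sketched in Python as follows. The sketch adopts the option in which the group containing the lowest range combination ID receives the lowest group ID, and extends it by numbering every group in order of its lowest member; that extension, the function name, and the filter labels are assumptions made for this sketch only.

```python
def assign_group_ids(filter_of_combination):
    """Map each filter label to a group ID.

    filter_of_combination[k] is the filter label for range combination ID k.
    Groups are numbered in the order in which their first (lowest-ID)
    range combination appears.
    """
    group_id = {}
    for filt in filter_of_combination:
        if filt not in group_id:
            group_id[filt] = len(group_id)
    return group_id


# Range combination IDs 0..4 mapped to three filters.
labels = ["B", "B", "A", "C", "A"]
groups = assign_group_ids(labels)
print(groups)

# Order in which the coefficient sets would be coded under this assumption.
coding_order = sorted(groups, key=groups.get)
print(coding_order)
```

- Because encoder and decoder can both derive this numbering from the mapping alone, the order in which coefficient sets are signaled need not be transmitted separately under this assumption.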
- the disclosure generally describes sets of filters being signaled on a per-frame or per-slice basis, but sets of filters may also be signaled on a per-sequence basis, per-group of picture basis, per-group of slices basis, per-CU basis, per-LCU basis, or other such basis.
- filters may be signaled for any grouping of one or more CUs.
- there may be numerous filters per input per CU, numerous coefficients per filter, and numerous different levels of variance with each of the filters being defined for a different range of variance.
- filter information such as filter description syntax may be signaled on a frame-by-frame basis or slice-by-slice basis while other filter information such as filter coefficients are signaled on an LCU-by-LCU basis. Syntax at other levels of the coding hierarchy, such as sequence level, GOP-level, or other levels could also be defined for conveying some or all of such filter information.
- Each of the filters for each input may include many coefficients.
- the filters comprise two-dimensional filters with 81 different coefficients defined for a filter support that extends in two-dimensions.
- the number of filter coefficients that are signaled for each filter may be fewer than 81 in some cases.
- Coefficient symmetry, for example, may be imposed such that filter coefficients in one dimension or quadrant may correspond to inverted or symmetric values relative to coefficients in other dimensions or quadrants. Coefficient symmetry may allow for 81 different coefficients to be represented by fewer coefficients, in which case the encoder and decoder may assume that inverted or mirrored values of coefficients define other coefficients.
- the coefficients (5, −2, 10, 10, −2, 5) may be encoded and signaled as the subset of coefficients (5, −2, 10).
- the decoder may know that these three coefficients define the larger symmetric set of coefficients (5, −2, 10, 10, −2, 5).
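- As a non-limiting illustration, the one-dimensional mirror symmetry of this example might be sketched in Python as follows. Both function names are hypothetical, and the sketch covers only the mirrored case, not inverted values or two-dimensional quadrant symmetry.

```python
def signaled_subset(coefficients):
    """Encoder side: keep the first half of a mirror-symmetric coefficient list."""
    half = len(coefficients) // 2
    if list(coefficients[:half]) != list(reversed(coefficients[-half:])):
        raise ValueError("coefficients are not mirror-symmetric")
    return list(coefficients[:len(coefficients) - half])


def expand_symmetric(subset):
    """Decoder side: rebuild an even-length symmetric set by mirroring."""
    return list(subset) + list(reversed(subset))


full = [5, -2, 10, 10, -2, 5]
subset = signaled_subset(full)
print(subset)
print(expand_symmetric(subset))
```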
- the techniques of this disclosure may be implemented in a wide variety of devices or apparatuses, including a wireless handset, an integrated circuit (IC) or a set of ICs (i.e., a chip set). Any components, modules or units have been described to emphasize functional aspects and do not necessarily require realization by different hardware units.
- the techniques described herein may be implemented in hardware, software, firmware, or any combination thereof. If implemented in hardware, any features described as modules, units or components may be implemented together in an integrated logic device or separately as discrete but interoperable logic devices. If implemented in software, the techniques may be realized at least in part by a computer-readable medium comprising instructions that, when executed in a processor, perform one or more of the methods described above.
- the computer-readable medium may comprise a computer-readable storage medium and may form part of a computer program product, which may include packaging materials.
- the computer-readable storage medium may comprise random access memory (RAM) such as synchronous dynamic random access memory (SDRAM), read-only memory (ROM), non-volatile random access memory (NVRAM), electrically erasable programmable read-only memory (EEPROM), FLASH memory, magnetic or optical data storage media, and the like.
- the techniques additionally, or alternatively, may be realized at least in part by a computer-readable communication medium that carries or communicates code in the form of instructions or data structures and that can be accessed, read, and/or executed by a computer.
- the code may be executed by one or more processors, such as one or more digital signal processors (DSPs), general purpose microprocessors, application specific integrated circuits (ASICs), field programmable logic arrays (FPGAs), or other equivalent integrated or discrete logic circuitry.
- processors may refer to any of the foregoing structure or any other structure suitable for implementation of the techniques described herein.
- the functionality described herein may be provided within dedicated software modules or hardware modules configured for encoding and decoding, or incorporated in a combined video codec. Also, the techniques could be fully implemented in one or more circuits or logic elements.
Description
- This application claims priority to
- U.S. Provisional Application No. 61/445,967, filed 23 Feb. 2011;
- U.S. Provisional Application No. 61/448,771, filed 3 Mar. 2011;
- U.S. Provisional Application No. 61/473,713, filed 8 Apr. 2011;
- U.S. Provisional Application No. 61/476,260, filed 16 Apr. 2011;
- U.S. Provisional Application No. 61/478,287, filed 22 Apr. 2011;
- U.S. Provisional Application No. 61/503,426, filed 30 Jun. 2011;
- U.S. Provisional Application No. 61/503,434, filed 30 Jun. 2011;
- U.S. Provisional Application No. 61/503,440, filed 30 Jun. 2011;
- U.S. Provisional Application No. 61/527,463, filed 25 Aug. 2011;
- U.S. Provisional Application No. 61/531,571, filed 6 Sep. 2011;
- the entire contents of each of which are herein incorporated by reference.
- This disclosure relates to block-based digital video coding used to compress video data and, more particularly, to techniques for the filtering of video blocks.
- Digital video capabilities can be incorporated into a wide range of devices, including digital televisions, digital direct broadcast systems, wireless communication devices such as radio telephone handsets, wireless broadcast systems, personal digital assistants (PDAs), laptop computers, desktop computers, tablet computers, digital cameras, digital recording devices, video gaming devices, video game consoles, and the like. Digital video devices implement video compression techniques, such as MPEG-2, MPEG-4, or ITU-T H.264/MPEG-4, Part 10, Advanced Video Coding (AVC), to transmit and receive digital video more efficiently. Video compression techniques perform spatial and temporal prediction to reduce or remove redundancy inherent in video sequences. New video standards, such as the High Efficiency Video Coding (HEVC) standard being developed by the “Joint Collaborative Team—Video Coding” (JCTVC), which is a collaboration between MPEG and ITU-T, continue to emerge and evolve. This new HEVC standard is also sometimes referred to as H.265.
- Block-based video compression techniques may perform spatial prediction and/or temporal prediction. Intra-coding relies on spatial prediction to reduce or remove spatial redundancy between video blocks within a given unit of coded video, which may comprise a video frame, a slice of a video frame, or the like. In contrast, inter-coding relies on temporal prediction to reduce or remove temporal redundancy between video blocks of successive coding units of a video sequence. For intra-coding, a video encoder performs spatial prediction to compress data based on other data within the same unit of coded video. For inter-coding, the video encoder performs motion estimation and motion compensation to track the movement of corresponding video blocks of two or more adjacent units of coded video.
- A coded video block may be represented by prediction information that can be used to create or identify a predictive block, and a residual block of data indicative of differences between the block being coded and the predictive block. In the case of inter-coding, one or more motion vectors are used to identify the predictive block of data from a previous or subsequent coding unit, while in the case of intra-coding, the prediction mode can be used to generate the predictive block based on data within the CU associated with the video block being coded. Both intra-coding and inter-coding may define several different prediction modes, which may define different block sizes and/or prediction techniques used in the coding. Additional types of syntax elements may also be included as part of encoded video data in order to control or define the coding techniques or parameters used in the coding process.
- After block-based prediction coding, the video encoder may apply transform, quantization and entropy coding processes to further reduce the bit rate associated with communication of a residual block. Transform techniques may comprise discrete cosine transforms (DCTs) or conceptually similar processes, such as wavelet transforms, integer transforms, or other types of transforms. In a discrete cosine transform process, as an example, the transform process converts a set of pixel difference values into transform coefficients, which may represent the energy of the pixel values in the frequency domain. Quantization is applied to the transform coefficients, and generally involves a process that limits the number of bits associated with any given transform coefficient. Entropy coding comprises one or more processes that collectively compress a sequence of quantized transform coefficients.
- Filtering of video blocks may be applied as part of the encoding and decoding loops, or as part of a post-filtering process on reconstructed video blocks. Filtering is commonly used, for example, to reduce blockiness or other artifacts common to block-based video coding. Filter coefficients (sometimes called filter taps) may be defined or selected in order to promote desirable levels of video block filtering that can reduce blockiness and/or improve the video quality in other ways. A set of filter coefficients, for example, may define how filtering is applied along edges of video blocks or other locations within video blocks. Different filter coefficients may cause different levels of filtering with respect to different pixels of the video blocks. Filtering, for example, may smooth or sharpen differences in intensity of adjacent pixel values in order to help eliminate unwanted artifacts.
- This disclosure describes techniques associated with filtering of video data in a video encoding and/or video decoding process. In accordance with this disclosure, filtering is applied at an encoder, and filter information is encoded in the bitstream to enable a decoder to identify the filtering that was applied at the encoder. The decoder receives encoded video data that includes the filter information, decodes the video data, and applies filtering based on the filtering information. In this way, the decoder applies the same filtering that was applied at the encoder. According to the techniques of this disclosure, on a frame-by-frame, slice-by-slice, or LCU-by-LCU basis, an encoder may select one or more sets of filters, and on a coded-unit-by-coded-unit basis, the encoder may determine whether or not to apply filtering. For the coded units (CUs) that are to be filtered, the encoder can perform filtering on a pixel-by-pixel or group-by-group basis, where a group might, for example, be a 2×2 block of pixels or a 4×4 block of pixels.
- In one example, a method of video coding includes determining a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; based on the first metric, determining a filter for the block of pixels; and, generating a filtered image by applying the filter to the block of pixels.
- In another example, a video coding device includes a filter unit configured to determine a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block, determine a filter for the block of pixels based on the first metric, and generate a filtered image by applying the filter to the block of pixels; and a memory configured to store a filtered result of the filter unit.
- In another example, a video coding apparatus includes means for determining a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; means for determining a filter for the block of pixels based on the first metric; and, means for generating a filtered image by applying the filter to the block of pixels.
- In another example, a computer-readable storage medium stores instructions that when executed cause one or more processors to determine a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; determine a filter for the block of pixels based on the first metric; and, generate a filtered image by applying the filter to the block of pixels.
- The details of one or more examples are set forth in the accompanying drawings and the description below. Other features, objects, and advantages will be apparent from the description and drawings, and from the claims.
- FIG. 1 is a block diagram illustrating an exemplary video encoding and decoding system.
- FIGS. 2A and 2B are conceptual diagrams illustrating an example of quadtree partitioning applied to a largest coding unit (LCU).
- FIGS. 2C and 2D are conceptual diagrams illustrating an example of a filter map for a series of video blocks corresponding to the example quadtree partitioning of FIGS. 2A and 2B.
- FIG. 3 is a block diagram illustrating an exemplary video encoder consistent with this disclosure.
- FIG. 4A is a conceptual diagram illustrating a mapping of ranges for two metrics to filters.
- FIG. 4B is a conceptual diagram illustrating a mapping of ranges for an activity metric and a direction metric to filters.
- FIG. 5 is a block diagram illustrating an exemplary video decoder consistent with this disclosure.
- FIGS. 6A, 6B, and 6C show conceptual diagrams of a 4×4 block of pixels.
- FIG. 7 is a flow diagram illustrating coding techniques consistent with this disclosure.
- FIGS. 8A and 8B are flow diagrams illustrating coding techniques consistent with this disclosure.
- FIGS. 9A and 9B are flow diagrams illustrating coding techniques consistent with this disclosure.
- FIG. 10 is a flow diagram illustrating coding techniques consistent with this disclosure.
- FIG. 11 is a flow diagram illustrating coding techniques consistent with this disclosure.
- This disclosure describes techniques associated with filtering of video data in a video encoding and/or video decoding process. In accordance with this disclosure, filtering is applied at an encoder, and filter information is encoded in the bitstream to enable a decoder to identify the filtering that was applied at the encoder. The decoder receives encoded video data that includes the filter information, decodes the video data, and applies filtering based on the filtering information. In this way, the decoder applies the same filtering that was applied at the encoder. According to the techniques of this disclosure, on a frame-by-frame, slice-by-slice, or LCU-by-LCU basis, an encoder may select one or more sets of filters, and on a coded-unit-by-coded-unit basis, the encoder may determine whether or not to apply filtering. For the coded units (CUs) that are to be filtered, the encoder can perform filtering on a pixel-by-pixel or group-by-group basis, where a group might, for example, be a 2×2 block of pixels or a 4×4 block of pixels.
- According to the techniques of this disclosure, video data can be coded in units referred to as coded units (CUs). CUs can be partitioned into smaller CUs, or sub-units, using a quadtree partitioning scheme. Syntax identifying the quadtree partitioning scheme for a particular CU can be transmitted from an encoder to a decoder. Multiple inputs associated with each sub-unit of a given CU can be filtered during the process of decoding and reconstructing the encoded video data. According to the techniques of this disclosure, filter description syntax can describe a set of filters, such as how many filters are in the set or what shape the filters take. Additional syntax in the bitstream received by the decoder can identify the filters (i.e., the filter coefficients) used at the encoder for a particular sub-unit. The filter used for a particular input can be selected based on two or more metrics, where certain combinations of values for the two or more metrics are indexed to specific filters within a set of filters. In other instances, two or more metrics may be combined to form a single metric. The mapping of filters to metrics can also be signaled in the bitstream.
- Different types of filtering may be applied to pixels or blocks of pixels based on two or more metrics determined for the video data. The filter used for a particular pixel can be selected based on two or more metrics, such as some combination of an activity metric and a direction metric. An activity metric, for example, may quantify activity associated with one or more blocks of pixels within the video data. The activity metric may comprise a variance metric indicative of pixel variance within a set of pixels. An activity metric may be either direction-specific or non-direction-specific. For example, a non-direction-specific activity metric may include a sum-modified Laplacian value, as explained in greater detail below.
- Examples of direction-specific activity metrics include a horizontal activity metric, a vertical activity metric, a 45-degree activity metric, and a 135-degree activity metric. A direction metric may for a block of pixels quantify any of the horizontal activity, vertical activity, or diagonal activity of a pixel or group of pixels, or a direction metric may include a comparison of horizontal activity, vertical activity, and/or diagonal activity, where horizontal activity generally refers to changes in pixel values in a horizontal direction, vertical activity generally refers to changes in pixel values in a vertical direction, and diagonal activity generally refers to changes in pixel values in a diagonal direction.
- According to techniques of this disclosure, when determining a filter for a block of pixels, a subset of pixels within the block may be used to reduce encoding and decoding complexity. For example, when determining a filter for a 4×4 block of pixels, it may not be necessary to use all sixteen pixels of the 4×4 block. Additionally, according to techniques of this disclosure, the subset of pixels from within a current block being coded can be selected such that the metrics are calculated only using pixel values of the current block and not pixel values of neighboring blocks. For instance, the metric for a pixel being evaluated might be calculated based on comparing the pixel to nearby pixels. In some instances, one or more of the nearby pixels for the pixel being evaluated might be in a different block than the pixel being evaluated. In other instances, however, one or more of the nearby pixels for the pixel might be in the same block as the pixel. According to techniques of this disclosure, the subset of pixels can be selected to include pixels that do not have nearby pixels in neighboring blocks. Additionally or alternatively, the subset of pixels may include pixels that have nearby pixels in neighboring blocks, but those nearby pixels in neighboring blocks may not be used when determining the metric. By basing the determination of a particular metric on pixels within a current block and not on pixels of neighboring blocks, the need for buffers at the encoder and/or decoder may, in some instances, be reduced or even eliminated.
- In some instances, according to techniques of this disclosure, the subset of pixels from within a current block being coded can be selected such that the metrics are calculated only using pixel values of the current block and left and right neighboring blocks but not pixel values of upper neighboring blocks or lower neighboring blocks. As a result of the raster scan order used when coding video blocks, line buffers for upper and lower neighboring blocks tend to need to store far more pixel values than line buffers for storing pixel values of left and right neighboring blocks.
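- As a non-limiting illustration, the two subset selections described above might be sketched in Python as follows. The sketch assumes that a metric compares each evaluated pixel to its immediate upper, lower, left, and right neighbors; the function name and the `allow_horizontal_neighbors` flag are hypothetical.

```python
def usable_pixels(size=4, allow_horizontal_neighbors=False):
    """List (row, col) positions whose required neighbors are available.

    With allow_horizontal_neighbors=False, every neighbor must lie inside the
    current block. With True, pixels of left and right neighboring blocks may
    be read, but pixels of upper and lower neighboring blocks may not.
    """
    pixels = []
    for r in range(size):
        for c in range(size):
            rows_ok = 0 < r < size - 1
            cols_ok = allow_horizontal_neighbors or 0 < c < size - 1
            if rows_ok and cols_ok:
                pixels.append((r, c))
    return pixels


print(usable_pixels())                                 # current block only
print(usable_pixels(allow_horizontal_neighbors=True))  # plus left/right blocks
```

- For a 4×4 block, the first selection evaluates four of the sixteen pixels and the second evaluates eight, in both cases without reading rows that belong to upper or lower neighboring blocks.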
- According to the techniques of this disclosure, a filter unit, such as an adaptive in-loop filter, can be configured to utilize multiple filters based on multi-metric filter mapping. The multiple filters may be used in conjunction with a single input or multiple inputs. As will be described in more detail below, the multiple inputs described in this disclosure generally refer to intermediate video block data or image data that is produced during the encoding and decoding processes. Multiple inputs associated with a given video block can include, for example, a reconstructed block or image (RI), a pre-deblocked reconstructed block or image (pRI), a prediction block or image (PI), and/or a quantized prediction error image (EI). In a single input scheme, a filter may only be applied to one of the inputs above, such as RI. Also, as explained in greater detail below, the filtering techniques of this disclosure can be applied to CUs of various sizes using a quadtree partitioning scheme. By utilizing multiple filters with multi-metric filter mapping for CUs partitioned using a quadtree partitioning scheme, video coding performance, as measured by one or both of compression rate and reconstructed video quality, might be improved.
- To implement the multi-metric filtering techniques described above, an encoder maintains, by generating, updating, storing, or other means, a mapping of combinations of ranges to filters. As one example, the combination of a first range for a first metric and a first range for a second metric may map to a first filter. The combination of the first range for the first metric and a second range for the second metric may also map to the first filter or may map to a second filter. If a first metric has eight ranges and a second metric has four ranges, for example, then the first and second metric can have thirty-two combinations of ranges, and each of the thirty-two combinations can be mapped to a filter. Each combination, however, is not necessarily mapped to a unique filter. Thus, the thirty-two combinations might map to four filters, eight filters, ten filters, or some other number of filters. In order to apply the same filters as an encoder, a decoder may also maintain the same mappings of range combinations to filters.
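- As a non-limiting illustration, a mapping of thirty-two range combinations to a smaller number of filters might be sketched in Python as follows. The threshold values in `ACTIVITY_EDGES`, the treatment of the second metric as a value already in the range 0 to 3, and the rule that assigns filter IDs are all hypothetical.

```python
from bisect import bisect_right

ACTIVITY_EDGES = [4, 8, 16, 32, 64, 128, 256]  # 7 edges -> 8 ranges (hypothetical)
NUM_DIRECTION_RANGES = 4                        # second metric taken as 0..3

# 8 x 4 = 32 range combinations, here sharing 8 filters (IDs 0..7).
mapping = {
    (a, d): 2 * (a // 2) + d // 2
    for a in range(len(ACTIVITY_EDGES) + 1)
    for d in range(NUM_DIRECTION_RANGES)
}


def filter_for(activity, direction):
    """Look up the filter ID for a pair of metric values."""
    a = bisect_right(ACTIVITY_EDGES, activity)  # index of the activity range
    return mapping[(a, direction)]


print(len(mapping), len(set(mapping.values())))
print(filter_for(activity=50, direction=3))
```

- A decoder holding the same table and the same thresholds would select the same filter for the same metric values, which is why the mapping is either signaled or derived identically on both sides.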
- This disclosure describes techniques for signaling from an encoder to a decoder, in an encoded bitstream, a mapping of range combinations to filters. The mapping may, for example, associate each range combination with a filter identification (ID). One simple way to signal this mapping is to use one codeword for each filter ID, and then for each combination of ranges, send the codeword of the corresponding filter ID. This technique, however, is typically inefficient. Techniques of the present disclosure may exploit correlations within the mapping by using differential coding methods. Combinations of ranges that share a common range sometimes use the same filter. As one example, the combination of a first range for a first metric and a first range for a second metric and the combination of the first range for the first metric and a second range for the second metric share a common range (the first range of the first metric). Thus, these two combinations might, in some instances, map to the same filter ID. By exploiting this correlation, the techniques of this disclosure may reduce the number of bits needed to signal the mapping of range combinations to filter IDs from an encoder to a decoder.
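- One way to picture how such a correlation could be exploited is sketched below. This is an illustrative assumption and not the signaling syntax of this disclosure: each combination is coded either as a "same" symbol, meaning it reuses the filter ID of the previous combination that shares the first-metric range, or as a "new" symbol carrying an explicit filter ID. The function names `encode_mapping` and `decode_mapping` are hypothetical.

```python
def encode_mapping(filter_map):
    """Encode a 2D map of filter IDs (rows = first-metric ranges) as symbols.

    A symbol is ("same",) when the ID equals that of the previous combination
    in the same row, otherwise ("new", filter_id).
    """
    symbols = []
    for row in filter_map:
        prev = None  # no predictor at the start of each row
        for fid in row:
            if fid == prev:
                symbols.append(("same",))
            else:
                symbols.append(("new", fid))
            prev = fid
    return symbols


def decode_mapping(symbols, rows, cols):
    """Rebuild the 2D map of filter IDs from the symbol list."""
    it = iter(symbols)
    filter_map = []
    for _ in range(rows):
        row = []
        for _ in range(cols):
            sym = next(it)
            row.append(row[-1] if sym[0] == "same" else sym[1])
        filter_map.append(row)
    return filter_map


example_map = [[0, 0, 1, 1], [2, 2, 2, 2]]
example_symbols = encode_mapping(example_map)
```

- For `example_map`, only three of the eight symbols carry an explicit filter ID; the remaining five are "same" symbols, which a real entropy coder could represent with very few bits.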
- In addition to signaling the mapping of range combinations to filter IDs, this disclosure also describes techniques for signaling, in an encoded bitstream, filter coefficients for filters. Techniques of the present disclosure include using differential coding methods to signal filter coefficients from an encoder to a decoder. In this manner, the filter coefficients for a second filter might be communicated to a decoder as difference information, where the difference information describes how to modify the filter coefficients of a first filter in a manner that produces the filter coefficients of the second filter. Differential coding techniques may be more effective (i.e., may result in a greater savings of bits) when the filter coefficients of the first and second filters are more similar than when they are less similar. The techniques of this disclosure include determining a sequential order in which to signal filter coefficients for filters. The orderings determined using the techniques described in this disclosure may result in improved differential coding of filter coefficients, and thus may in some instances result in a savings of bits when signaling the filter coefficients.
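- The idea of sending a second filter as difference information relative to a first filter can be sketched as follows. This is a simplified illustration under stated assumptions: coefficients are plain integers, each filter is predicted from the filter immediately before it in the signaling order, and the names `encode_filters`, `decode_filters`, and `residual_cost` are hypothetical.

```python
def encode_filters(filters):
    """Send the first filter directly and each later filter as differences
    from the filter that precedes it in the signaling order."""
    encoded = [list(filters[0])]
    for prev, cur in zip(filters, filters[1:]):
        encoded.append([c - p for c, p in zip(cur, prev)])
    return encoded


def decode_filters(encoded):
    """Reconstruct the filters by adding each difference set to the previous filter."""
    filters = [list(encoded[0])]
    for diff in encoded[1:]:
        filters.append([p + d for p, d in zip(filters[-1], diff)])
    return filters


def residual_cost(encoded):
    """Sum of absolute difference values, a rough proxy for the bits needed."""
    return sum(abs(d) for diff in encoded[1:] for d in diff)


# Two similar 3x3 filters (9 coefficients each), values chosen for illustration.
first = [1, 2, 1, 2, 4, 2, 1, 2, 1]
second = [1, 2, 1, 2, 5, 2, 1, 2, 1]
coded = encode_filters([first, second])
```

- Here the second filter is represented by eight zeros and a single 1, which illustrates why similar filters placed next to each other in the signaling order tend to produce small difference values.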
- Although the techniques of this disclosure may at times be described in reference to in-loop filtering, the techniques may be applied to in-loop filtering, post-loop filtering, and other filtering schemes such as switched filtering. In-loop filtering generally refers to filtering in which the filtered data is part of the encoding and decoding loops such that filtered data is used for predictive intra- or inter-coding. Post-loop filtering refers to filtering that is applied to reconstructed video data after the encoding loop. With post-loop filtering, the unfiltered data, as opposed to the filtered data, is used for predictive intra- or inter-coding. In some implementations, the type of filtering may switch between post-loop filtering and in-loop filtering on, for example, a frame-by-frame, slice-by-slice, or other such basis, and the decision of whether to use post-loop filtering or in-loop filtering can be signaled from encoder to decoder for each frame, slice, etc. The techniques of this disclosure are not limited to in-loop filtering or post filtering, and may apply to a wide range of filtering applied during video coding.
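- The distinction between in-loop and post-loop filtering drawn above comes down to which version of the reconstructed data is kept as the prediction reference. The sketch below is a toy illustration: `code_frame`, `reconstruct`, and `loop_filter` are hypothetical names, and frames are modeled as flat lists of pixel values.

```python
def code_frame(reconstruct, loop_filter, reference, mode):
    """Reconstruct one frame and choose what is stored for future prediction.

    mode "in_loop":   the filtered frame becomes the prediction reference.
    mode "post_loop": the unfiltered frame becomes the prediction reference;
                      the filtered frame is used for output only.
    """
    reconstructed = reconstruct(reference)
    filtered = loop_filter(reconstructed)
    if mode == "in_loop":
        next_reference = filtered
    elif mode == "post_loop":
        next_reference = reconstructed
    else:
        raise ValueError("mode must be 'in_loop' or 'post_loop'")
    return filtered, next_reference


# Toy stand-ins for reconstruction and filtering.
toy_reconstruct = lambda ref: [r + 1 for r in ref]
toy_filter = lambda px: [p * 2 for p in px]
```

- Because `mode` is an argument of each call, the same function also covers the switched case in which the choice is made per frame or per slice.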
- In this disclosure, the term “coding” refers to encoding or decoding. Similarly, the term “coder” generally refers to any video encoder, video decoder, or combined encoder/decoder (codec). Accordingly, the term “coder” is used herein to refer to a specialized computer device or apparatus that performs video encoding or video decoding.
- Additionally, in this disclosure, the term “filter” generally refers to a set of filter coefficients. For example, a 3×3 filter may be defined by a set of 9 filter coefficients, a 5×5 filter may be defined by a set of 25 filter coefficients, a 9×5 filter may be defined by a set of 45 filter coefficients, and so on. The term “set of filters” generally refers to a group of more than one filter. For example, a set of two 3×3 filters could include a first set of 9 filter coefficients and a second set of 9 filter coefficients. According to techniques described in this disclosure, for a series of video blocks, such as a frame, slice, or largest coding unit (LCU), information identifying sets of filters is signaled from the encoder to the decoder in a header for the series of the video blocks. The term “shape,” sometimes called the “filter support,” generally refers to the number of rows of filter coefficients and number of columns of filter coefficients for a particular filter. For example, 9×9 is an example of a first shape, 9×5 is an example of a second shape, and 5×9 is an example of a third shape. In some instances, filters may take non-rectangular shapes including diamond shapes, diamond-like shapes, circular shapes, circular-like shapes, hexagonal shapes, octagonal shapes, cross shapes, X-shapes, T-shapes, other geometric shapes, or numerous other shapes or configurations.
-
FIG. 1 is a block diagram illustrating an exemplary video encoding and decoding system 110 that may implement techniques of this disclosure. As shown in FIG. 1, system 110 includes a source device 112 that transmits encoded video data to a destination device 116 via a communication channel 115. Source device 112 and destination device 116 may comprise any of a wide range of devices. In some cases, source device 112 and destination device 116 may comprise wireless communication device handsets, such as so-called cellular or satellite radiotelephones. The techniques of this disclosure, however, which apply more generally to filtering of video data, are not necessarily limited to wireless applications or settings, and may be applied to non-wireless devices including video encoding and/or decoding capabilities. - In the example of
FIG. 1, source device 112 includes a video source 120, a video encoder 122, a modulator/demodulator (modem) 123 and a transmitter 124. Destination device 116 includes a receiver 126, a modem 127, a video decoder 128, and a display device 130. In accordance with this disclosure, video encoder 122 of source device 112 may be configured to select one or more sets of filter coefficients for multiple inputs in a video block filtering process and then encode the selected one or more sets of filter coefficients. Specific filters from the one or more sets of filter coefficients may be selected based on one or more metrics for one or more inputs, and the filter coefficients may be used to filter the one or more inputs. The filtering techniques of this disclosure are generally compatible with any techniques for coding or signaling filter coefficients in an encoded bitstream. - According to the techniques of this disclosure, a device including
video encoder 122 can signal to a device including video decoder 128 one or more sets of filter coefficients for a series of video blocks, such as a frame or a slice. For the series of video blocks, video encoder 122 may, for example, signal one set of filters to be used with all inputs, or may signal multiple sets of filters to be used with multiple inputs (one set per input, for example). Each video block or CU within the series of video blocks can then contain additional syntax to identify which filter or filters of the set of the filters is to be used for each input of that video block, or in accordance with the techniques of this disclosure, which filter or filters of the set of the filters is to be used can be determined based on two or more metrics associated with one or more of the inputs. - More specifically,
video encoder 122 of source device 112 may select one or more sets of filters for a series of video blocks, apply filters from the set(s) to pixels or groups of pixels of inputs associated with CUs of the series of video blocks during the encoding process, and then encode the sets of filters (i.e., sets of filter coefficients) for communication to video decoder 128 of destination device 116. Video encoder 122 may determine one or more metrics associated with inputs of CUs coded in order to select which filter(s) from the set(s) of filters to use with pixels or groups of pixels for that particular CU. Video encoder 122 may also signal to video decoder 128, as part of the coded bitstream, a mapping of combinations of ranges to filters within a set of filters. - On the decoder side,
video decoder 128 may determine the filter coefficients based on filter information received in the bitstream syntax. Video decoder 128 may decode the filter coefficients based on direct decoding or predictive decoding depending upon how the filter coefficients were encoded, which may be signaled as part of the bitstream syntax. Additionally, the bitstream may include filter description syntax information to describe the filters for a set of filters. Based on the filter description syntax, decoder 128 can reconstruct the filter coefficients based on additional information received from encoder 122. The illustrated system 110 of FIG. 1 is merely exemplary. The filtering techniques of this disclosure may be performed by any encoding or decoding devices. Source device 112 and destination device 116 are merely examples of coding devices that can support such techniques. Video decoder 128 may also determine the mapping of combinations of ranges to filters based on filter information received in the bitstream syntax. -
Video encoder 122 of source device 112 may encode video data received from video source 120 using the techniques of this disclosure. Video source 120 may comprise a video capture device, such as a video camera, a video archive containing previously captured video, or a video feed from a video content provider. As a further alternative, video source 120 may generate computer graphics-based data as the source video, or a combination of live video, archived video, and computer-generated video. In some cases, if video source 120 is a video camera, source device 112 and destination device 116 may form so-called camera phones or video phones. In each case, the captured, pre-captured or computer-generated video may be encoded by video encoder 122. - Once the video data is encoded by
video encoder 122, the encoded video information may then be modulated by modem 123 according to a communication standard, such as code division multiple access (CDMA), frequency division multiple access (FDMA), orthogonal frequency division multiplexing (OFDM), or any other communication standard or technique, and transmitted to destination device 116 via transmitter 124. Modem 123 may include various mixers, filters, amplifiers or other components designed for signal modulation. Transmitter 124 may include circuits designed for transmitting data, including amplifiers, filters, and one or more antennas. -
Receiver 126 of destination device 116 receives information over channel 115, and modem 127 demodulates the information. The video decoding process performed by video decoder 128 may include filtering, e.g., as part of the in-loop decoding or as a post filtering step following the decoding loop. Either way, the set of filters applied by video decoder 128 for a particular slice or frame may be decoded using the techniques of this disclosure. Decoded filter information may include identifying filter description syntax in the coded bitstream. If, for example, predictive coding is used for the filter coefficients, similarities between different filter coefficients may be exploited to reduce the amount of information conveyed over channel 115. In particular, a filter (i.e., a set of the filter coefficients) can be predictively coded as difference values relative to another set of the filter coefficients associated with a different filter. The different filter may, for example, be associated with a different slice or frame. In such a case, video decoder 128 might receive an encoded bitstream comprising video blocks and filter information that identifies the different frame or slice with which the different filter is associated. The filter information also includes difference values that define the current filter relative to the filter of the different CU. In particular, the difference values may comprise filter coefficient difference values that define filter coefficients for the current filter relative to filter coefficients of a different filter used for a different CU. -
Video decoder 128 decodes the video blocks, generates the filter coefficients, and filters the decoded video blocks based on the generated filter coefficients. Video decoder 128 can generate the filter coefficients based on filter description syntax retrieved from the bitstream. The decoded and filtered video blocks can be assembled into video frames to form decoded video data. Display device 130 displays the decoded video data to a user, and may comprise any of a variety of display devices such as a cathode ray tube (CRT), a liquid crystal display (LCD), a plasma display, an organic light emitting diode (OLED) display, or another type of display device. -
Communication channel 115 may comprise any wireless or wired communication medium, such as a radio frequency (RF) spectrum or one or more physical transmission lines, or any combination of wireless and wired media. Communication channel 115 may form part of a packet-based network, such as a local area network, a wide-area network, or a global network such as the Internet. Communication channel 115 generally represents any suitable communication medium, or collection of different communication media, for transmitting video data from source device 112 to destination device 116. Again, FIG. 1 is merely exemplary and the techniques of this disclosure may apply to video coding settings (e.g., video encoding or video decoding) that do not necessarily include any data communication between the encoding and decoding devices. In other examples, data could be retrieved from a local memory, streamed over a network, or the like. - Alternatively, encoded data may be output from
video encoder 122 to a storage device 132. Similarly, encoded data may be accessed from storage device 132 by video decoder 128. Storage device 132 may include any of a variety of distributed or locally accessed data storage media such as a hard drive, Blu-ray discs, DVDs, CD-ROMs, flash memory, volatile or non-volatile memory, or any other suitable digital storage media for storing encoded video data. In a further example, storage device 132 may correspond to a file server or another intermediate storage device that may hold the encoded video generated by source device 112. Destination device 116 may access stored video data from storage device 132 via streaming or download. The file server may be any type of server capable of storing encoded video data and transmitting that encoded video data to the destination device 116. Example file servers include a web server (e.g., for a website), an FTP server, network attached storage (NAS) devices, or a local disk drive. Destination device 116 may access the encoded video data through any standard data connection, including an Internet connection. This may include a wireless channel (e.g., a Wi-Fi connection), a wired connection (e.g., DSL, cable modem, etc.), or a combination of both that is suitable for accessing encoded video data stored on a file server. The transmission of encoded video data from storage device 132 may be a streaming transmission, a download transmission, or a combination of both. - The techniques of this disclosure are not necessarily limited to wireless applications or settings. The techniques may be applied to video coding in support of any of a variety of multimedia applications, such as over-the-air television broadcasts, cable television transmissions, satellite television transmissions, streaming video transmissions, e.g., via the Internet, encoding of digital video for storage on a data storage medium, decoding of digital video stored on a data storage medium, or other applications. In some examples,
system 110 may be configured to support one-way or two-way video transmission to support applications such as video streaming, video playback, video broadcasting, and/or video telephony. -
Video encoder 122 and video decoder 128 may operate according to a video compression standard such as the ITU-T H.264 standard, alternatively referred to as MPEG-4, Part 10, Advanced Video Coding (AVC), which will be used in parts of this disclosure for purposes of explanation. However, many of the techniques of this disclosure may be readily applied to any of a variety of other video coding standards, including the newly emerging HEVC standard. Generally, any standard that allows for filtering at the encoder and decoder may benefit from various aspects of the teaching of this disclosure. - Although not shown in
FIG. 1, in some aspects, video encoder 122 and video decoder 128 may each be integrated with an audio encoder and decoder, and may include appropriate MUX-DEMUX units, or other hardware and software, to handle encoding of both audio and video in a common data stream or separate data streams. If applicable, MUX-DEMUX units may conform to the ITU H.223 multiplexer protocol, or other protocols such as the user datagram protocol (UDP). -
Video encoder 122 and video decoder 128 each may be implemented as one or more microprocessors, digital signal processors (DSPs), application specific integrated circuits (ASICs), field programmable gate arrays (FPGAs), discrete logic, software, hardware, firmware or any combinations thereof. Each of video encoder 122 and video decoder 128 may be included in one or more encoders or decoders, either of which may be integrated as part of a combined encoder/decoder (CODEC) in a respective mobile device, subscriber device, broadcast device, server, or the like. - In some cases,
system 110 may support one-way or two-way video transmission between video devices such as source device 112 and destination device 116. - During the encoding process,
video encoder 122 may execute a number of coding techniques or steps. In general, video encoder 122 operates on video blocks within individual video frames in order to encode the video data. In one example, a video block may correspond to a macroblock or a partition of a macroblock. Macroblocks are one type of video block defined by the ITU H.264 standard and other standards. Macroblocks typically refer to 16×16 blocks of data, although the term is also sometimes used generically to refer to any video block of N×N or N×M size. The ITU-T H.264 standard supports intra prediction in various block sizes, such as 16×16, 8×8, or 4×4 for luma components, and 8×8 for chroma components, as well as inter prediction in various block sizes, such as 16×16, 16×8, 8×16, 8×8, 8×4, 4×8 and 4×4 for luma components and corresponding scaled sizes for chroma components. In this disclosure, “N×N” refers to the pixel dimensions of the block in terms of vertical and horizontal dimensions, e.g., 16×16 pixels. In general, a 16×16 block will have 16 pixels in a vertical direction and 16 pixels in a horizontal direction. Likewise, an N×N block generally has N pixels in a vertical direction and N pixels in a horizontal direction, where N represents a positive integer value. The pixels in a block may be arranged in rows and columns. - The emerging HEVC standard defines new terms for video blocks. In particular, video blocks (or partitions thereof) may be referred to as “coding units” (or CUs). With the HEVC standard, largest coding units (LCUs) may be divided into smaller CUs according to a quadtree partitioning scheme, and the different CUs that are defined in the scheme may be further partitioned into so-called prediction units (PUs). The LCUs, CUs, and PUs are all video blocks within the meaning of this disclosure. Other types of video blocks may also be used, consistent with the HEVC standard or other video coding standards. Thus, the phrase “video blocks” refers to any size of video block. 
Separate CUs may be included for luma components and scaled sizes for chroma components for a given pixel, although other color spaces could also be used.
- Video blocks may have fixed or varying sizes, and may differ in size according to a specified coding standard. Each video frame may include a plurality of slices. Each slice may include a plurality of video blocks, which may be arranged into partitions, also referred to as sub-blocks. In accordance with the quadtree partitioning scheme referenced above and described in more detail below, an N/2×N/2 first CU may comprise a sub-block of an N×N LCU, and an N/4×N/4 second CU may in turn comprise a sub-block of the first CU. An N/8×N/8 PU may comprise a sub-block of the second CU. Similarly, as a further example, block sizes that are less than 16×16 may be referred to as partitions of a 16×16 video block or as sub-blocks of the 16×16 video block. Likewise, for an N×N block, block sizes less than N×N may be referred to as partitions or sub-blocks of the N×N block. Video blocks may comprise blocks of pixel data in the pixel domain, or blocks of transform coefficients in the transform domain, e.g., following application of a transform such as a discrete cosine transform (DCT), an integer transform, a wavelet transform, or a conceptually similar transform to the residual video block data representing pixel differences between coded video blocks and predictive video blocks. In some cases, a video block may comprise blocks of quantized transform coefficients in the transform domain.
- Syntax data within a bitstream may define an LCU for a frame or a slice, which is a largest coding unit in terms of the number of pixels for that frame or slice. In general, an LCU or CU has a similar purpose to a macroblock coded according to H.264, except that LCUs and CUs do not have a specific size distinction. Instead, an LCU size can be defined on a frame-by-frame or slice-by-slice basis, and an LCU may be split into CUs. In general, references in this disclosure to a CU may refer to an LCU of a picture or a sub-CU of an LCU. An LCU may be split into sub-CUs, and each sub-CU may be split into sub-CUs. Syntax data for a bitstream may define a maximum number of times an LCU may be split, referred to as CU depth. Accordingly, a bitstream may also define a smallest coding unit (SCU). This disclosure also uses the terms “block” and “video block” to refer to any of an LCU, CU, PU, SCU, or TU.
- As introduced above, an LCU may be associated with a quadtree data structure. In general, a quadtree data structure includes one node per CU, where a root node corresponds to the LCU. If a CU is split into four sub-CUs, the node corresponding to the CU includes four child nodes, each of which corresponds to one of the sub-CUs. Each node of the quadtree data structure may provide syntax data for the corresponding CU. For example, a node in the quadtree may include a split flag, indicating whether the CU corresponding to the node is split into sub-CUs. Syntax elements for a CU may be defined recursively, and may depend on whether the CU is split into sub-CUs.
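- The recursive split-flag structure described above can be sketched as a small tree that is written to, and read back from, a flat sequence of one-bit flags. This is a minimal illustration: the depth-first, parent-before-children ordering and the names `QuadtreeNode`, `write_flags`, and `read_flags` are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class QuadtreeNode:
    # A node is either a leaf (no children) or has exactly four children.
    children: List["QuadtreeNode"] = field(default_factory=list)

    @property
    def split_flag(self) -> int:
        return 1 if self.children else 0


def write_flags(node, out):
    """Append split flags in depth-first order (parent before children)."""
    out.append(node.split_flag)
    for child in node.children:
        write_flags(child, out)
    return out


def read_flags(flags):
    """Rebuild the quadtree from a depth-first sequence of split flags."""
    it = iter(flags)

    def parse():
        node = QuadtreeNode()
        if next(it) == 1:
            node.children = [parse() for _ in range(4)]
        return node

    return parse()


# An LCU split into four sub-CUs, the second of which is split again.
inner = QuadtreeNode([QuadtreeNode() for _ in range(4)])
root = QuadtreeNode([QuadtreeNode(), inner, QuadtreeNode(), QuadtreeNode()])
flag_string = "".join(str(f) for f in write_flags(root, []))
```

- For this tree of nine nodes, `flag_string` is `101000000`: one flag for the root, one for each of its four children, and one for each of the four children of the split sub-CU.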
- A CU that is not split may include one or more prediction units (PUs). In general, a PU represents all or a portion of the corresponding CU, and includes data for retrieving a reference sample for the PU. For example, when the PU is intra-mode encoded, the PU may include data describing an intra-prediction mode for the PU. As another example, when the PU is inter-mode encoded, the PU may include data defining a motion vector for the PU. The data defining the motion vector may describe, for example, a horizontal component of the motion vector, a vertical component of the motion vector, a resolution for the motion vector (e.g., one-quarter pixel precision or one-eighth pixel precision), a reference frame to which the motion vector points, and/or a reference list (e.g.,
list 0 or list 1) for the motion vector. Data for the CU defining the PU(s) may also describe, for example, partitioning of the CU into one or more PUs. Partitioning modes may differ between whether the CU is uncoded, intra-prediction mode encoded, or inter-prediction mode encoded. - A CU having one or more PUs may also include one or more transform units (TUs). The TUs comprise the data structure that includes residual transform coefficients, which are typically quantized. In particular, following prediction using a PU, a video encoder may calculate residual values for the portion of the CU corresponding to the PU. The residual values may be transformed, quantized, scanned and stored in a TU, which may have variable sizes corresponding to the size of the transform that was performed. Accordingly, a TU is not necessarily limited to the size of a PU. Thus, TUs may be larger or smaller than corresponding PUs for the same CU. In some examples, the maximum size of a TU may be the size of the corresponding CU. Again, the TUs may comprise the data structures that include the residual transform coefficients associated with a given CU.
-
FIGS. 2A and 2B are conceptual diagrams illustrating an example quadtree 250 and a corresponding LCU 272. FIG. 2A depicts an example quadtree 250, which includes nodes arranged in a hierarchical fashion. Each node in a quadtree, such as quadtree 250, may be a leaf node with no children, or have four child nodes. In the example of FIG. 2A, quadtree 250 includes root node 252. Root node 252 has four child nodes, including leaf nodes 256A-256C (leaf nodes 256) and node 254. Because node 254 is not a leaf node, node 254 includes four child nodes, which in this example are leaf nodes 258A-258D (leaf nodes 258). -
Quadtree 250 may include data describing characteristics of a corresponding LCU, such as LCU 272 in this example. For example, quadtree 250, by its structure, may describe splitting of the LCU into sub-CUs. Assume that LCU 272 has a size of 2N×2N. LCU 272, in this example, has four sub-CUs 276A-276C (sub-CUs 276) and 274, each of size N×N. Sub-CU 274 is further split into four sub-CUs 278A-278D (sub-CUs 278), each of size N/2×N/2. The structure of quadtree 250 corresponds to the splitting of LCU 272, in this example. That is, root node 252 corresponds to LCU 272, leaf nodes 256 correspond to sub-CUs 276, node 254 corresponds to sub-CU 274, and leaf nodes 258 correspond to sub-CUs 278. - Data for nodes of
quadtree 250 may describe whether the CU corresponding to the node is split. If the CU is split, four additional nodes may be present in quadtree 250. In some examples, a node of a quadtree may be implemented similar to the following pseudocode: -
quadtree_node {
    boolean split_flag(1); // signaling data
    if (split_flag) {
        quadtree_node child1;
        quadtree_node child2;
        quadtree_node child3;
        quadtree_node child4;
    }
}
The split_flag value may be a one-bit value representative of whether the CU corresponding to the current node is split. If the CU is not split, the split_flag value may be ‘0’, while if the CU is split, the split_flag value may be ‘1’. With respect to the example of quadtree 250, an array of split flag values may be 101000000. - In some examples, each of sub-CUs 276 and sub-CUs 278 may be intra-prediction encoded using the same intra-prediction mode. Accordingly,
video encoder 122 may provide an indication of the intra-prediction mode in root node 252. Moreover, certain sizes of sub-CUs may have multiple possible transforms for a particular intra-prediction mode. Video encoder 122 may provide an indication of the transform to use for such sub-CUs in root node 252. For example, sub-CUs of size N/2×N/2 may have multiple possible transforms available. Video encoder 122 may signal the transform to use in root node 252. Accordingly, video decoder 128 may determine the transform to apply to sub-CUs 278 based on the intra-prediction mode signaled in root node 252 and the transform signaled in root node 252. - As such,
video encoder 122 need not signal transforms to apply to sub-CUs 276 and sub-CUs 278 in leaf nodes 256 and leaf nodes 258, but may instead simply signal an intra-prediction mode and, in some examples, a transform to apply to certain sizes of sub-CUs, in root node 252, in accordance with the techniques of this disclosure. In this manner, these techniques may reduce the overhead cost of signaling transform functions for each sub-CU of an LCU, such as LCU 272. - In some examples, intra-prediction modes for sub-CUs 276 and/or sub-CUs 278 may be different than intra-prediction modes for
LCU 272. Video encoder 122 and video decoder 128 may be configured with functions that map an intra-prediction mode signaled at root node 252 to an available intra-prediction mode for sub-CUs 276 and/or sub-CUs 278. The function may provide a many-to-one mapping of intra-prediction modes available for LCU 272 to intra-prediction modes for sub-CUs 276 and/or sub-CUs 278. - A slice may be divided into video blocks (or LCUs) and each video block may be partitioned according to the quadtree structure described in relation to
FIGS. 2A-B. Additionally, as shown in FIG. 2C, the quadtree sub-blocks indicated by “ON” may be filtered by loop filters described herein, while quadtree sub-blocks indicated by “OFF” may not be filtered. The decision of whether or not to filter a given block or sub-block may be determined at the encoder by comparing the filtered result and the non-filtered result relative to the original block being coded. FIG. 2D is a decision tree representing partitioning decisions that result in the quadtree partitioning shown in FIG. 2C. The actual filtering applied to any pixels for “ON” blocks may be determined based on the metrics discussed herein. - In particular,
FIG. 2C may represent a relatively large video block that is partitioned according to a quadtree partitioning scheme into smaller video blocks of varying sizes. Each video block is labelled (on or off) in FIG. 2C, to illustrate whether filtering should be applied or avoided for that video block. The video encoder may define this filter map by comparing filtered and unfiltered versions of each video block to the original video block being coded. - Again,
FIG. 2D is a decision tree corresponding to partitioning decisions that result in the quadtree partitioning shown in FIG. 2C. In FIG. 2D, each circle may correspond to a CU. If the circle includes a “1” flag, then that CU is further partitioned into four more CUs, but if the circle includes a “0” flag, then that CU is not partitioned any further. Each circle (e.g., corresponding to CUs) also includes an associated diamond. If the flag in the diamond for a given CU is set to 1, then filtering is turned “ON” for that CU, but if the flag in the diamond for a given CU is set to 0, then filtering is turned off. In this manner, FIGS. 2C and 2D may be individually or collectively viewed as a filter map that can be generated at an encoder and communicated to a decoder at least once per slice of encoded video data in order to communicate the level of quadtree partitioning for a given video block (e.g., an LCU) and whether or not to apply filtering to each partitioned video block (e.g., each CU within the LCU). - Smaller video blocks can provide better resolution, and may be used for locations of a video frame that include high levels of detail. Larger video blocks can provide greater coding efficiency, and may be used for locations of a video frame that include a low level of detail. A slice may be considered to be a plurality of video blocks and/or sub-blocks. Each slice may be an independently decodable series of video blocks of a video frame. Alternatively, frames themselves may be decodable series of video blocks, or other portions of a frame may be defined as decodable series of video blocks. The term “series of video blocks” may refer to any independently decodable portion of a video frame such as an entire frame, a slice of a frame, a group of pictures (GOP) also referred to as a sequence, or another independently decodable unit defined according to applicable coding techniques. 
Aspects of this disclosure might be described in reference to frames or slices, but such references are merely exemplary. It should be understood that generally any series of video blocks may be used instead of a frame or a slice.
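As a non-limiting illustration, the split flags and filter flags of the FIG. 2D decision tree might be parsed into a filter map as sketched below. The function name `parse_filter_map`, the depth-first flag order, the choice to read a filter flag only for CUs that are not split further, and the 64/16 block sizes are all assumptions made for this sketch and are not taken from the disclosure.

```python
from typing import Iterator, List, Tuple

Leaf = Tuple[int, int, int, bool]  # (x, y, size, filter_on)


def parse_filter_map(bits: Iterator[int], x: int, y: int,
                     size: int, min_size: int) -> List[Leaf]:
    """Recursively read split flags and filter flags for one block.

    Assumed layout: one split flag per CU larger than min_size, followed
    either by four child CUs or by one filter on/off flag for a leaf CU.
    """
    # A CU already at the minimum size cannot split, so no flag is read.
    split = next(bits) if size > min_size else 0
    if split:
        half = size // 2
        leaves: List[Leaf] = []
        for dy in (0, half):
            for dx in (0, half):
                leaves += parse_filter_map(bits, x + dx, y + dy, half, min_size)
        return leaves
    # Leaf CU: the next flag plays the role of the "diamond" (filter on/off).
    return [(x, y, size, bool(next(bits)))]


# Hypothetical flag stream for a 64x64 LCU with a 16x16 minimum CU size.
flags = [1, 0, 1, 1, 1, 0, 0, 1, 0, 0, 0, 1]
leaves = parse_filter_map(iter(flags), 0, 0, 64, 16)
for leaf in leaves:
    print(leaf)
```

Each returned tuple gives the position and size of one CU together with its filtering decision, which is the information the filter map conveys to the decoder.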
- Syntax data may be defined on a per-coded-unit basis such that each CU includes associated syntax data. The filter information described herein may be part of such syntax for a CU, but might more likely be part of syntax for a series of video blocks, such as a frame, a slice, a GOP, LCU, or a sequence of video frames, instead of for a CU. The syntax data can indicate the set or sets of filters to be used with CUs of the slice or frame. Additionally, not all filter information necessarily has to be included in the header of a common series of video blocks. For example, filter description syntax might be transmitted in a frame header, while other filter information is signaled in a header for an LCU.
-
Video encoder 122 may perform predictive coding in which a video block being coded is compared to a predictive frame (or other CU) in order to identify a predictive block. The differences between the current video block being coded and the predictive block are coded as a residual block, and prediction syntax is used to identify the predictive block. The residual block may be transformed and quantized. Transform techniques may comprise a DCT process or conceptually similar process, integer transforms, wavelet transforms, or other types of transforms. In a DCT process, as an example, the transform process converts a set of pixel values into transform coefficients, which may represent the energy of the pixel values in the frequency domain. Quantization is typically applied to the transform coefficients, and generally involves a process that limits the number of bits associated with any given transform coefficient. - Following transform and quantization, entropy coding may be performed on the quantized and transformed residual video blocks. Syntax elements, such as the filter information and prediction vectors defined during the encoding, may also be included in the entropy coded bitstream for each CU. In general, entropy coding comprises one or more processes that collectively compress a sequence of quantized transform coefficients and/or other syntax information. Scanning techniques, such as zig-zag scanning techniques, are performed on the quantized transform coefficients, e.g., as part of the entropy coding process, in order to define one or more serialized one-dimensional vectors of coefficients from two-dimensional video blocks. Other scanning techniques, including other scan orders or adaptive scans, may also be used, and possibly signaled in the encoded bitstream. 
In any case, the scanned coefficients are then entropy coded along with any syntax information, e.g., via context adaptive variable length coding (CAVLC), context adaptive binary arithmetic coding (CABAC), or another entropy coding process.
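As an illustration of the scanning step, the following sketch serializes a two-dimensional block of quantized coefficients into a one-dimensional vector using a conventional zig-zag order. The helper names `zigzag_order` and `zigzag_scan` are hypothetical, and the sample block simply holds each position's raster index so that the scan order is visible in the output.

```python
from typing import List, Tuple


def zigzag_order(n: int) -> List[Tuple[int, int]]:
    """Return (row, col) positions of an n x n block in zig-zag order."""
    order: List[Tuple[int, int]] = []
    for s in range(2 * n - 1):  # s = row + col indexes one anti-diagonal
        diag = [(r, s - r) for r in range(n) if 0 <= s - r < n]
        # Even diagonals are walked bottom-left to top-right, odd ones the
        # other way, which produces the familiar zig-zag pattern.
        order.extend(diag[::-1] if s % 2 == 0 else diag)
    return order


def zigzag_scan(block: List[List[int]]) -> List[int]:
    """Serialize a square 2-D block into a 1-D coefficient vector."""
    return [block[r][c] for r, c in zigzag_order(len(block))]


# Sample 4x4 block whose entries are their own raster-scan indices.
block = [[4 * r + c for c in range(4)] for r in range(4)]
scanned = zigzag_scan(block)
print(scanned)
```

Because low-frequency coefficients sit near the top-left corner, a scan of this kind tends to group the nonzero values at the front of the vector and the zeros at the end, which suits run-based entropy coding.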
As part of the encoding process, encoded video blocks may be decoded in order to generate the video data used for subsequent prediction-based coding of subsequent video blocks. At this stage, filtering may be performed in order to improve video quality, e.g., by removing blockiness artifacts from decoded video. The filtered data may be used for prediction of other video blocks, in which case the filtering is referred to as “in-loop” filtering. Alternatively, prediction of other video blocks may be based on unfiltered data, in which case the filtering is referred to as “post filtering.”
- On a frame-by-frame, slice-by-slice, or LCU-by-LCU basis,
video encoder 122 may select one or more sets of filters, and on a coded-unit-by-coded-unit basis, the encoder may determine whether or not to apply filtering. For the CUs that are to be filtered, the encoder can perform filtering on a pixel-by-pixel or group-by-group basis, where a group might, for example, be a 2×2 block of pixels or a 4×4 block of pixels. These selections can be made in a manner that promotes the video quality. Such sets of filters may be selected from pre-defined sets of filters, or may be adaptively defined to promote video quality. As an example,video encoder 122 may select or define several sets of filters for a given frame or slice such that different filters are used for different pixels or groups of pixels of CUs of that frame or slice. In particular, for each input associated with a CU, several sets of filter coefficients may be defined, and the two or more metrics associated with the pixels of the CU may be used to determine which filter from the set of filters to use with such pixels or groups of pixels. - In some cases,
video encoder 122 may apply several sets of filter coefficients and select one or more sets that produce the best quality video in terms of amount of distortion between a coded block and an original block, and/or the highest levels of compression. In any case, once selected, the set of filter coefficients applied by video encoder 122 for each CU may be encoded and communicated to video decoder 128 of destination device 118 so that video decoder 128 can apply the same filtering that was applied during the encoding process for each given CU.

When two or more metrics are used for determining which filter to use with a particular input for a CU, the selection of the filter for that particular CU does not necessarily need to be communicated to
video decoder 128. Instead, video decoder 128 can also calculate the two or more metrics and, based on filter information previously provided by video encoder 122, match the combination of two or more metrics to a particular filter.
FIG. 3 is a block diagram illustrating a video encoder 350 consistent with this disclosure. Video encoder 350 may correspond to video encoder 122 of device 120, or a video encoder of a different device. As shown in FIG. 3, video encoder 350 includes a prediction module 332, adders 348 and 351, and a memory 334. Video encoder 350 also includes a transform unit 338 and a quantization unit 340, as well as an inverse quantization unit 342 and an inverse transform unit 344. Video encoder 350 also includes a deblocking filter 347 and an adaptive filter unit 349. Video encoder 350 also includes an entropy encoding unit 346. Filter unit 349 of video encoder 350 may perform filtering operations and also may include a filter selection unit (FSU) 353 for identifying a desirable or preferred filter or set of filters to be used for decoding. Filter unit 349 may also generate filter information identifying the selected filters so that the selected filters can be efficiently communicated as filter information to another device to be used during a decoding operation.

During the encoding process,
video encoder 350 receives a video block, such as an LCU, to be coded, andprediction module 332 performs predictive coding techniques on the video block. Using the quadtree partitioning scheme discussed above,prediction module 332 can partition the video block and perform predictive coding techniques on CUs of different sizes. For inter coding,prediction module 332 compares the video block to be encoded, including sub-blocks of the video block, to various blocks in one or more video reference frames or slices in order to define a predictive block. For intra coding,prediction module 332 generates a predictive block based on neighboring data within the same CU.Prediction module 332 outputs the prediction block andadder 348 subtracts the prediction block from the video block being coded in order to generate a residual block. - For inter coding,
prediction module 332 may comprise motion estimation and motion compensation units that identify a motion vector that points to a prediction block and generates the prediction block based on the motion vector. Typically, motion estimation is considered the process of generating the motion vector, which estimates motion. For example, the motion vector may indicate the displacement of a predictive block within a predictive frame relative to the current block being coded within the current frame. Motion compensation is typically considered the process of fetching or generating the predictive block based on the motion vector determined by motion estimation. For intra coding,prediction module 332 generates a predictive block based on neighboring data within the same CU. One or more intra-prediction modes may define how an intra prediction block can be defined. - After
prediction module 332 outputs the prediction block andadder 348 subtracts the prediction block from the video block being coded in order to generate a residual block, transformunit 338 applies a transform to the residual block. The transform may comprise a discrete cosine transform (DCT) or a conceptually similar transform such as that defined by a coding standard such as the HEVC standard. Wavelet transforms, integer transforms, sub-band transforms or other types of transforms could also be used. In any case, transformunit 338 applies the transform to the residual block, producing a block of residual transform coefficients. The transform may convert the residual information from a pixel domain to a frequency domain. -
Quantization unit 340 then quantizes the residual transform coefficients to further reduce bit rate. Quantization unit 340, for example, may limit the number of bits used to code each of the coefficients. After quantization, entropy encoding unit 346 scans the quantized coefficient block from a two-dimensional representation to one or more serialized one-dimensional vectors. The scan order may be pre-programmed to occur in a defined order (such as zig-zag scanning, horizontal scanning, vertical scanning, combinations, or another pre-defined order), or possibly adaptively defined based on previous coding statistics.

Following this scanning process,
entropy encoding unit 346 encodes the quantized transform coefficients (along with any syntax elements) according to an entropy coding methodology, such as CAVLC or CABAC, to further compress the data. Syntax elements included in the entropy coded bitstream may include prediction syntax fromprediction module 332, such as motion vectors for inter coding or prediction modes for intra coding. Syntax elements included in the entropy coded bitstream may also include filter information fromfilter unit 349, which can be encoded in the manner described herein. - CAVLC is one type of entropy encoding technique supported by the ITU H.264/MPEG4, AVC standard, which may be applied on a vectorized basis by
entropy encoding unit 346. CAVLC uses variable length coding (VLC) tables in a manner that effectively compresses serialized “runs” of transform coefficients and/or syntax elements. CABAC is another type of entropy coding technique supported by the ITU H.264/MPEG4, AVC standard, which may be applied on a vectorized basis byentropy encoding unit 346. CABAC involves several stages, including binarization, context model selection, and binary arithmetic coding. In this case,entropy encoding unit 346 codes transform coefficients and syntax elements according to CABAC. Like the ITU H.264/MPEG4, AVC standard, the emerging HEVC standard may also support both CAVLC and CABAC entropy coding. Furthermore, many other types of entropy coding techniques also exist, and new entropy coding techniques will likely emerge in the future. This disclosure is not limited to any specific entropy coding technique. - Following the entropy coding by
entropy encoding unit 346, the encoded video may be transmitted to another device or archived for later transmission or retrieval. Again, the encoded video may comprise the entropy coded vectors and various syntax, which can be used by the decoder to properly configure the decoding process.Inverse quantization unit 342 andinverse transform unit 344 apply inverse quantization and inverse transform, respectively, to reconstruct the residual block in the pixel domain.Summer 351 adds the reconstructed residual block to the prediction block produced byprediction module 332 to produce a pre-deblocked reconstructed video block, sometimes referred to as pre-deblocked reconstructed image.De-blocking filter 347 may apply filtering to the pre-deblocked reconstructed video block to improve video quality by removing blockiness or other artifacts. The output of thede-blocking filter 347 can be referred to as a post-deblocked video block, reconstructed video block, or reconstructed image. -
Filter unit 349 can be configured to receive a single input or multiple inputs. In the example ofFIG. 3 ,filter unit 349 receives as input the post-deblocked reconstructed image (RI), pre-deblocked reconstructed image (pRI), the prediction image (PI), and the reconstructed residual block (EI).Filter unit 349 can use any of these inputs either individually or in combination to produce a reconstructed image to store inmemory 334. Additionally, as will be discussed in more detail below, based on two or more metrics, one or more filters can be selected to be applied to the input(s). In one example, the output offilter unit 349 may be one additional filter applied to RI. In another example, the output offilter unit 349 may be one additional filter applied to pRI. In other examples, however, the output offilter unit 349 may be based on multiple inputs. For example,filter unit 349 may apply a first filter to pRI and then use the filtered version of pRI in conjunction with filtered versions of EI and PI to create a reconstructed image. In instances where the output offilter unit 349 is the product of one additional filter being applied to a single input,filter unit 349 may in fact apply filters to the other inputs, but those filters might have all zero coefficients. Similarly, if the output offilter unit 349 is the product of applying three filters to three inputs,filter unit 349 may in fact apply a filter to the fourth input, but that filter might have all zero coefficients. -
Filter unit 349 may also be configured to receive a single input. For example, althoughFIG. 3 shows PI, EI, pRI, and RI being input intofilter unit 349, in some implementations RI might be the only input received byfilter unit 349. In such an implementation,filter unit 349 might apply a filter to RI so that a filtered version of RI is more similar to the original image than the unfiltered version of RI. In other implementations,filter unit 349 andde-blocking filter 347 may be combined into a single filtering unit that applies filtering to pRI. The techniques of this disclosure, which generally relate to multi-metric-based filter mapping, are compatible with both single-input and multi-input filtering schemes that utilize multiple filters. - Filtering by
filter unit 349 may improve compression by generating predictive video blocks that more closely match video blocks being coded than unfiltered predictive video blocks. After filtering, the reconstructed video block may be used byprediction module 332 as a reference block to inter-code a block in a subsequent video frame or other CU. Althoughfilter unit 349 is shown “in-loop,” the techniques of this disclosure could also be used with post filters, in which case non-filtered data (rather than filtered data) would be used for purposes of predicting data in subsequent CUs. - For a series of video blocks, such as a slice or frame,
filter unit 349 may select sets of filters for each input in a manner that promotes the video quality. For example, filter unit 349 may select sets of filters from pre-defined sets of coefficients, or may adaptively define filters in order to promote video quality or improved compression. Filter unit 349 may select or define one or more sets of filters for a given CU such that the same set(s) of filters are used for pixels of different video blocks of that CU. For a particular frame, slice, or LCU, filter unit 349 may apply several sets of filters to multiple inputs, and FSU 353 may select the set that produces the best quality video or the highest levels of compression. Alternatively, FSU 353 may train a new filter by analyzing the auto-correlations and cross-correlations between multiple inputs and an original image. A new set of filters may, for example, be determined by solving Wiener-Hopf equations based on the auto- and cross-correlations. Regardless of whether a new set of filters is trained or an existing set of filters is selected, filter unit 349 generates syntax for inclusion in the bitstream that enables a decoder to also identify the set or sets of filters to be used for the particular frame or slice.

According to this disclosure, for each pixel of a CU within the series of video blocks,
filter unit 349 may select which filter from the set of filters is to be used based on two or more metrics that quantify properties associated with one or more sets of pixels within the CU. In this way,FSU 353 may determine sets of filters for a higher level coded unit such as a frame or slice, whilefilter unit 349 determines which filter(s) from the set(s) is to be used for a particular pixel of a lower level coded unit based on the two or more metrics associated with the pixels of that lower level coded unit. - A set of M filters may be used for each input. Depending on design preferences, M may, for example, be as few as 2 or as great as 16, or even higher. A large number of filters per input may improve video quality, but also may increase overhead associated with signaling sets of filters from encoder to decoder. The set of M filters can be determined by
FSU 353 as described above and signaled to the decoder for each frame or slice. A segmentation map can be used to indicate how a CU is segmented and whether or not a particular sub-unit of the CU is to be filtered. The segmentation map may, for example, include for a CU an array of split flags as described above as well as an additional bit signaling whether each sub-CU is to be filtered. For each input associated with a pixel of a CU that is to be filtered, a specific filter from the set of filters can be chosen based on two or more metrics. Combinations of values for two or more metrics can be indexed to particular filters from the set of M filters.
FIG. 4A is a conceptual diagram illustrating ranges of values for two metrics indexed to filters from a set of filters. The particular example ofFIG. 4A shows eight filters (i.e.Filter 1,Filter 2 . . . Filter 8), but more or fewer filters may similarly be used.FIG. 4A shows two metrics that might be used for selecting a filter in accordance with the techniques of this disclosure. The two metrics may, for example, quantify properties of the pixel data related to non-direction specific activity (e.g. a sum-modified Laplacian value) and direction, direction-specific activity and edge detection, a direction metric and an edge metric, a horizontal activity metric and a vertical activity metric, or two other such metrics. In some instances, three or more metrics might be used, in which case the conceptual diagram ofFIG. 4A would include a third dimension for mapping ranges of the metrics to filters from the set of filters. - In the example of
FIG. 4A , a first metric (Metric 1) has four ranges (Ranges 1-1, 1-2, 1-3, and 1-4), and a second metric (Metric 2) also has four ranges (Ranges 2-1, 2-2, 2-3, and 2-4). Therefore, the example ofFIG. 4A has sixteen combinations of ranges forMetric 1 andMetric 2. As can be seen fromFIG. 4A , however, each combination is not necessarily associated with a unique filter. The combination of Range 1-1 and Range 2-1, as well as combinations 1-1 and 2-2, and 1-1 and 2-3, for instance, are all mapped toFilter 1, in the example ofFIG. 4A .Filter 4, in contrast, is only mapped to one combination (1-1 and 2-4). Although the ranges ofFIG. 4A are shown as being relatively equal, the sizes of ranges may vary. For example, in some implementations, Range 1-1 may encompass a greater range of values than Range 1-2. Additionally, althoughFIG. 4A showsMetric 1 andMetric 2 as having the same number of ranges, the number of ranges for a first metric and the number of ranges for a second metric do not necessarily need to be equal. If, for example,Metric 1 is a variance metric andMetric 2 is a direction metric,Metric 1 might use eight ranges whileMetric 2 uses three ranges. - In some examples, the ranges of
Metric 1 and Metric 2 may represent a continuous spectrum of values. For example, if Metric 1 is a sum-modified Laplacian value, Range 1-2 may correspond to more activity than Range 1-1 but less activity than Range 1-3, and Range 1-4 may correspond to more activity than Range 1-3. Within a range, the amount of activity determined for a particular pixel or group of pixels may similarly increase along the Metric 1 axis. In other examples, the ranges of Metric 1 and Metric 2 may not represent actual ranges but instead may represent discrete determinations. For example, if Metric 2 is a direction metric, Range 2-1 may correspond to a determination of no direction, Range 2-2 may correspond to a determination of horizontal direction, Range 2-3 may correspond to a determination of vertical direction, and Range 2-4 may represent a determination of diagonal direction. As will be described in more detail below, no direction, horizontal direction, vertical direction, and diagonal direction can be discrete determinations, and thus, the ranges for Metric 2 might not represent a continuous spectrum of values in the same way the ranges of Metric 1 do.
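As a non-limiting sketch of how a combination of two metric values might be indexed to a filter, the following example quantizes a continuous activity value into one of four ranges and pairs it with a discrete direction determination. The threshold values, the names `ACTIVITY_THRESHOLDS`, `DIRECTIONS`, `FILTER_TABLE`, and `select_filter`, and every table row other than the first are hypothetical; only the first row follows the FIG. 4A description, in which Range 1-1 combined with Ranges 2-1 through 2-3 maps to Filter 1 and Range 1-1 combined with Range 2-4 maps to Filter 4.

```python
from bisect import bisect_right

# Hypothetical upper bounds separating Ranges 1-1, 1-2, 1-3 and 1-4.
ACTIVITY_THRESHOLDS = [10, 40, 120]

# Discrete determinations standing in for Ranges 2-1 through 2-4.
DIRECTIONS = ("none", "horizontal", "vertical", "diagonal")

# FILTER_TABLE[metric1_range][metric2_range] -> filter ID.
# Only the first row reflects the text; the other rows are placeholders.
FILTER_TABLE = [
    [1, 1, 1, 4],
    [2, 2, 3, 4],
    [5, 6, 3, 7],
    [5, 6, 8, 8],
]


def select_filter(activity: float, direction: str) -> int:
    """Map an (activity, direction) pair to a filter ID."""
    range1 = bisect_right(ACTIVITY_THRESHOLDS, activity)  # 0..3
    range2 = DIRECTIONS.index(direction)                   # 0..3
    return FILTER_TABLE[range1][range2]


print(select_filter(5, "none"))       # lowest activity range, no direction
print(select_filter(5, "diagonal"))   # lowest activity range, diagonal
print(select_filter(200, "vertical"))  # highest activity range, vertical
```

Because an encoder and a decoder can both evaluate such a lookup from reconstructed pixel data, only the table itself, rather than a per-pixel filter choice, would need to be conveyed in the bitstream.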
FIG. 4B is a conceptual diagram illustrating ranges of values for an activity metric and a direction metric. In the example ofFIG. 4B , the direction metric includes three discrete determinations (No Direction, Horizontal, and Vertical). Techniques for determining no direction, horizontal, and vertical as well as techniques for determining activity will be explained in greater detail below. The particular example ofFIG. 4B shows six filters (i.e.Filter 1,Filter 2 . . . Filter 6), but more or fewer filters may similarly be used. As can be seen byFIG. 4B , the two metrics (activity and direction) create 15 combinations, identified ascombinations 421 through 435. In some instances, however, additional combinations not explicitly shown inFIG. 4B may also be used. For example, a combination corresponding to no activity may be a 16th combination that also has a corresponding filter. -
Filter unit 349 can store a mapping of filters to combinations of ranges of two or more metrics, such as the example mappings of FIGS. 4A and 4B, and use the mapping to determine which filter from a set of filters to apply to a particular pixel or group of pixels in a CU. The mapping of filters to combinations of ranges of two or more metrics may, for example, be determined by filter unit 349 as part of the filter selection process described above. Regardless of how the mapping is determined, filter unit 349 can generate information allowing a decoder to reconstruct the mapping. This information can be included in the coded bitstream to signal the mapping of combinations of ranges to filters. The signaled mapping may map range combinations to filter identifiers (IDs). The actual coefficients for a particular filter might be signaled separately.

In order to generate this information,
filter unit 349 first determines a transmission order for the combinations. The transmission order generally refers to the order in which filters will be signaled for combinations of ranges. UsingFIG. 4A as an example,filter unit 349 might use a left-to-right, top-to-bottom transmission order where the filter forcombination 401 is signaled first, the filter forcombination 402 is signaled second, and the remaining combinations are signaled in the order of 403=>404=>405=>406=>407=>408=>409=>410=>411=>412=>413=>414=>415=>416.Filter unit 349 might also use a top-to-bottom, zig-zag transmission order where the filters for combinations are signaled in the order of 401=>402=>403=>404=>408=>407=>406=>405=>409=>410=>411=>412=>416=>415=>414=>413.Filter unit 349 might also use a top-to-bottom, left-to-right transmission order where the filters for combinations are signaled in the order of 401=>405=>409=>413=>402=>406=>410=>414=>403=>407=>411=>415=>404=>408=>412=>416.Filter unit 349 might also use a left-to-right, zig-zag transmission order where the filters for combinations are signaled in the order of 401=>405=>409=>413=>414=>410=>406=>402=>403=>407=>411=>415=>416=>412=>408=>404. Referring toFIG. 4B ,filter unit 349 may use a left-to-right, bottom-to-top transmission order such that the transmission order is 421=>422=>423=>424=>425=>426=>427=>428=>429=>430=>431=>432=>433=>434=>435. As can be imagined, these are just a few of the many transmission orders that are possible. - According to a technique of this disclosure,
filter unit 349 can use a series of codewords to signal the mapping to a decoder. For example,filter unit 349 can generate a first codeword to indicate if a current combination being decoded maps to the same filter as the most recently decoded combination that shares the same range for the first metric. If a current combination being decoded maps to the same filter as the most recently decoded combination that shares the same range for the second metric, then filterunit 349 can generate a second codeword instead of the first codeword. If a current combination being decoded does not map to the same filter as either of these most recently decoded combinations, then filterunit 349 can generate a third codeword, instead of the first codeword or second codeword, that indicates the filter corresponding to the current combination being decoded. The first and second codeword of the current example may be relatively short compared to the third codeword. For example, the first codeword and second codeword might each be two bits (e.g. 00 and 01, respectively), while the third codeword is more bits (a first bit of 1, plus additional bits). In this particular context, a current combination being decoded or a previous combination being decoded refers to the portion of the encoding and decoding processes where the mapping of filters to range combinations is being signaled by an encoder or constructed by a decoder, and not necessarily to a transmission or decoding of the combination itself. - Examples of the techniques described above will now be given with reference to
FIG. 4A and the left-to-right, top-to-bottom transmission order described above (401=>402=> . . . =>416). If, for example, combination 407 is the combination currently being decoded, then combination 406 is the most recently decoded combination that shares the same range for Metric 1, and combination 403 is the most recently decoded combination that shares the same range for Metric 2. If combination 407 maps to the same filter (Filter 7 in FIG. 4A) as the most recently decoded combination that shares the same range for a second metric (i.e. Range 2-3 for Metric 2), then filter unit 349 can transmit a second codeword (e.g. 01) to indicate that the current combination being decoded (combination 407) maps to the same filter as the most recently decoded combination that shares the same range for a second metric (combination 403).

If, for example,
combination 410 is the current combination being decoded, thencombination 409 is the most recently decoded combination that shares the same range forMetric 1, andcombination 406 is the most recently decoded combination that shares the same range forMetric 2. Ifcombination 410 maps to the same filter (Filter 2 inFIG. 4A ) as the most recently decoded combination that shares the same range for a first metric (i.e. Range 1-2 for Metric 1), then filterunit 349 can transmit a first codeword (e.g. 00) to indicate that the current combination being decoded (combination 410) maps to the same filter (Filter 2) as the most recently decoded combination that shares the same range for a first metric (combination 409). - If, for example,
combination 411 is the current combination being decoded, thencombination 410 is the most recently decoded combination that shares the same range forMetric 1, andcombination 407 is the most recently decoded combination that shares the same range forMetric 2. Ifcombination 411 does not map to the same filter as either ofcombination 410 orcombination 407, then filterunit 349 can transmit a third codeword (e.g. 1+additional bits) to indicate that the current combination being decoded (combination 411) maps to a different filter (Filter 3) than both the most recently decoded combination that shares the same range forMetric 1 and the most recently decoded combination that shares the same range forMetric 2. - For those current combinations where a combination that shares the same range for
Metric 1 or a combination that shares the same range for Metric 2 has not yet been decoded, those options can either be considered unavailable or can be replaced by a different combination. If, for example, combination 409 is the current combination to be decoded, then combination 405 is the most recently decoded combination that shares the same range for Metric 2, but no combination that shares a range for Metric 1 has yet been decoded. In such instances, the most recently decoded combination that shares a range for Metric 1 can be assumed to not map to the same filter as the current combination being decoded. Thus, in this case, the first codeword will not be used for combination 409. Alternatively, the combination that shares a range for Metric 1 can be replaced by another combination, such as the most recently decoded combination or a different previously decoded combination. In such an instance, the most recently decoded combination before combination 409 would be combination 408. Thus, if combination 408 maps to the same filter as combination 409, then filter unit 349 can generate the first codeword. Analogous techniques can be used for those combinations where a previous combination sharing a common range for Metric 2 has not yet been decoded.

For the first combination in a transmission order (
e.g. combination 401 in the example ofFIG. 4A ), where neither a combination that shares the same range forMetric 1 or a combination that shares the same range forMetric 2 have been decoded,filter unit 349 can generate a codeword indicating the filter that maps to the first combination. The filter may, for example, be signaled using the third codeword or may be signaled using a different technique, in which case the techniques described in this disclosure might begin with the second combination in a transmission order or a later combination. - According to another technique of this disclosure,
filter unit 349 can use a series of codewords to signal the mapping to a decoder. In some implementations, filter unit 349 can generate a first codeword to indicate if a current combination being decoded maps to the same filter as the most recently decoded combination that shares the same range for the first metric. If a current combination being decoded does not map to the same filter as the most recently decoded combination that shares that range for the first metric, then filter unit 349 can generate a second codeword, instead of the first codeword, that indicates the filter that maps to the current combination being decoded. In this example, the first codeword may be relatively short compared to the second codeword. For example, the first codeword might be one bit (e.g. 0), while the second codeword is more bits (e.g., a first bit of 1, plus additional bits). Unlike the previous technique where a short codeword might be generated if a current combination maps to the same filter as a previously decoded combination that shares the same range for either Metric 1 or Metric 2, this technique includes only generating a short codeword if the current combination maps to the same filter as a previously decoded combination that shares the same range for Metric 1. Thus, even if the current combination maps to the same filter as a previously decoded combination that shares the same range for Metric 2, filter unit 349 still generates a second codeword (e.g. 1+additional bits). Although this disclosure is using Metric 1 for purposes of explanation, the same techniques can also be applied using only Metric 2.

According to yet another technique of this disclosure,
filter unit 349 can use a different series of codewords to signal the mapping to a decoder. For example, filter unit 349 can generate a first codeword to indicate if a current combination being decoded maps to the same filter as the most recently decoded combination, regardless of which, if any, range the current combination has in common with the previously decoded combination. If the current combination being decoded does not map to the same filter as the most recently decoded combination, then filter unit 349 can generate a second codeword identifying the filter that maps to the current combination. In this particular implementation, the first codeword may be relatively short compared to the second codeword. For example, the first codeword might be one bit (e.g., 0), while the second codeword is more bits (e.g., a first bit of 1, plus additional bits). - Again, using the example of
FIG. 4A and a top-to-bottom, left-to-right transmission order, combination 401 would be the most recently decoded combination if combination 402 is currently being decoded, combination 402 would be the most recently decoded combination if combination 403 is the current combination, and so on. Combination 404 would be the most recently decoded combination if combination 405 is the current combination being decoded. Thus, filter unit 349 can generate the first codeword if combination 402 maps to the same filter as combination 401, if combination 403 maps to the same filter as combination 402, etc. Otherwise, filter unit 349 can generate the second codeword identifying the filter that maps to the current combination. - According to yet another technique of this disclosure,
filter unit 349 can use two codewords to signal the mapping of the filters to combinations. A first codeword, such as a “0”, can be used to signal that a current combination uses the same filter as a previous combination. A second codeword, such as a “1”, can be used to signal that a current combination has a different filter than the previous combination. The second codeword, however, does not need to identify a new filter. Instead, the new filter can be determined based on the transmission order for the classes and the order in which filter coefficients are transmitted. Using the left-to-right, bottom-to-top transmission order described above for FIG. 4B as an example, codewords might be transmitted accordingly: 421 (0)=>422 (0)=>423 (1)=>424 (0)=>425 (0)=>426 (0)=>427 (0)=>428 (1)=>429 (0)=>430 (0)=>431 (0)=>432 (1)=>433 (0)=>434 (0)=>435 (0), with the number in parentheses representing the codeword for that combination. In this example, combinations 421-422 would be mapped to a first filter, combinations 423-427 to a second filter, combinations 428-431 to a third filter, and combinations 432-435 to a fourth filter. The coefficients for the first filter, second filter, third filter, and fourth filter can correspond to the order in which sets of filter coefficients are signaled, where the first set of filter coefficients signaled corresponds to the first filter, the second set of filter coefficients signaled corresponds to the second filter, and so on. Determining an order for transmitting sets of filter coefficients is discussed in more detail below. - The various techniques described in this disclosure for signaling a mapping of filters to combinations of ranges are not mutually exclusive alternatives, but rather, may be used in conjunction with one another. For example, in some implementations, certain combinations might be signaled using a first technique while other combinations are signaled using a second technique. 
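The two-codeword scheme described for FIG. 4B, in which a “1” only announces a change of filter without identifying it, can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the function name `filters_from_flags` is hypothetical, and filters are simply numbered 1, 2, 3, ... in the order in which their coefficient sets would be signaled.

```python
def filters_from_flags(combination_ids, flags):
    """Recover a combination-to-filter mapping from one-bit codewords.

    flags[i] == 0: combination i uses the same filter as the previous one.
    flags[i] == 1: combination i uses the next filter in signaling order.
    Filter numbering (1-based, in coefficient-signaling order) is an
    assumption made for this sketch.
    """
    if len(combination_ids) != len(flags):
        raise ValueError("one codeword is expected per combination")
    mapping = {}
    current_filter = 1  # the first combination belongs to the first filter
    for combo, flag in zip(combination_ids, flags):
        if flag not in (0, 1):
            raise ValueError("codewords must be 0 or 1")
        if flag == 1:
            current_filter += 1  # a new filter starts at this combination
        mapping[combo] = current_filter
    return mapping


# Codewords from the FIG. 4B example: 421 (0) => 422 (0) => 423 (1) => ...
combos = list(range(421, 436))
flags = [0, 0, 1, 0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0]
mapping = filters_from_flags(combos, flags)
print(mapping[422], mapping[423], mapping[428], mapping[435])  # 1 2 3 4
```

With these codewords the sketch reproduces the grouping stated in the example: combinations 421-422 map to the first filter, 423-427 to the second, 428-431 to the third, and 432-435 to the fourth.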
As one example, where one of a combination that shares the same range for Metric 1 or a combination that shares the same range for Metric 2 has not yet been decoded, filter unit 349 may use a first technique. Where both a combination that shares the same range for Metric 1 and a combination that shares the same range for Metric 2 have been decoded, a second technique may be used. - In addition to generating information allowing a decoder to reconstruct the mapping of filters to combinations of ranges,
filter unit 349 also generates information allowing a decoder to reconstruct the filters themselves. Reconstructing the filters includes reconstructing the filter coefficients of the filters. As will be described in more detail below, filter unit 349 can use differential coding techniques to signal the filter coefficients. To use differential coding techniques, filter unit 349 determines an order in which to signal the sets of filter coefficients. - As part of determining the order,
filter unit 349 determines a combination identification (ID) that represents a sequential value for each combination of ranges. Using FIG. 4A as an example, the combinations might be assigned combination IDs that represent sequential values in a left-to-right, top-to-bottom order, in which case combination 401 would be assigned the first sequential value, combination 402 would be assigned the second sequential value, and the remaining combinations would be assigned sequential values in the order of 403=>404=>405=>406=>407=>408=>409=>410=>411=>412=>413=>414=>415=>416. Filter unit 349 might also assign the combination IDs using a top-to-bottom, zig-zag order where the combinations would be assigned combination IDs with sequential values that are in an order of 401=>402=>403=>404=>408=>407=>406=>405=>409=>410=>411=>412=>416=>415=>414=>413. Filter unit 349 might also assign combination IDs using a top-to-bottom, left-to-right order where the combinations are assigned combination IDs with sequential values that are in an order of 401=>405=>409=>413=>402=>406=>410=>414=>403=>407=>411=>415=>404=>408=>412=>416. Filter unit 349 might also use a left-to-right, zig-zag order where the combinations are assigned combination IDs with sequential values in an order of 401=>405=>409=>413=>414=>410=>406=>402=>403=>407=>411=>415=>416=>412=>408=>404. As can be imagined, these are just a few of the many orders that could be used. Furthermore, any of the orders described could be either lowest to highest or highest to lowest. - After
filter unit 349 has determined the mapping of filters to range combinations, filter unit 349 can identify groupings of range combinations that are mapped to the same filter. Using FIG. 4A as an example, the groupings would be as follows. -
Filter 1 Group: combinations 413, 414, and 415 -
Filter 2 Group: combinations 409 and 410 -
Filter 3 Group: combinations 411 and 412 -
Filter 4 Group: combination 416 -
Filter 5 Group: combinations 401 and 405 -
Filter 6 Group: combinations 402 and 406 -
Filter 7 Group: combinations 403 and 407 -
Filter 8 Group: combinations 404 and 408 -
Filter unit 349 can then assign each group a group ID, and the group ID can represent a sequential value. The group IDs can be assigned to the groups based on the sequential values associated with the combinations that comprise the group. For example, the group that has the combination with the lowest associated sequential value, based on the combination IDs, might be assigned the group ID with the lowest sequential value. Of the remaining groups, the group that has the combination with the lowest associated sequential value can be assigned the group ID with the next lowest sequential value. This process can repeat until all groups have been assigned a group ID. In some implementations, group IDs might be assigned based on the combinations with the highest associated sequential values rather than the lowest. In some implementations, the group that has the combination with the lowest associated sequential value, based on the combination IDs, might be assigned the group ID with the highest sequential value, or vice versa. - Again, using
FIG. 4A as an example, and assuming that combinations 401-416 are assigned combination IDs with sequential values in a left-to-right, top-to-bottom order, then filter unit 349 can assign group IDs to the filter groups, as shown below in Table 1. -
TABLE 1
Group Name       Combinations in group   Combination with lowest sequential value   Group ID
Filter 1 Group   413, 414, 415           413                                        7
Filter 2 Group   409, 410                409                                        5
Filter 3 Group   411, 412                411                                        6
Filter 4 Group   416                     416                                        8
Filter 5 Group   401, 405                401                                        1
Filter 6 Group   402, 406                402                                        2
Filter 7 Group   403, 407                403                                        3
Filter 8 Group   404, 408                404                                        4
- In the example of
FIG. 4A, shown in Table 1, filter unit 349 assigns the Filter 5 Group the group ID with the lowest sequential value because the Filter 5 Group includes the range combination with the lowest sequential value (i.e., combination 401). Filter unit 349 assigns the Filter 6 Group the group ID with the second lowest sequential value because, of the remaining filter groups (i.e., all the groups excluding the Filter 5 Group), the Filter 6 Group includes the range combination with the second lowest sequential value (i.e., combination 402). Filter unit 349 assigns the Filter 7 Group the group ID with the third lowest sequential value because, of the remaining filter groups (i.e., all the filter groups excluding the Filter 5 Group and the Filter 6 Group), the Filter 7 Group includes the range combination with the lowest sequential value (i.e., combination 403). Filter unit 349 assigns the Filter 8 Group the group ID with the fourth lowest sequential value because, of the remaining filter groups (i.e., all the filter groups excluding the Filter 5 Group, the Filter 6 Group, and the Filter 7 Group), the Filter 8 Group includes the range combination with the fourth lowest sequential value (combination 404). Filter unit 349 assigns the Filter 2 Group the group ID with the fifth lowest sequential value because, of the remaining filter groups (i.e., excluding the Filter 5 Group, the Filter 6 Group, the Filter 7 Group, and the Filter 8 Group), the Filter 2 Group includes the range combination with the lowest sequential value (combination 409). Filter unit 349 assigns the Filter 3 Group the group ID with the sixth lowest sequential value because, of the remaining filter groups (i.e., excluding the Filter 5 Group, the Filter 6 Group, the Filter 7 Group, the Filter 8 Group, and the Filter 2 Group), the Filter 3 Group includes the range combination with the lowest sequential value (combination 411). Filter unit 349 assigns the Filter 1 Group the group ID with the seventh lowest sequential value because, of the remaining filter groups (i.e., excluding the Filter 5 Group, the Filter 6 Group, the Filter 7 Group, the Filter 8 Group, the Filter 2 Group, and the Filter 3 Group), the Filter 1 Group includes the range combination with the lowest sequential value (combination 413). Finally, filter unit 349 assigns the Filter 4 Group, the final remaining filter group, the group ID with the highest sequential value (8 in this particular example). - Based on the filter group IDs,
filter unit 349 determines an order in which to signal the filter coefficients of a filter. Again, using the example of FIG. 4A and Table 1, filter unit 349 first signals the coefficients for Filter 5, then the coefficients for Filter 6, then the coefficients for Filter 7, then the coefficients for Filter 8, then the coefficients for Filter 2, then the coefficients for Filter 3, then the coefficients for Filter 1, and finally the coefficients for Filter 4. Using differential coding techniques, as described in this disclosure, filter unit 349 may code the coefficients for Filter 6 as difference information relative to the filter coefficients of Filter 5, code the coefficients for Filter 7 as difference information relative to the filter coefficients for Filter 6, and so on, based on the sequential ordering of group IDs. - The mapping of two or more metrics for inputs to filters can be implemented in multiple ways. For example, in some implementations each input might have a unique set of filters, while in some implementations inputs share a common set of filters. Additionally, in some implementations, two or more metrics for each input might be used to identify a particular filter for each input. In other implementations, however, two or more metrics for a single input might be used to identify filters for all the inputs. In yet other implementations, two or more metrics for a first input might be used to identify a filter for a second, different input.
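The group ID assignment of Table 1 and the resulting signaling order can be sketched as follows. The mapping data is taken from Table 1; the function names `assign_group_ids` and `differential_encode` and the three-tap coefficient values are hypothetical and serve only to illustrate the idea of coding each set relative to the set signaled immediately before it.

```python
# Filter assigned to each range combination, as listed in Table 1.
COMBO_TO_FILTER = {
    401: 5, 402: 6, 403: 7, 404: 8,
    405: 5, 406: 6, 407: 7, 408: 8,
    409: 2, 410: 2, 411: 3, 412: 3,
    413: 1, 414: 1, 415: 1, 416: 4,
}


def assign_group_ids(combo_to_filter, combo_order):
    """Give the lowest group ID to the group holding the earliest combination."""
    group_ids = {}
    for combo in combo_order:
        filt = combo_to_filter[combo]
        if filt not in group_ids:  # first time this group is reached
            group_ids[filt] = len(group_ids) + 1
    return group_ids


def differential_encode(coefficient_sets):
    """Send the first set as-is, each later set as a difference from the
    previously signaled set (illustrative assumption about the predictor)."""
    residuals, previous = [], None
    for coeffs in coefficient_sets:
        if previous is None:
            residuals.append(list(coeffs))
        else:
            if len(coeffs) != len(previous):
                raise ValueError("coefficient sets must have equal length")
            residuals.append([c - p for c, p in zip(coeffs, previous)])
        previous = coeffs
    return residuals


# Combination IDs in left-to-right, top-to-bottom order: 401, 402, ..., 416.
group_ids = assign_group_ids(COMBO_TO_FILTER, range(401, 417))
signal_order = sorted(group_ids, key=group_ids.get)
print(signal_order)  # [5, 6, 7, 8, 2, 3, 1, 4]

# Hypothetical coefficient sets, already arranged in signaling order.
residuals = differential_encode([[5, -2, 10], [6, -2, 9], [6, -1, 9]])
print(residuals)  # [[5, -2, 10], [1, 0, -1], [0, 1, 0]]
```

Because a decoder can derive the same group IDs from the signaled mapping, it can associate each received coefficient set with a group without any additional filter identifier.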
- In accordance with this disclosure,
filter unit 349 may perform coding techniques with respect to filter information that may reduce the amount of data needed to encode and convey filter information from encoder 350 to another device. Again, for each frame or slice, filter unit 349 may define or select one or more sets of filter coefficients to be applied to the pixels of CUs for that frame or slice. Filter unit 349 applies the filter coefficients in order to filter video blocks of reconstructed video frames stored in memory 334, which may be used for predictive coding consistent with in-loop filtering. Filter unit 349 can encode the filter coefficients as filter information, which is forwarded to entropy encoding unit 346 for inclusion in the encoded bitstream. - Additionally, the techniques of this disclosure may exploit the fact that some of the filter coefficients defined or selected by
FSU 353 may be very similar to other filter coefficients applied with respect to the pixels of CUs of another frame or slice. The same type of filter may be applied for different frames or slices (e.g., the same filter support), but the filters may be different in terms of filter coefficient values associated with the different indices of the filter support. Accordingly, in order to reduce the amount of data needed to convey such filter coefficients, filter unit 349 may predictively encode one or more filter coefficients to be used for filtering based on the filter coefficients of another CU, potentially exploiting similarities between the filter coefficients. In some cases, however, it may be more desirable to encode the filter coefficients directly, e.g., without using any prediction. Various techniques, such as techniques that exploit the use of an activity metric to define when to encode the filter coefficients using predictive coding techniques and when to encode the filter coefficients directly without any predictive coding, can be used for efficiently communicating filter coefficients to a decoder. Additionally, symmetry may also be imposed so that a subset of coefficients (e.g., 5, −2, 10) known by the decoder can be used to define the full set of coefficients (e.g., 5, −2, 10, 10, −2, 5). Symmetry may be imposed in both the direct and the predictive coding scenarios. - As described above,
video encoder 350 represents an example of a video encoder configured to determine a first metric for a group of pixels within a block of pixels, determine a second metric for the group of pixels, determine a filter based on the first metric and the second metric, and generate a filtered image by applying the filter to the group of pixels. Video encoder 350 also represents an example of a video encoder configured to determine a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; determine a second metric for the block of pixels; determine a filter based on the first metric and the second metric; and generate a filtered image by applying the filter to the block of pixels. - As described above,
video encoder 350 also represents an example of a video encoder configured to determine a mapping of range combinations to filters, wherein a range combination comprises a range for a first metric and a range for a second metric, wherein each range combination has a unique range combination identification (ID), wherein each unique range combination ID corresponds to a sequential value for a range combination; assign unique group IDs to groups of range combinations based on the sequential values for the range combinations, wherein each unique group ID corresponds to a sequential value for a group; and code sets of filter coefficients corresponding to the filters based on the unique group IDs. Video encoder 350 can code the sets of filter coefficients by signaling the sets of filter coefficients in a coded bitstream in an order that is selected based on the sequential values of the unique group IDs. Video encoder 350 can signal the sets of filter coefficients using differential coding techniques. - As described above,
video encoder 350 also represents an example of a video encoder configured to determine a mapping of range combinations to filters, wherein a range combination comprises a range of values for a first metric and a range of values for a second metric; generate a first codeword if a current range combination is mapped to the same filter as a previous range combination that comprises the same range of values for the first metric; generate a second codeword if a current range combination is mapped to the same filter as a previous range combination that comprises the same range of values for the second metric; and generate a third codeword if the current range combination is mapped to a different filter than the previous range combination that comprises the same range of values for the first metric and the previous range combination that comprises the same range of values for the second metric. Video encoder 350 also represents an example of a video encoder configured to determine a mapping of range combinations to filters, wherein a range combination comprises a range for a first metric and a range for a second metric; generate a first codeword if a current range combination is mapped to the same filter as a previous range combination; and generate a second codeword if the current range combination is mapped to a different filter than the previous range combination, wherein the second codeword identifies a filter mapped to the current range combination. -
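The three-codeword variant summarized above can be sketched as follows. The grid reuses the filter assignments of Table 1 with four combinations per row; everything else is an assumption made for illustration: columns are taken to index the Metric 1 range and rows the Metric 2 range, combinations are visited row by row, the first codeword takes priority when both neighbors match, and codewords are shown as symbolic labels rather than bit strings. The function name `generate_codewords` is hypothetical.

```python
# Filter IDs per combination (401-416), four per row, taken from Table 1.
GRID = [
    [5, 6, 7, 8],  # combinations 401-404
    [5, 6, 7, 8],  # combinations 405-408
    [2, 2, 3, 3],  # combinations 409-412
    [1, 1, 1, 4],  # combinations 413-416
]


def generate_codewords(grid):
    """Emit one symbolic codeword per combination in row-major order.

    Assumed orientation: the cell above shares the Metric 1 range and the
    cell to the left shares the Metric 2 range with the current cell.
    """
    symbols = []
    for r, row in enumerate(grid):
        for c, filt in enumerate(row):
            above = grid[r - 1][c] if r > 0 else None  # same Metric 1 range
            left = row[c - 1] if c > 0 else None       # same Metric 2 range
            if above is not None and filt == above:
                symbols.append("FIRST")
            elif left is not None and filt == left:
                symbols.append("SECOND")
            else:
                # The third codeword must identify the filter explicitly.
                symbols.append(("THIRD", filt))
    return symbols


symbols = generate_codewords(GRID)
print(symbols[4], symbols[9], symbols[15])  # FIRST SECOND ('THIRD', 4)
```

Under these assumptions only eight of the sixteen combinations need the longer third codeword, which is the saving the shorter first and second codewords are intended to provide.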
FIG. 5 is a block diagram illustrating an example of a video decoder 560, which decodes a video sequence that is encoded in the manner described herein. The received video sequence may comprise an encoded set of image frames, a set of frame slices, a commonly coded group of pictures (GOP), or a wide variety of types of series of video blocks that include encoded video blocks and syntax to define how to decode such video blocks. -
Video decoder 560 includes an entropy decoding unit 552, which performs the reciprocal decoding function of the encoding performed by entropy encoding unit 346 of FIG. 3. In particular, entropy decoding unit 552 may perform CAVLC or CABAC decoding, or any other type of entropy decoding used by video encoder 350. Entropy decoded video blocks in a one-dimensional serialized format may be inverse scanned to convert one or more one-dimensional vectors of coefficients back into a two-dimensional block format. The number and size of the vectors, as well as the scan order defined for the video blocks, may define how the two-dimensional block is reconstructed. Entropy decoded prediction syntax may be sent from entropy decoding unit 552 to prediction module 554, and entropy decoded filter information may be sent from entropy decoding unit 552 to filter unit 559. -
Video decoder 560 also includes a prediction module 554, an inverse quantization unit 556, an inverse transform unit 558, a memory 562, and a summer 564. In addition, video decoder 560 also includes a de-blocking filter 557 that filters the output of summer 564. Consistent with this disclosure, filter unit 559 may receive entropy decoded filter information that includes one or more filters to be applied to one or more inputs. Although not shown in FIG. 5, de-blocking filter 557 may also receive entropy decoded filter information that includes one or more filters to be applied. - The filters applied by
filter unit 559 may be defined by sets of filter coefficients. Filter unit 559 may be configured to generate the sets of filter coefficients based on the filter information received from entropy decoding unit 552. The filter information may include filter description syntax that identifies a maximum number of filters in a set of filters and/or a shape of filters in a set of filters, for example. The filter description syntax can be included in a header of a series of video blocks, e.g., an LCU header, a frame header, a slice header, a GOP header, a sequence header, or the like. In other examples, the filter description syntax might be included in a footer or other data structure. Based on the filter description syntax, filter unit 559 can reconstruct the set of filters used at the encoder. - The filter information may also include additional signaling syntax that signals to the decoder the manner of encoding used for any given set of coefficients. In some implementations, the filter information may, for example, also include ranges for two or more metrics for which any given set of coefficients should be used. Following decoding of the filters,
filter unit 559 can filter the pixel values of decoded video blocks based on the one or more sets of filter coefficients and the signaling syntax that includes the ranges for which the different sets of filter coefficients should be used. -
Filter unit 559 may receive in the bitstream one or more syntax elements indicating a set of filters for each frame or slice as well as a mapping of filters to the two or more metrics. For example, if an encoder uses the mapping of ranges for metrics to filters shown in FIG. 4A, then the encoder will either signal this mapping or transmit data to allow filter unit 559 to reconstruct this mapping. Regardless of whether or not this mapping is explicitly signaled, filter unit 559 can maintain the same mapping of filters to combinations of ranges as used by the encoder. - As mentioned above,
filter unit 559 generates a mapping based on filter information signaled in the bitstream. Based on this mapping, filter unit 559 can determine groups and assign group IDs to the groups in the same manner described above in relation to filter unit 349. Using these group IDs, filter unit 559 can associate received filter coefficients with the corresponding groups. - For each CU within the frame or slice, filter unit 559 can calculate one or more metrics associated with the decoded pixels of a CU for multiple inputs (i.e., PI, EI, pRI, and RI) in order to determine which filter(s) of the set(s) to apply to each input. Alternatively, filter unit 559 may calculate one or more metrics for a single input, such as pRI or RI. Filter unit 559 determines which filter to apply based on the metrics determined for a particular pixel or group of pixels. Using a sum-modified Laplacian value and direction as examples for Metric 1 and Metric 2 and using the mappings shown in FIG. 4A as an example, if filter unit 559 determines that a pixel or group of pixels has a sum-modified Laplacian value in Range 1-2 and a direction corresponding to Range 2-3, then filter unit 559 can apply Filter 2 to that pixel or group of pixels. If filter unit 559 determines that a pixel or group of pixels has a sum-modified Laplacian value in Range 1-4 and a direction corresponding to Range 2-2, then filter unit 559 can apply Filter 6 to that pixel or group of pixels, and so on. The filter may generally assume any type of filter support shape or arrangement. The filter support refers to the shape of the filter with respect to a given pixel being filtered, and the filter coefficients may define weighting applied to neighboring pixel values according to the filter support. According to the techniques of the present disclosure, syntax data may be included in the bitstream to signal to the decoder how the filters were encoded (e.g., how the filter coefficients were encoded), as well as the ranges of the activity metric for which the different filters should be used. -
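The selection of a filter from two metric values can be sketched as a pair of range lookups followed by a table lookup. The thresholds and the 4x4 filter table below are hypothetical and do not reproduce FIG. 4A; the function names `range_index` and `select_filter` are likewise illustrative. Columns are assumed to index the range of Metric 1 (e.g., an activity value) and rows the range of Metric 2 (e.g., a direction).

```python
from bisect import bisect_right

# Hypothetical range boundaries: three thresholds define four ranges.
METRIC1_THRESHOLDS = [10, 20, 30]  # e.g., activity values
METRIC2_THRESHOLDS = [1, 2, 3]     # e.g., direction codes 0..3

# Hypothetical mapping of range combinations to filter IDs
# (rows: Metric 2 range, columns: Metric 1 range).
FILTER_TABLE = [
    [1, 1, 2, 2],
    [3, 3, 4, 4],
    [5, 5, 6, 6],
    [7, 7, 8, 8],
]


def range_index(value, thresholds):
    """Return the 0-based index of the range containing value.

    A value equal to a threshold falls into the upper range.
    """
    return bisect_right(thresholds, value)


def select_filter(metric1, metric2):
    """Pick the filter ID for a pixel or group of pixels from its two metrics."""
    col = range_index(metric1, METRIC1_THRESHOLDS)
    row = range_index(metric2, METRIC2_THRESHOLDS)
    return FILTER_TABLE[row][col]


print(select_filter(15, 2))  # 5
print(select_filter(35, 0))  # 2
```

An encoder and a decoder that compute the same metrics from the same decoded pixels and hold the same table reach the same filter choice without any per-pixel signaling.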
Prediction module 554 receives prediction syntax (such as motion vectors) from entropy decoding unit 552. Using the prediction syntax, prediction module 554 generates the prediction blocks that were used to code video blocks. Inverse quantization unit 556 performs inverse quantization, and inverse transform unit 558 performs inverse transforms to change the coefficients of the residual video blocks back to the pixel domain. Summer 564 combines each prediction block with the corresponding residual block output by inverse transform unit 558 in order to reconstruct the video block. -
Filter unit 559 generates the filter coefficients to be applied for each input of a CU, and then applies such filter coefficients in order to filter the reconstructed video blocks of that CU. The filtering, for example, may comprise additional deblock filtering that smooths edges and/or eliminates artifacts associated with video blocks, denoise filtering to reduce quantization noise, or any other type of filtering that can improve coding quality. The filtered video blocks are accumulated in memory 562 in order to reconstruct decoded frames (or other decodable units) of video information. The decoded units may be output from video decoder 560 for presentation to a user, but may also be stored for use in subsequent predictive decoding. - In the field of video coding, it is common to apply filtering at the encoder and decoder in order to enhance the quality of a decoded video signal. Filtering can be applied via a post-filter, in which case the filtered frame is not used for prediction of future frames. Alternatively, filtering can be applied “in-loop,” in which case the filtered frame may be used to predict future frames. A desirable filter can be designed by minimizing the error between the original signal and the decoded filtered signal. Typically, such filtering has been based on applying one or more filters to a reconstructed image. For example, a deblocking filter might be applied to a reconstructed image prior to the image being stored in memory, or a deblocking filter and one additional filter might be applied to a reconstructed image prior to the image being stored in memory.
- In a manner similar to the quantization of transform coefficients, the coefficients of the filter h(k,l), where k=−K, . . . , K, and l=−L, . . . ,L may also be quantized. K and L may represent integer values. The coefficients of filter h(k,l) may be quantized as:
-
f(k,l)=round(normFact·h(k,l)) - where normFact is a normalization factor and round is the rounding operation performed to achieve quantization to a desired bit-depth. Quantization of filter coefficients may be performed by
filter unit 349 of FIG. 3 during the encoding, and de-quantization or inverse quantization may be performed on decoded filter coefficients by filter unit 559 of FIG. 5. Filter h(k,l) is intended to generically represent any filter. For example, filter h(k,l) could be applied to any one of multiple inputs. In some instances, multiple inputs associated with a video block will utilize different filters, in which case multiple filters similar to h(k,l) may be quantized and de-quantized as described above. - The quantized filter coefficients are encoded and sent from a source device associated with
encoder 350 to a destination device associated with decoder 560 as part of an encoded bitstream. In the example above, the value of normFact is usually equal to $2^n$, although other values could be used. Larger values of normFact lead to more precise quantization such that the quantized filter coefficients f(k,l) provide better performance. However, larger values of normFact may produce coefficients f(k,l) that require more bits to signal to the decoder. - At
decoder 560 the decoded filter coefficients f (k,l) may be applied to the appropriate input. For example, if the decoded filter coefficients are to be applied to RI, the filter coefficients may be applied to the post-deblocked reconstructed image RI(i,j), where i=0, . . . , M and j=0, . . . , N as follows: -
- The variables M, N, K and L may represent integers. K and L may define a block of pixels that spans two-dimensions from −K to K and from −L to L. Filters applied to other inputs can be applied in an analogous manner.
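The quantization f(k,l)=round(normFact·h(k,l)) and the application of the decoded coefficients to RI(i,j) can be sketched together. The filtering form used here is an assumption chosen to be consistent with the variable definitions above, not a formula taken from this text: a weighted sum over the filter support, normalized by the sum of the coefficients,

$$\widetilde{RI}(i,j) = \frac{\sum_{k=-K}^{K}\sum_{l=-L}^{L} f(k,l)\,RI(i+k,\,j+l)}{\sum_{k=-K}^{K}\sum_{l=-L}^{L} f(k,l)}.$$

The function names, the clamping of coordinates at the image border, and the rounding of the normalized result are likewise illustrative choices.

```python
def quantize_filter(h, n):
    """Quantize real-valued coefficients h(k,l) with normFact = 2**n.

    Python's round() resolves exact .5 ties to the nearest even integer;
    a codec would fix its own rounding rule.
    """
    norm_fact = 1 << n
    return [[round(norm_fact * c) for c in row] for row in h]


def apply_filter(image, f):
    """Apply quantized coefficients f to a 2-D image (list of rows).

    f has (2K+1) rows and (2L+1) columns, with f[k + K][l + L] = f(k,l).
    Border pixels are handled by clamping coordinates (an assumption).
    """
    rows, cols = len(image), len(image[0])
    K, L = len(f) // 2, len(f[0]) // 2
    total = sum(sum(row) for row in f)
    if total <= 0:
        raise ValueError("coefficient sum must be positive for normalization")
    out = []
    for i in range(rows):
        out_row = []
        for j in range(cols):
            acc = 0
            for k in range(-K, K + 1):
                for l in range(-L, L + 1):
                    ii = min(max(i + k, 0), rows - 1)
                    jj = min(max(j + l, 0), cols - 1)
                    acc += f[k + K][l + L] * image[ii][jj]
            out_row.append((acc + total // 2) // total)  # rounded division
        out.append(out_row)
    return out


# Hypothetical 3x3 filter (K = L = 1), quantized with normFact = 2**3.
h = [[0.0, 0.125, 0.0], [0.125, 0.5, 0.125], [0.0, 0.125, 0.0]]
f = quantize_filter(h, 3)
print(f)  # [[0, 1, 0], [1, 4, 1], [0, 1, 0]]

flat = [[7] * 4 for _ in range(3)]
print(apply_filter(flat, f) == flat)  # True: a flat image is unchanged
```

Increasing n in `quantize_filter` yields larger integers that approximate h(k,l) more closely, which mirrors the trade-off noted above between quantization precision and the number of bits needed to signal the coefficients.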
- The techniques of this disclosure may improve the performance of a post-filter or in-loop filter, and may also reduce the number of bits needed to signal filter coefficients f(k,l). In some cases, a number of different post-filters or in-loop filters are signaled to the decoder for each series of video blocks, e.g., for each frame, slice, portion of a frame, group of frames (GOP), or the like. For each filter, additional information is included in the bitstream to identify the CUs, macroblocks and/or pixels for which a given filter should be applied.
- The frames may be identified by frame number and/or frame type (e.g., I-frames, P-frames or B-frames). I-frames refer to intra-frames that are intra-predicted. P-frames refer to predictive frames that have video blocks predicted based on one list of data (e.g., one previous frame). B-frames refer to bidirectional predictive frames that are predicted based on two lists of data (e.g., a previous and subsequent frame). Macroblocks can be identified by listing macroblock types and/or a range of quantization parameter (QP) values used to reconstruct the macroblock.
- Filter coefficients f(k,l), for any input, may be coded using prediction from coefficients signaled for previous CUs. For each input of a CU m (e.g., each frame, slice or GOP), the encoder may encode and transmit a set of M filters:
-
$g_i^m$, where $i = 0, \ldots, M-1$. - For each filter, the bitstream may also be encoded to identify the combination of ranges for two or more metrics for which the filter should be used.
- The filter coefficients can be predicted using reconstructed filter coefficients used in a previous CU. The previous filter coefficients may be represented as:
-
$f_i^n$, where $i = 0, \ldots, N-1$. - In this case, the number of the CU n may be used to identify one or more filters used for prediction of the current filters, and the number n may be sent to the decoder as part of the encoded bitstream. In addition, information can be encoded and transmitted to the decoder to identify combinations of ranges for two or more metrics for which predictive coding is used.
- The amplitude of the filter coefficients g(k,l) depends on the values of k and l. Usually, the coefficient with the largest amplitude is the coefficient g(0,0). The other coefficients that are expected to have large amplitudes are those for which the value of k or l is equal to 0. This phenomenon may be utilized to further reduce the number of bits needed to signal the coefficients. The index values k and l may define locations within a known filter support.
- The coefficients:
-
g_i^m(k,l), i=0, . . . ,M−1
- for each frame m may be coded using parameterized variable length codes such as Golomb or exp-Golomb codes defined according to a parameter p. By changing the value of parameter p that defines the parameterized variable length codes, these codes can be used to efficiently represent a wide range of source distributions. The distribution of coefficients g(k,l) (i.e., their likelihood to have large or small values) depends on values of k and l. Hence, to increase coding efficiency, for each frame m, the value of parameter p is transmitted for each pair (k,l). The parameter p can be used for parameterized variable length coding when encoding coefficients:
-
g_i^m(k,l), where k=−K, . . . ,K, l=−L, . . . ,L.
- As described above, video decoder 560 represents an example of a video decoder configured to determine a first metric for a group of pixels within a block of pixels, determine a second metric for the group of pixels, determine a filter based on the first metric and the second metric, and generate a filtered image by applying the filter to the group of pixels. Video decoder 560 also represents an example of a video decoder configured to determine a first metric for a block of pixels, wherein the first metric is determined based on a comparison of a subset of the pixels in the block to other pixels in the block; determine a second metric for the block of pixels; determine a filter based on the first metric and the second metric; and generate a filtered image by applying the filter to the block of pixels.
- As described above, video decoder 560 also represents an example of a video decoder configured to determine a mapping of range combinations to filters, wherein a range combination comprises a range for a first metric and a range for a second metric, wherein each range combination has a unique range combination identification (ID), wherein each unique range combination ID corresponds to a sequential value for a range combination; assign unique group IDs to groups of range combinations based on the sequential values for the range combinations, wherein each unique group ID corresponds to a sequential value for a group; and code sets of filter coefficients corresponding to the filters based on the unique group IDs. Video decoder 560 can code the sets of filter coefficients by generating the sets of filter coefficients based on information received in a coded bitstream. Video decoder 560 can generate the sets of filter coefficients using differential coding techniques.
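- As a non-limiting illustration of the parameterized variable length coding of filter coefficients described above, the following minimal sketch encodes and decodes coefficients with an order-p exp-Golomb code, using a separate value of p for each position (k,l). The function names, the bit-string representation, the signed-to-unsigned mapping, and the table p_table are assumptions made for this sketch and are not taken from this disclosure.

```python
from typing import Tuple


def signed_to_unsigned(value: int) -> int:
    """Map a signed coefficient to a non-negative index.

    Assumed mapping: 0, 1, -1, 2, -2, ... -> 0, 1, 2, 3, 4, ...
    """
    return 2 * value - 1 if value > 0 else -2 * value


def unsigned_to_signed(index: int) -> int:
    """Inverse of signed_to_unsigned."""
    return (index + 1) // 2 if index % 2 else -(index // 2)


def exp_golomb_encode(index: int, p: int) -> str:
    """Return the order-p exp-Golomb code word of a non-negative integer."""
    if index < 0 or p < 0:
        raise ValueError("index and p must be non-negative")
    shifted = index + (1 << p)
    # The prefix of zeros tells the decoder how many bits follow.
    prefix_zeros = shifted.bit_length() - p - 1
    return "0" * prefix_zeros + format(shifted, "b")


def exp_golomb_decode(bits: str, p: int) -> Tuple[int, int]:
    """Decode one code word from the front of bits.

    Returns (index, number_of_bits_consumed).
    """
    zeros = 0
    while bits[zeros] == "0":
        zeros += 1
    length = zeros + p + 1
    shifted = int(bits[zeros:zeros + length], 2)
    return shifted - (1 << p), zeros + length


# Hypothetical per-position parameters: a larger p where large values are likely.
p_table = {(0, 0): 3, (0, 1): 1, (1, 1): 0}
coefficients = {(0, 0): 37, (0, 1): -5, (1, 1): 1}

bitstream = "".join(
    exp_golomb_encode(signed_to_unsigned(c), p_table[pos])
    for pos, c in coefficients.items()
)

decoded = {}
offset = 0
for pos in coefficients:
    index, used = exp_golomb_decode(bitstream[offset:], p_table[pos])
    decoded[pos] = unsigned_to_signed(index)
    offset += used
```

- In this sketch, a position such as (0,0), where coefficients are expected to be large, uses a larger p so that large values do not require long prefixes, while positions with typically small coefficients use a smaller p.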
- Video decoder 560 also represents an example of a video decoder configured to map a first range combination to a first filter, wherein the first range combination comprises a first range of values for a first metric and a first range of values for a second metric; map a second range combination to a second filter, wherein the second range combination comprises a second range of values for the first metric and a second range of values for the second metric; and map a current range combination to a filter, wherein the current range combination comprises the first range of values of the first metric and the second range of values for the second metric. Mapping the current range combination to the filter can include mapping the current range combination to the first filter in response to receiving a first codeword, wherein the first codeword indicates the current range combination is mapped to the same filter as the first range combination; mapping the current range combination to the second filter in response to receiving a second codeword, wherein the second codeword indicates the current range combination is mapped to the same filter as the second range combination; and mapping the current range combination to a third filter in response to receiving a third codeword, wherein the third codeword identifies the third filter.
- Video decoder 560 also represents an example of a video decoder configured to generate a mapping of range combinations to filters, wherein a range combination comprises a range for a first metric and a range for a second metric; map a current range combination to a same filter as a previous range combination in response to receiving a first codeword signaling the current range combination is mapped to the same filter as the previous range combination; and map the current range combination to a filter identified by a second codeword in response to receiving the second codeword signaling the current range combination is mapped to a different filter than the previous range combination.
- As has been introduced above, several different types of metrics can be used in conjunction with the multi-metric filtering techniques described in this disclosure. Some of these metrics are activity metrics that quantify activity associated with one or more blocks of pixels within the video data. Activity metrics can comprise variance metrics indicative of pixel variance within a set of pixels. As will be described, some of these activity metrics are direction-specific. For example, a horizontal activity metric quantifies activity along a horizontal axis, a vertical activity metric quantifies activity along a vertical axis, a diagonal activity metric quantifies activity along a diagonal axis, and so on.
- Some activity metrics are not direction-specific. For example, a sum-modified Laplacian value is an activity metric based on a two-dimensional window of pixels that surround a current pixel or current group of pixels. For a current pixel (i,j), a sum-modified Laplacian value can be calculated as follows:
-
- where k represents a value of a summation of pixel values from −K to K and l represents a value of a summation from −L to L for a two-dimensional window that spans from −K to K and −L to L, wherein i and j represent pixel coordinates of the pixel data, RI(i,j) represents a given pixel value at coordinates i and j, and var(i,j) is the activity metric (i.e. the sum-modified Laplacian value).
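- For reference, a sum-modified Laplacian that is consistent with the symbol definitions above is commonly written in the following form. This is a reconstruction offered under the assumption that R(i,j) denotes the pixel value at coordinates (i,j); it is not reproduced from the original equation.

```latex
\mathrm{var}(i,j) = \sum_{k=-K}^{K} \sum_{l=-L}^{L}
  \Big( \left| 2R(i+k,\,j+l) - R(i+k-1,\,j+l) - R(i+k+1,\,j+l) \right|
      + \left| 2R(i+k,\,j+l) - R(i+k,\,j+l-1) - R(i+k,\,j+l+1) \right| \Big)
```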
- The techniques of the present disclosure may also be implemented using direction-specific metrics for horizontal activity, vertical activity, and diagonal activity.
- Equations 2 and 3 show examples of how horizontal activity and vertical activity can be computed for a current pixel (x, y):
-
Hor_act(x,y)=R(2*Rec[x][y]−Rec[x+1][y]−Rec[x−1][y]) (2)
-
Ver_act(x,y)=R(2*Rec[x][y]−Rec[x][y+1]−Rec[x][y−1]) (3)
- As shown by equation 2, when determining horizontal activity, the current pixel (x,y) can be compared to a left neighbor (x−1, y) and a right neighbor (x+1, y). As shown by equation 3, when determining vertical activity, the current pixel can be compared to an upper neighbor (x, y+1) and a lower neighbor (x, y−1).
- Equations 4 and 5 show examples of how diagonal activity can be computed for a current pixel (x, y):
-
45 deg_act(x,y)=R(2*Rec[x][y]−Rec[x+1][y+1]−Rec[x−1][y−1]) (4)
-
135 deg_act(x,y)=R(2*Rec[x][y]−Rec[x−1][y+1]−Rec[x+1][y−1]) (5)
- As shown by equation 4, diagonal activity can be computed, for example, in the 45 degree direction by comparing a current pixel (x, y) to an upper-right neighbor (x+1, y+1) and a lower-left neighbor (x−1, y−1). As shown by equation 5, diagonal activity may also be computed in the 135 degree direction by comparing a current pixel (x, y) to an upper-left neighbor (x−1, y+1) and a lower-right neighbor (x+1, y−1).
- Equations 2-5, above, illustrate how horizontal activity, vertical activity, and diagonal activity can be determined on a pixel-by-pixel basis, but in some implementations, horizontal activity, vertical activity, and diagonal activity may be determined on a group-by-group basis, where a group of pixels is a 2×2, 4×4, or M×N block of pixels. In such an implementation, horizontal activity, for example, can be determined by comparing pixel values of a current group to pixel values of a left group and a right group, in an analogous manner to equation 2; and the vertical activity can be determined by comparing a current group to an upper group and a lower group, in an analogous manner to equation 3. Likewise, 45-degree diagonal activity can be determined by comparing a current group of pixels to an upper-right neighboring group and a lower-left neighboring group in an analogous manner to equation 4, and 135-degree diagonal activity can be determined by comparing a current group of pixels to an upper-left neighboring group and a lower-right neighboring group, in an analogous manner to equation 5.
- In some implementations, horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity can be determined by comparing a current pixel or group of pixels to neighboring pixels or groups of pixels in only one direction. For example, instead of determining horizontal activity based on comparing a current pixel to a left neighbor and a right neighbor, horizontal activity might be determined based on only a left neighbor or only a right neighbor. Additionally, in some implementations, horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity may be determined using averages or weighted averages of areas of neighboring pixels instead of single neighboring pixels or single groups of pixels.
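- As a non-limiting illustration, the pixel-by-pixel computations of equations 2-5 can be sketched as follows. The function name and the returned dictionary keys are hypothetical, and the sketch assumes that R(.) denotes an absolute value, which this passage does not state explicitly. The vertical term uses the upper neighbor (x, y+1) and the lower neighbor (x, y−1), following the description of equation 3.

```python
from typing import Dict, List


def directional_activity(rec: List[List[int]], x: int, y: int) -> Dict[str, int]:
    """Return the four direction-specific activities of pixel (x, y).

    rec is indexed as rec[x][y], mirroring Rec[x][y] in equations 2-5.
    R(.) is assumed to be the absolute value. The pixel must not lie on
    the border of rec, because all eight neighbors are read.
    """
    center = 2 * rec[x][y]
    return {
        "hor": abs(center - rec[x + 1][y] - rec[x - 1][y]),
        "ver": abs(center - rec[x][y + 1] - rec[x][y - 1]),
        "deg45": abs(center - rec[x + 1][y + 1] - rec[x - 1][y - 1]),
        "deg135": abs(center - rec[x - 1][y + 1] - rec[x + 1][y - 1]),
    }


# Pixel values change only along x, so vertical activity is zero.
rec = [
    [0, 0, 0],
    [10, 10, 10],
    [0, 0, 0],
]
activity = directional_activity(rec, 1, 1)
```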
- The values resulting from equations 2-5 can be divided into a finite number of ranges, such as 2, 4, 8, or any other finite number, and each range can be assigned a range identification. Referring back to FIG. 4A, for example, Range 1-1, Range 1-2, Range 2-1, etc. are all examples of range identifications. As one example, horizontal activity values can be divided into four ranges, and the ranges might be assigned IDs Range 1-1, Range 1-2, Range 1-3, and Range 1-4. Horizontal threshold values (i.e., ThH1, . . . , ThHP−1) can determine where the ranges begin and end. Table 2 below shows the generic case of how horizontal IDs might be assigned to P ranges.
TABLE 2: Index of activity metric
Condition of Hor_act_B | Horizontal ID
Hor_act_B < ThH1 | Range 2-1
ThH1 ≦ Hor_act_B < ThH2 | Range 2-2
. . . | . . .
ThHP−1 ≦ Hor_act_B | Range 2-P
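- As a non-limiting illustration, the threshold-based assignment of Table 2, and the use of two range indices to select a filter, can be sketched as follows. The function names, the threshold values, and the contents of filter_table are hypothetical and do not reproduce the mapping of FIG. 4A.

```python
from bisect import bisect_right
from typing import Dict, Sequence, Tuple


def range_index(value: float, thresholds: Sequence[float]) -> int:
    """Return the 1-based range index of value.

    thresholds holds Th1 .. Th(P-1) in ascending order, giving P ranges:
    value < Th1 -> 1, Th1 <= value < Th2 -> 2, ..., Th(P-1) <= value -> P.
    """
    return bisect_right(thresholds, value) + 1


def select_filter(
    metric1: float,
    metric2: float,
    thresholds1: Sequence[float],
    thresholds2: Sequence[float],
    table: Dict[Tuple[int, int], int],
) -> int:
    """Look up a filter ID from the range indices of two metrics."""
    key = (range_index(metric1, thresholds1), range_index(metric2, thresholds2))
    return table[key]


# Hypothetical thresholds and a hypothetical 2x2 mapping of range
# combinations to filter IDs.
horizontal_thresholds = [10, 20, 40]
filter_table = {(1, 1): 0, (1, 2): 1, (2, 1): 2, (2, 2): 3}
chosen = select_filter(3, 30, [10], [20], filter_table)
```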
- Using the example of Table 2, if a current pixel has a horizontal activity value greater than or equal to ThH1 but less than ThH2, then the current pixel is in Range 2-2 for Metric 2. Current pixels may be assigned to vertical ranges with vertical IDs, 45-degree diagonal ranges with 45-degree diagonal IDs, and 135-degree diagonal ranges with 135-degree diagonal IDs, in a similar manner as described above in Table 2 for horizontal ranges and horizontal IDs.
- Any of horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity can be used as a metric in accordance with the multi-metric filtering techniques described in this disclosure. For example, referring again back to FIG. 4A, Metric 1 might be a measure of vertical activity, and Metric 2 might be a measure of horizontal activity. In such an example, a filter unit, such as filter unit 349 of FIG. 4A or filter unit 559 of FIG. 5, can determine a filter for a pixel or group of pixels based on the horizontal activity of the pixel or group of pixels and the vertical activity of the pixel or group of pixels. If, for example, a current pixel has a horizontal activity metric that falls in Range 2-3 and a vertical activity metric that falls in Range 1-3, then the filter unit filters the pixel using Filter 4. In a similar manner, combinations of 45-degree diagonal activity and 135-degree diagonal activity, 45-degree diagonal activity and horizontal activity, 45-degree diagonal activity and vertical activity, 135-degree diagonal activity and horizontal activity, or 135-degree diagonal activity and vertical activity may also be used by a filter unit for selecting a filter for a pixel or group of pixels. In some implementations, three or all four of horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity may be used by a filter unit for selecting a filter for a pixel or group of pixels.
- In the implementations described above, horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity can all be used as metrics, as Metric 1 and/or Metric 2 in FIG. 4A, for example. In some implementations, however, horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity might not be metrics themselves, but instead can be used as intermediate determinations for determining an overall direction metric. The direction metric generally describes in which direction (e.g. no direction, horizontal, vertical, 45-degree diagonal, or 135-degree diagonal) the pixels are changing the most.
- In one example, using only horizontal activity and vertical activity as described in equations 2 and 3, a direction for a pixel might be determined based on the following conditions:
Direction 1=horizontal, if Hor_activity>k1*Ver_activity
-
Direction 2=vertical, if Ver_activity>k2*Hor_activity
-
Direction 0=no direction, otherwise.
- Constants, k1 and k2, can be selected such that the direction is only deemed to be direction 1 or direction 2 if horizontal activity is substantially greater than vertical activity or vertical activity is substantially greater than horizontal activity. If horizontal activity and vertical activity are equal or approximately equal, then the direction is direction 0. Direction 1 generally indicates that the pixel values are changing more in the horizontal direction than in the vertical direction, and direction 2 indicates that pixel values are changing more in the vertical direction than in the horizontal direction. Direction 0 indicates that the change in pixel values in the horizontal direction is approximately equal to the change in pixel values in the vertical direction.
- The determined direction metric (e.g. direction 0, direction 1, direction 2) can be used as a metric in the multi-metric filtering techniques described in this disclosure. Using the example of FIG. 4A again, Metric 1 might be a variance metric, such as a sum-modified Laplacian value, while Metric 2 might be a direction determination as described above. As described in reference to FIG. 4A, each of direction 1, direction 2, and direction 0 can be associated with a range of Metric 2 even though direction 1, direction 2, and direction 0 represent finite determinations instead of a spectrum of values.
- In addition to using only horizontal activity and vertical activity as described above, techniques of this disclosure also include using 45-degree diagonal activity and 135-degree diagonal activity, as described in equations 4 and 5, to determine a direction based on the following conditions:
Direction=1, if 45 deg_activity>k1*135 deg_activity
-
Direction=2, if 135 deg_activity>k2*45 deg_activity
-
Direction=0, otherwise.
- Direction determinations based on 45-degree diagonal activity and 135-degree diagonal activity can be used as a metric with another metric, such as a sum-modified Laplacian value, as described above.
- Additionally, a direction metric may also be determined, based on the following conditions:
-
Direction=1, if 45 deg_activity>k1*135 deg_activity, k2*Hor_activity, AND k3*Ver_activity
-
Direction=2, if 135 deg_activity>k4*45 deg_activity, k5*Hor_activity, AND k6*Ver_activity
-
Direction=3, if Hor_activity>k7*Ver_activity, k8*135 deg_activity, AND k9*45 deg_activity
-
Direction=4, if Ver_activity>k10*Hor_activity, k11*135 deg_activity, AND k12*45 deg_activity
-
Direction=0, otherwise.
- As described above, k1 through k12 are constants selected to determine how much greater one of horizontal activity, vertical activity, 45-degree activity, and 135-degree activity needs to be compared to the others in order for a certain direction to be selected. Direction determinations based on horizontal activity, vertical activity, 45-degree diagonal activity, and 135-degree diagonal activity can be used as a metric with another metric, such as a sum-modified Laplacian value, as described above.
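- As a non-limiting illustration, the five-way direction determination above can be sketched as follows. The function name and the default value of 2.0 for each of k1 through k12 are assumptions chosen for the sketch; this disclosure does not specify values for the constants.

```python
from typing import Sequence


def direction_metric(
    hor: float,
    ver: float,
    deg45: float,
    deg135: float,
    k: Sequence[float] = (2.0,) * 12,
) -> int:
    """Return 1 (45-degree), 2 (135-degree), 3 (horizontal), 4 (vertical)
    or 0 (no direction), following the conditions listed above.

    k holds the constants k1..k12 in order; the defaults are hypothetical.
    """
    k1, k2, k3, k4, k5, k6, k7, k8, k9, k10, k11, k12 = k
    if deg45 > k1 * deg135 and deg45 > k2 * hor and deg45 > k3 * ver:
        return 1
    if deg135 > k4 * deg45 and deg135 > k5 * hor and deg135 > k6 * ver:
        return 2
    if hor > k7 * ver and hor > k8 * deg135 and hor > k9 * deg45:
        return 3
    if ver > k10 * hor and ver > k11 * deg135 and ver > k12 * deg45:
        return 4
    return 0
```

- When every constant is at least 1, at most one of the four conditions can hold, so the order of the tests does not matter. With constants below 1, more than one condition may hold, and this sketch returns the first match.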
- Another metric that can be used with the techniques of this disclosure is an edge metric. An edge metric generally quantifies activity that might be indicative of the presence of an edge in a block of pixels. An edge may occur, for example, in a block of pixels if that block of pixels contains the boundary of an object within an image. One example of edge detection includes using a current pixel's four neighboring pixels (e.g., left, right, top, bottom) or using the current pixel's eight neighboring pixels (left, right, top, bottom, top right, top left, bottom right, bottom left). Additionally, edge type detection may include using two neighboring pixels, such as top and bottom, left and right, top left and bottom right, or top right and bottom left.
- The pseudo code below shows examples of how edge information can be computed for a current pixel (x, y) by comparing a pixel value (Rec), such as intensity, of the current pixel to the pixel values of those neighboring pixels (i.e., 4/8 pixels).
- An EdgeType variable is initialized to 0. Each time a statement is true, the EdgeType variable is either incremented by 1 (as shown in the pseudo code by EdgeType ++) or decremented by 1 (as shown in the pseudo code by EdgeType −−). Rec[x][y] refers to a pixel value, such as the pixel intensity, of the pixel located at (x, y). The first grouping of “if” statements is for comparing the current pixel to top, bottom, left, and right neighbors. The second grouping of “if” statements is for comparing the current pixel to the top-left, top-right, bottom-left, and bottom-right neighbors. The techniques of this disclosure can be implemented using either group or both groups.
-
- EdgeType=0;
- if (Rec[x][y]>Rec[x−1][y]) EdgeType ++;
- if (Rec[x][y]<Rec[x−1][y]) EdgeType −−;
- if (Rec[x][y]>Rec[x+1][y]) EdgeType ++;
- if (Rec[x][y]<Rec[x+1][y]) EdgeType −−;
- if (Rec[x][y]>Rec[x][y−1]) EdgeType ++;
- if (Rec[x][y]<Rec[x][y−1]) EdgeType −−;
- if (Rec[x][y]>Rec[x][y+1]) EdgeType ++;
- if (Rec[x][y]<Rec[x][y+1]) EdgeType −−;
- if (Rec[x][y]>Rec[x−1][y−1]) EdgeType ++;
- if (Rec[x][y]<Rec[x−1][y−1]) EdgeType −−;
- if (Rec[x][y]>Rec[x+1][y−1]) EdgeType ++;
- if (Rec[x][y]<Rec[x+1][y−1]) EdgeType −−;
- if (Rec[x][y]>Rec[x−1][y+1]) EdgeType ++;
- if (Rec[x][y]<Rec[x−1][y+1]) EdgeType −−;
- if (Rec[x][y]>Rec[x+1][y+1]) EdgeType ++;
- if (Rec[x][y]<Rec[x+1][y+1]) EdgeType −−;
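- As a non-limiting illustration, the pseudo code above can be written as a runnable function. The function name and the use_diagonals flag are hypothetical; the flag selects between the first grouping of comparisons alone and both groupings together.

```python
from typing import List


def edge_type(rec: List[List[int]], x: int, y: int, use_diagonals: bool = False) -> int:
    """Return the EdgeType value of pixel (x, y).

    rec is indexed as rec[x][y], mirroring Rec[x][y] in the pseudo code.
    The pixel must not lie on the border of rec.
    """
    # First grouping: left, right, and the two vertical neighbors.
    offsets = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    if use_diagonals:
        # Second grouping: the four diagonal neighbors.
        offsets += [(-1, -1), (1, -1), (-1, 1), (1, 1)]

    current = rec[x][y]
    total = 0
    for dx, dy in offsets:
        neighbor = rec[x + dx][y + dy]
        # +1 if the current pixel is greater, -1 if smaller, 0 if equal.
        total += (current > neighbor) - (current < neighbor)
    return total


local_max = [[1, 1, 1], [1, 9, 1], [1, 1, 1]]
local_min = [[9, 9, 9], [9, 1, 9], [9, 9, 9]]
```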
- If a current pixel is a local maximum, then the pixel value of the pixel will be greater than all its neighbors and will have an edge type of 4 if using four neighbors or an edge type of 8 if using eight neighbors. If a current pixel is a local minimum, then the pixel value of the pixel will be less than all its neighbors and will have an edge type of −4 if using four neighbors or an edge type of −8 if using eight neighbors. Thus, an edge type between −4 and 4 or between −8 and 8, determined using the example techniques described above, can be used in determining a filter. The values determined for the edge type (i.e. values of −4 to 4 or values of −8 to 8) can be mapped to ranges of a metric, such as Metric 1 or Metric 2 of FIG. 4A. In some implementations, absolute values of the edge type determination might be mapped to ranges, such that an edge type of −3 and 3, for example, would map to the same filter.
- The calculations of the various metrics described in this disclosure are only intended to be examples and are not exhaustive. For example, the metrics can be determined using windows or lines of pixels that include more neighboring pixels than described in this disclosure.
- Additionally, in some implementations, the metrics described in this disclosure may be calculated using sub-sampling of the pixels in a particular line or window. For example, to calculate a block activity metric for a 4×4 block of pixels, metrics for activity and direction can be calculated as follows:
- Direction Metric
-
Ver_act(i,j)=abs((X(i,j)<<1)−X(i,j−1)−X(i,j+1))
-
Hor_act(i,j)=abs((X(i,j)<<1)−X(i−1,j)−X(i+1,j))
-
HB=Σ_{i=0,2} Σ_{j=0,2} Hor_act(i,j)
-
VB=Σ_{i=0,2} Σ_{j=0,2} Ver_act(i,j)
-
Direction=0, 1 (HB>k1*VB), 2 (VB>k2*HB)
- Activity Metric
-
LB=HB+VB
- 5 classes (0, 1, 2, 3, 4)
- Metric
- Combination of Activity and Direction (e.g. 15 or 16 combinations as explained above in the example of FIG. 4B)
- Hor_act(i, j) generally refers to the horizontal activity of current pixel (i, j), and Ver_act(i, j) generally refers to the vertical activity of current pixel (i, j). X(i, j) generally refers to a pixel value of pixel (i, j). HB refers to the horizontal activity of the 4×4 block, which in this example is determined based on a sum of horizontal activity for pixels (0, 0), (0, 2), (2, 0), and (2, 2). VB refers to the vertical activity of the 4×4 block, which in this example is determined based on a sum of vertical activity for pixels (0, 0), (0, 2), (2, 0), and (2, 2). “<<1” represents a multiply by two operation. As explained above, based on the values of HB and VB, a direction can be determined. Using the example above, if the value of HB is more than k1 times the value of VB, then the direction can be determined to be direction 1 (i.e. horizontal), which might correspond to more horizontal activity than vertical activity. If the value of VB is more than k2 times the value of HB, then the direction can be determined to be direction 2 (i.e. vertical), which might correspond to more vertical activity than horizontal activity. Otherwise, the direction can be determined to be direction 0 (i.e. no direction), meaning neither horizontal nor vertical activity is dominant. The labels for the various directions and the ratios used to determine the directions merely constitute one example, as other labels and ratios can also be used.
- Activity (LB) for the 4×4 block can be determined as a sum of the horizontal and vertical activity. The value of LB can be classified into a range, as described above. This particular example shows five ranges although more or fewer ranges may similarly be used. Based on the combination of activity and direction, a filter for the 4×4 block of pixels can be selected. As described above, a filter may be selected based on a two-dimensional mapping of activity and direction to filters, as described in reference to FIGS. 4A and 4B, or activity and direction may be combined into a single metric, and that single metric may be used to select a filter.
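- As a non-limiting illustration, the sub-sampled direction and activity computation for a 4×4 block, using pixels (0, 0), (0, 2), (2, 0), and (2, 2), can be sketched as follows. The function name, the picture[row][column] layout, the default values of k1 and k2, and the activity thresholds are assumptions made for the sketch.

```python
from bisect import bisect_right
from typing import List, Sequence, Tuple


def block_metrics(
    picture: List[List[int]],
    x0: int,
    y0: int,
    k1: float = 2.0,
    k2: float = 2.0,
    activity_thresholds: Sequence[int] = (8, 16, 32, 64),
) -> Tuple[int, int]:
    """Return (direction, activity_class) for one 4x4 block.

    The block's top-left pixel is at column x0, row y0 of picture, and
    X(i, j) maps to picture[y0 + j][x0 + i] (an assumed layout). The block
    needs a one-pixel margin (x0 >= 1, y0 >= 1), because pixel (0, 0)
    reads neighbors at i = -1 and j = -1 in neighboring blocks.
    """
    def X(i: int, j: int) -> int:
        return picture[y0 + j][x0 + i]

    hb = 0
    vb = 0
    for i in (0, 2):
        for j in (0, 2):
            hb += abs((X(i, j) << 1) - X(i - 1, j) - X(i + 1, j))
            vb += abs((X(i, j) << 1) - X(i, j - 1) - X(i, j + 1))

    if hb > k1 * vb:
        direction = 1  # horizontal activity dominates
    elif vb > k2 * hb:
        direction = 2  # vertical activity dominates
    else:
        direction = 0  # no dominant direction

    lb = hb + vb
    # Four hypothetical thresholds give the five classes 0..4.
    activity_class = bisect_right(activity_thresholds, lb)
    return direction, activity_class


# Pixel values change from column to column only.
column_stripes = [[100 if c % 2 else 0 for c in range(6)] for r in range(6)]
# Pixel values change from row to row only.
row_stripes = [[100 if r % 2 else 0 for c in range(6)] for r in range(6)]
```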
- FIG. 6A represents a 4×4 block of pixels. Using the sub-sampling techniques described above, only four of the sixteen pixels are used. The four pixels are pixel (0, 0) which is labeled as pixel 601, pixel (2, 0) which is labeled as pixel 602, pixel (0, 2) which is labeled as pixel 603, and pixel (2, 2) which is labeled as pixel 604. The horizontal activity of pixel 601 (i.e. hor_act(0, 0)), for example, is determined based on a left neighboring pixel and a right neighboring pixel. The right neighboring pixel is labeled as pixel 605. The left neighboring pixel is located in a different block than the 4×4 block and is not shown on FIG. 6A. The vertical activity of pixel 602 (i.e. ver_act(2, 0)), for example, is determined based on an upper neighboring pixel and a lower neighboring pixel. The lower neighboring pixel is labeled as pixel 606, and the upper neighboring pixel is located in a different block than the 4×4 block and is not shown in FIG. 6A.
- Generally using the same techniques described above, a block activity metric may also be calculated using a different subset of pixels as follows:
- Direction Metric
-
Ver_act(i,j)=abs((X(i,j)<<1)−X(i,j−1)−X(i,j+1))
-
Hor_act(i,j)=abs((X(i,j)<<1)−X(i−1,j)−X(i+1,j))
-
HB=Σ_{i=1,2} Σ_{j=1,2} Hor_act(i,j)
-
VB=Σ_{i=1,2} Σ_{j=1,2} Ver_act(i,j)
-
Direction=0, 1 (HB>k1*VB), 2 (VB>k2*HB)
- Activity Metric
-
LB=HB+VB
- 5 classes (0, 1, 2, 3, 4)
- Metric
- Combination of Activity and Direction (e.g. 15 or 16 combinations as explained above in the example of FIG. 4B)
- This different subset of pixels for calculating HB and VB includes pixels (1, 1), (2, 1), (1, 2), and (2, 2), shown on FIG. 6B. As can be seen in FIG. 6B, all of the upper neighboring pixels, lower neighboring pixels, right neighboring pixels, and left neighboring pixels for these four pixels are located within the 4×4 block, because the four pixels lie in the interior of the block rather than on the block boundary. Some of the pixels in FIG. 6A and FIG. 6C are examples of pixels located on the block boundary. In other implementations, additional different subsets of pixels may be chosen. For example, subsets may be selected such that upper and lower neighboring pixels for the pixels of the subset are within the 4×4 block, but some left and right neighboring pixels are in neighboring blocks. Subsets may also be selected such that left and right neighboring pixels for the pixels of the subset are within the 4×4 block, but some upper and lower neighboring pixels are in neighboring blocks.
- Generally using the same techniques described above, a block activity metric may also be calculated using a subset of eight pixels as follows:
- Direction Metric
-
Ver_act(i,j)=abs((X(i,j)<<1)−X(i,j−1)−X(i,j+1))
-
Hor_act(i,j)=abs((X(i,j)<<1)−X(i−1,j)−X(i+1,j))
-
HB=Σ_{i=0,1,2,3} Σ_{j=1,2} Hor_act(i,j)
-
VB=Σ_{i=0,1,2,3} Σ_{j=1,2} Ver_act(i,j)
-
Direction=0, 1 (HB>k1*VB), 2 (VB>k2*HB)
- Activity Metric
-
LB=HB+VB
- 5 classes (0, 1, 2, 3, 4)
- Metric
- Combination of Activity and Direction (e.g. 15 or 16 combinations as explained above in the example of FIG. 4B)
- This different subset of eight pixels for calculating HB and VB includes pixels (0, 1), (1, 1), (2, 1), (3, 1), (0, 2), (1, 2), (2, 2), and (3, 2), shown on FIG. 6C. As can be seen in FIG. 6C, all of the upper neighboring pixels and lower neighboring pixels for these eight pixels are located within the 4×4 block, although pixels (0, 1) and (0, 2) each have a left neighboring pixel in a neighboring block and pixels (3, 1) and (3, 2) each have a right neighboring pixel in a neighboring block. An implementation using the subset of FIG. 6C may be able to avoid the use of line buffers for pixel values of upper and lower neighboring blocks, thus reducing coding complexity.
- The examples of FIGS. 6A-6C merely introduce techniques of this disclosure. It is contemplated that these techniques can be extended to blocks other than just 4×4 and that different subsets of pixels may be selected.
- When computing a block activity metric, instead of original pixels, quantized pixels (i.e., X(i,j)>>N) can be used to reduce the complexity of operations, such as addition operations. Additionally, calculations can be absolute difference based instead of Laplacian based. For example, when computing Hor_act(i,j) or Ver_act(i,j), absolute differences can be used instead of Laplacian values, as follows:
- Direction Metric
-
Ver_act(i,j)=abs(X(i,j)−X(i,j−1))
-
Hor_act(i,j)=abs(X(i,j)−X(i−1,j))
-
HB=Σ_{i=0,1,2} Σ_{j=0,1,2} Hor_act(i,j)
-
VB=Σ_{i=0,1,2} Σ_{j=0,1,2} Ver_act(i,j)
-
Direction=0, 1 (HB>2*VB), 2 (VB>2*HB)
- Activity Metric
-
LB=HB+VB
- 5 classes (0, 1, 2, 3, 4)
- Metric
- Activity+Direction (e.g. 15 or 16 combinations as explained above in the example of FIG. 4B)
- This disclosure has described sub-sampling techniques with reference to a limited group of specific metrics. It is contemplated, however, that these sub-sampling techniques are generally applicable to other metrics, such as the other metrics discussed in this disclosure, that may be used for purposes of determining a filter. Additionally, although the sub-sampling techniques of this disclosure have been described with reference to 4×4 blocks of pixels, the techniques may also be applicable to blocks of other sizes.
-
- FIG. 7 is a flow diagram illustrating a video coding technique consistent with this disclosure. The techniques described in FIG. 7 can be performed by the filter unit of a video encoder or a video decoder, such as filter unit 349 of video encoder 350 or filter unit 559 of video decoder 560. The filter unit determines a first metric for a group of pixels within a block of pixels (710). The first metric may, for example, be an activity metric such as a sum-modified Laplacian value, or the first metric may be a direction metric. The first metric may be determined, for example, based on a comparison of the set of pixels in the block, or based on a subset of the pixels in the block, to other pixels in the block. The filter unit further determines a second metric for the block (720). The second metric may, for example, be a direction metric that is determined based on comparing a measure of horizontal activity to a measure of vertical activity. Based on the first metric and the second metric, the filter unit determines a filter (730). The filter unit generates a filtered image by applying the filter to the block (740). As discussed above, in some implementations, the block may be a 2×2, 4×4, or M×N block of pixels, used for determining the first metric or the second metric. In some implementations, the first metric may be a horizontal activity metric while the second metric is a vertical activity metric, or the first metric may be an edge metric while the second metric is a direction metric.
- FIG. 8A is a flow diagram illustrating video coding techniques consistent with this disclosure. The techniques described in FIG. 8A can be performed by the filter unit of a video decoder, such as filter unit 559 of video decoder 560. Filter unit 559 maps a first range combination to a first filter (810A). The first range combination is a combination of a first range of values for a first metric and a first range of values for a second metric. The first metric may, for example, be a sum-modified Laplacian value and the second metric may be a direction metric, although other metrics may also be used. Filter unit 559 maps a second range combination to a second filter (820A). The second range combination is a combination of a second range of values for the first metric and a second range of values for the second metric. Filter unit 559 then maps a current range combination to a filter based on a received codeword. The current range combination includes the first range of values of the first metric and the second range of values for the second metric. If the codeword is a first codeword (830A, yes), then filter unit 559 maps the current range combination to the first filter (840A). The first codeword indicates the current range combination is mapped to the same filter as the first range combination. If the codeword is a second codeword (850A, yes), then filter unit 559 maps the current range combination to the second filter (860A). The second codeword indicates the current range combination is mapped to the same filter as the second range combination. If the codeword is neither a first codeword nor a second codeword (850A, no), then filter unit 559 maps the current range combination to a third filter (870A). The current range combination is mapped to the third filter in response to receiving a third codeword, wherein the third codeword identifies the third filter. In the example of FIG. 8A, the first codeword and the second codeword may each include fewer bits than the third codeword.
- FIG. 8B is a flow diagram illustrating video coding techniques consistent with this disclosure. The techniques described in FIG. 8B can be performed by the filter unit of a video decoder, such as filter unit 559 of video decoder 560. Filter unit 559 generates a mapping of range combinations to filters (810B). Each range combination, for example, can include a range for a first metric and a range for a second metric. In response to receiving a first codeword that signals a current range combination is mapped to a same filter as a previous range combination (820B, yes), filter unit 559 maps the current range combination to the same filter as the previous range combination (830B). In response to receiving a second codeword that signals the current range combination is mapped to a different filter than the previous range combination (820B, no), filter unit 559 maps the current range combination to a new filter (840B). As described above, the current range combination can be determined based on a known transmission order. In some examples, the new filter can be identified based on the second codeword, while in other examples, the new filter might be determined based on the order in which filter coefficients are signaled.
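- As a non-limiting illustration, the decoder-side mapping of FIG. 8B can be sketched as follows, for the case in which the second codeword itself identifies the new filter. Representing each codeword as a (kind, filter ID) pair, and the names SAME, NEW, and decode_mapping, are assumptions made for the sketch and do not reflect an actual bitstream syntax.

```python
from typing import List, Optional, Sequence, Tuple

SAME = "same"  # first codeword: reuse the filter of the previous range combination
NEW = "new"    # second codeword: carries the ID of a different filter

Codeword = Tuple[str, Optional[int]]


def decode_mapping(codewords: Sequence[Codeword]) -> List[int]:
    """Return the filter ID of each range combination.

    Range combinations are visited in the known transmission order, so
    entry n of the result belongs to the n-th range combination.
    """
    mapping: List[int] = []
    for kind, filter_id in codewords:
        if kind == SAME:
            if not mapping:
                raise ValueError("the first range combination has no predecessor")
            mapping.append(mapping[-1])
        elif kind == NEW:
            if filter_id is None:
                raise ValueError("a NEW codeword must identify a filter")
            mapping.append(filter_id)
        else:
            raise ValueError("unknown codeword kind: " + repr(kind))
    return mapping


received = [(NEW, 0), (SAME, None), (SAME, None), (NEW, 1), (SAME, None)]
filters_by_combination = decode_mapping(received)
```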
FIG. 9A is a flow diagram illustrating video coding techniques consistent with this disclosure. The techniques described in FIG. 9A can be performed by the filter unit of a video encoder, such as filter unit 349 of video encoder 350. Filter unit 349 determines a mapping of range combinations to filters (910A). Each range combination includes a range of values for a first metric and a range of values for a second metric. If a current range combination is mapped to the same filter as a previous range combination that comprises the same range of values for the first metric (920A, yes), then filter unit 349 generates a first codeword (930A). If the current range combination is mapped to the same filter as a previous range combination that comprises the same range of values for the second metric (940A, yes), then filter unit 349 generates a second codeword (950A). If the current range combination is not mapped to the same filter as either the previous range combination that comprises the same range of values for the first metric or the previous range combination that comprises the same range of values for the second metric (950A, no), then filter unit 349 generates a third codeword (960A). The third codeword can identify a filter mapped to the current range combination.
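The encoder-side choice among the three codewords in FIG. 9A can be illustrated as follows. The grid layout, the function name, and the tuple-shaped codewords are hypothetical. Treating the "previous" range combinations as the immediate left neighbour (same first-metric range) and the immediate upper neighbour (same second-metric range) is an assumption made for this sketch.

```python
def encode_combination(grid, i, j):
    """Choose a codeword for grid[i][j] (steps 920A-960A).

    grid[i][j] holds the filter ID mapped to the combination of range i of
    the first metric and range j of the second metric.
    """
    current = grid[i][j]
    if j > 0 and grid[i][j - 1] == current:   # neighbour with same first-metric range
        return ("first",)
    if i > 0 and grid[i - 1][j] == current:   # neighbour with same second-metric range
        return ("second",)
    return ("third", current)                 # third codeword carries the filter ID


grid = [[0, 0, 1],
        [2, 0, 1]]
codewords = [encode_combination(grid, i, j) for i in range(2) for j in range(3)]
```

In this toy grid, half of the combinations are signaled without any filter ID at all.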
FIG. 9B is a flow diagram illustrating video coding techniques consistent with this disclosure. The techniques described in FIG. 9B can be performed by the filter unit of a video encoder, such as filter unit 349 of video encoder 350. Filter unit 349 determines a mapping of range combinations to filters (910B). Each range combination can, for example, include a range for a first metric and a range for a second metric. When a current range combination being coded has the same filter as a previously coded range combination (920B, yes), filter unit 349 can generate a first codeword to signal that the current range combination is mapped to the same filter as a previous range combination (930B). When a current range combination being coded does not have the same filter as a previously coded range combination (920B, no), filter unit 349 can generate a second codeword (940B). The second codeword can identify the filter mapped to the current range combination. As described above, the current range combination can be determined based on a known transmission order. In the example of FIG. 9B, the first codeword may include fewer bits than the second codeword.
- In the examples of FIGS. 8A and 8B and FIGS. 9A and 9B, the terms “first codeword,” “second codeword,” and “third codeword” are used to differentiate between different codewords and are not meant to imply a sequential ordering of codewords.
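For the two-codeword scheme of FIG. 9B, a minimal encoder-side sketch is given here. The labels `"same"` and `("new", filter_id)` and the function name are hypothetical, and the "previously coded range combination" is assumed to be the one immediately preceding the current combination in transmission order.

```python
def encode_mapping(filters_in_order):
    """Emit one codeword per range combination (steps 920B-940B).

    filters_in_order lists the filter ID of each range combination in the
    known transmission order.
    """
    codewords = []
    previous = None
    for filter_id in filters_in_order:
        if previous is not None and filter_id == previous:
            codewords.append("same")              # short first codeword (930B)
        else:
            codewords.append(("new", filter_id))  # longer second codeword (940B)
        previous = filter_id
    return codewords


encoded = encode_mapping([0, 0, 1, 1, 1, 2])
```

When neighbouring range combinations tend to share filters, most entries collapse to the short codeword, which is where the bit savings would come from.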
FIG. 10 is a flow diagram illustrating video coding techniques consistent with this disclosure. The techniques described in FIG. 10 can be performed by the filter unit of a video encoder, such as filter unit 349 of video encoder 350, or the filter unit of a video decoder, such as filter unit 559. The filter unit determines a mapping of range combinations to filters (1010). The range combinations include a range for a first metric and a range for a second metric. The filter unit determines a unique range combination identification (ID) for each range combination (1020). The unique range combination IDs correspond to sequential values. The filter unit assigns a first unique group ID to a first group of range combinations based on the sequential value of a range combination ID of at least one range combination in the first group of range combinations (1030). The groups of range combinations include range combinations mapped to the same filter, and the unique group IDs correspond to a set of sequential values. The filter unit codes a first set of filter coefficients corresponding to the same filter based on the sequential value of the first unique group ID (1040). In the case of a video encoder, coding the first set of filter coefficients can include, for example, signaling the filter coefficients in an encoded bitstream using differential coding techniques. In the case of a video decoder, coding the first set of filter coefficients can include reconstructing the filter coefficients based on information received in an encoded bitstream.
FIG. 11 is a flow diagram illustrating video coding techniques consistent with this disclosure. The techniques described in FIG. 11 can be performed by the filter unit of a video encoder, such as filter unit 349 of video encoder 350, or the filter unit of a video decoder, such as filter unit 559. The filter unit determines a mapping of range combinations to filters (1110). The range combinations can include a range for a first metric and a range for a second metric. Each range combination can have a unique range combination identification (ID), and each unique range combination ID can correspond to a sequential value for the range combination. The filter unit can assign a unique group ID to each group of range combinations (1120). The filter unit can assign the unique group IDs, for example, based on the sequential values of the range combinations. A group of range combinations can include range combinations mapped to a common filter, and the unique group IDs can correspond to a set of sequential values. The filter unit can code sets of filter coefficients for the filters based on the unique group IDs (1140).
- In the example of FIG. 11, the filter unit can assign the unique group IDs by, for example, assigning a unique group ID corresponding to a lowest sequential value of the unique group IDs to a group of range combinations that comprises a range combination with a range combination ID corresponding to a lowest sequential value of the range combination IDs. In another example, the filter unit can assign the unique group ID corresponding to a highest sequential value of the unique group IDs to a group of range combinations that comprises a range combination with a range combination ID corresponding to a highest sequential value of the range combination IDs.
- In instances where the filter unit is part of a video decoder, the filter unit can code the sets of filter coefficients by generating the sets of filter coefficients based on information received in a coded bitstream. The filter unit can, for example, generate the sets of filter coefficients using differential coding techniques. In instances where the filter unit is part of a video encoder, the filter unit can code the sets of filter coefficients by signaling the sets of filter coefficients in a coded bitstream in an order selected based on the sequential values of the unique group IDs. The filter unit can, for example, signal the sets of filter coefficients using differential coding techniques.
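One way to picture the group ID assignment described for FIGS. 10 and 11 is the sketch below. The function name and the filter labels are hypothetical. The text only fixes which group receives the lowest (or highest) group ID, so numbering the remaining groups by the lowest range combination ID they contain is an assumption of this sketch.

```python
def assign_group_ids(filter_of_combo):
    """Assign sequential group IDs to groups of range combinations.

    filter_of_combo[k] is the (arbitrary) filter label of the range
    combination whose sequential range combination ID is k. The group that
    contains the lowest range combination ID receives the lowest group ID.
    """
    group_id_of_filter = {}
    for label in filter_of_combo:
        if label not in group_id_of_filter:
            # First appearance of this filter: open the next group ID.
            group_id_of_filter[label] = len(group_id_of_filter)
    return [group_id_of_filter[label] for label in filter_of_combo]


labels = ["f7", "f7", "f2", "f7", "f9", "f2"]
group_ids = assign_group_ids(labels)
# Filter coefficient sets would then be coded in increasing group ID order.
coding_order = sorted(set(labels), key=lambda f: group_ids[labels.index(f)])
```

Since an encoder and a decoder can both derive the same group IDs from the mapping, the order of the coefficient sets in the bitstream does not need separate signaling in this sketch.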
- The foregoing disclosure has been simplified to some extent in order to convey details. For example, the disclosure generally describes sets of filters being signaled on a per-frame or per-slice basis, but sets of filters may also be signaled on a per-sequence basis, per-group-of-pictures basis, per-group-of-slices basis, per-CU basis, per-LCU basis, or other such basis. In general, filters may be signaled for any grouping of one or more CUs. Additionally, in implementation, there may be numerous filters per input per CU, numerous coefficients per filter, and numerous different levels of variance with each of the filters being defined for a different range of variance. For example, in some cases there may be sixteen or more filters defined for each input of a CU and sixteen different ranges of variance corresponding to each filter. Additionally, when this disclosure describes transmitting filter information, it should not be assumed that all filter information is transmitted at the same coding level. For example, in some implementations, some filter information such as filter description syntax may be signaled on a frame-by-frame basis or slice-by-slice basis while other filter information such as filter coefficients is signaled on an LCU-by-LCU basis. Syntax at other levels of the coding hierarchy, such as sequence level, GOP level, or other levels, could also be defined for conveying some or all of such filter information.
- Each of the filters for each input may include many coefficients. In one example, the filters comprise two-dimensional filters with 81 different coefficients defined for a filter support that extends in two dimensions. However, the number of filter coefficients that are signaled for each filter may be fewer than 81 in some cases. Coefficient symmetry, for example, may be imposed such that filter coefficients in one dimension or quadrant may correspond to inverted or symmetric values relative to coefficients in other dimensions or quadrants. Coefficient symmetry may allow for 81 different coefficients to be represented by fewer coefficients, in which case the encoder and decoder may assume that inverted or mirrored values of coefficients define other coefficients. For example, the coefficients (5, −2, 10, 10, −2, 5) may be encoded and signaled as the subset of coefficients (5, −2, 10). In this case, the decoder may know that these three coefficients define the larger symmetric set of coefficients (5, −2, 10, 10, −2, 5).
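The mirrored-coefficient example can be reproduced with a short sketch. The function names are hypothetical, and the code covers only the one-dimensional, even-length mirror case used in the example. A two-dimensional 81-tap support with a centre tap would need a different layout.

```python
def expand_symmetric(half):
    """Mirror a signaled half-set of coefficients into the full symmetric set."""
    return list(half) + list(reversed(half))


def compress_symmetric(full):
    """Return the half-set to signal, or None if the set is not mirror-symmetric."""
    n = len(full)
    if n % 2 != 0 or list(full) != list(reversed(full)):
        return None
    return list(full[: n // 2])


signaled = compress_symmetric([5, -2, 10, 10, -2, 5])
reconstructed = expand_symmetric(signaled)
```

Here `signaled` holds three values and `reconstructed` restores all six, so only half of the coefficients would need to be carried in the bitstream.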
- The techniques of this disclosure may be implemented in a wide variety of devices or apparatuses, including a wireless handset, an integrated circuit (IC), or a set of ICs (i.e., a chip set). Any components, modules, or units have been described to emphasize functional aspects and do not necessarily require realization by different hardware units.
- Accordingly, the techniques described herein may be implemented in hardware, software, firmware, or any combination thereof. If implemented in hardware, any features described as modules, units or components may be implemented together in an integrated logic device or separately as discrete but interoperable logic devices. If implemented in software, the techniques may be realized at least in part by a computer-readable medium comprising instructions that, when executed in a processor, perform one or more of the methods described above. The computer-readable medium may comprise a computer-readable storage medium and may form part of a computer program product, which may include packaging materials. The computer-readable storage medium may comprise random access memory (RAM) such as synchronous dynamic random access memory (SDRAM), read-only memory (ROM), non-volatile random access memory (NVRAM), electrically erasable programmable read-only memory (EEPROM), FLASH memory, magnetic or optical data storage media, and the like. The techniques additionally, or alternatively, may be realized at least in part by a computer-readable communication medium that carries or communicates code in the form of instructions or data structures and that can be accessed, read, and/or executed by a computer.
- The code may be executed by one or more processors, such as one or more digital signal processors (DSPs), general purpose microprocessors, application specific integrated circuits (ASICs), field programmable logic arrays (FPGAs), or other equivalent integrated or discrete logic circuitry. Accordingly, the term “processor,” as used herein, may refer to any of the foregoing structures or any other structure suitable for implementation of the techniques described herein. In addition, in some aspects, the functionality described herein may be provided within dedicated software modules or hardware modules configured for encoding and decoding, or incorporated in a combined video codec. Also, the techniques could be fully implemented in one or more circuits or logic elements.
- Various aspects of the disclosure have been described. These and other aspects are within the scope of the following claims.
Claims (44)
Priority Applications (20)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/401,685 US8964853B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
EP20152795.9A EP3700203A1 (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
AU2012220639A AU2012220639B2 (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
UAA201311226A UA110637C2 (en) | 2011-02-23 | 2012-02-22 | Multi metric filtration |
EP12706181.0A EP2679010A1 (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
CN201710670800.8A CN107396114B (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
KR1020137024825A KR101578986B1 (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
CN201280015663.XA CN103477639B (en) | 2011-02-23 | 2012-02-22 | Many measurement filtering |
JP2013555530A JP5815756B2 (en) | 2011-02-23 | 2012-02-22 | Multiple metric filtering |
SG2013061338A SG192743A1 (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
PCT/US2012/026166 WO2012116095A1 (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
BR112013021617-4A BR112013021617A2 (en) | 2011-02-23 | 2012-02-22 | multimetric filtering |
CA2828406A CA2828406C (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
RU2013143011/08A RU2579688C2 (en) | 2011-02-23 | 2012-02-22 | Multimetric filtration |
MYPI2013003111A MY167114A (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
KR1020157011601A KR20150056663A (en) | 2011-02-23 | 2012-02-22 | Multi-metric filtering |
IL227994A IL227994A (en) | 2011-02-23 | 2013-08-15 | Multi-metric filtering |
ZA2013/07111A ZA201307111B (en) | 2011-02-23 | 2013-09-20 | Multi-metric filtering |
US14/592,826 US9877023B2 (en) | 2011-02-23 | 2015-01-08 | Multi-metric filtering |
JP2015187236A JP6105011B2 (en) | 2011-02-23 | 2015-09-24 | Multiple metric filtering |
Applications Claiming Priority (11)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161445967P | 2011-02-23 | 2011-02-23 | |
US201161448771P | 2011-03-03 | 2011-03-03 | |
US201161473713P | 2011-04-08 | 2011-04-08 | |
US201161476260P | 2011-04-16 | 2011-04-16 | |
US201161478287P | 2011-04-22 | 2011-04-22 | |
US201161503426P | 2011-06-30 | 2011-06-30 | |
US201161503440P | 2011-06-30 | 2011-06-30 | |
US201161503434P | 2011-06-30 | 2011-06-30 | |
US201161527463P | 2011-08-25 | 2011-08-25 | |
US201161531571P | 2011-09-06 | 2011-09-06 | |
US13/401,685 US8964853B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
Related Child Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/592,826 Continuation US9877023B2 (en) | 2011-02-23 | 2015-01-08 | Multi-metric filtering |
Publications (2)
Publication Number | Publication Date |
---|---|
US20120213293A1 | 2012-08-23 |
US8964853B2 | 2015-02-24 |
Family
ID=46652728
Family Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/401,685 Active 2032-12-12 US8964853B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
US13/401,552 Active 2033-01-02 US8964852B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
US13/401,573 Active 2033-01-01 US8989261B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
US13/401,548 Active 2032-12-03 US8982960B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
US14/592,826 Active 2032-12-21 US9877023B2 (en) | 2011-02-23 | 2015-01-08 | Multi-metric filtering |
US14/592,841 Active US9258563B2 (en) | 2011-02-23 | 2015-01-08 | Multi-metric filtering |
US15/018,403 Active US9819936B2 (en) | 2011-02-23 | 2016-02-08 | Multi-metric filtering |
Family Applications After (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/401,552 Active 2033-01-02 US8964852B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
US13/401,573 Active 2033-01-01 US8989261B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
US13/401,548 Active 2032-12-03 US8982960B2 (en) | 2011-02-23 | 2012-02-21 | Multi-metric filtering |
US14/592,826 Active 2032-12-21 US9877023B2 (en) | 2011-02-23 | 2015-01-08 | Multi-metric filtering |
US14/592,841 Active US9258563B2 (en) | 2011-02-23 | 2015-01-08 | Multi-metric filtering |
US15/018,403 Active US9819936B2 (en) | 2011-02-23 | 2016-02-08 | Multi-metric filtering |
Country Status (23)
Country | Link |
---|---|
US (7) | US8964853B2 (en) |
EP (8) | EP3687170A1 (en) |
JP (7) | JP5752812B2 (en) |
KR (6) | KR101552031B1 (en) |
CN (6) | CN103477639B (en) |
AU (2) | AU2012220639B2 (en) |
BR (2) | BR112013021617A2 (en) |
CA (2) | CA2828406C (en) |
DK (2) | DK2679009T3 (en) |
ES (3) | ES2816423T3 (en) |
HU (2) | HUE051433T2 (en) |
IL (1) | IL227636A (en) |
MX (1) | MX2013009722A (en) |
MY (2) | MY166573A (en) |
PL (2) | PL3796653T3 (en) |
PT (1) | PT2679009T (en) |
RU (2) | RU2584961C2 (en) |
SG (2) | SG192123A1 (en) |
SI (1) | SI2679009T1 (en) |
TW (1) | TWI499267B (en) |
UA (1) | UA110637C2 (en) |
WO (4) | WO2012116088A1 (en) |
ZA (2) | ZA201307111B (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140056363A1 (en) * | 2012-08-23 | 2014-02-27 | Yedong He | Method and system for deblock filtering coded macroblocks |
US20140328414A1 (en) * | 2012-11-13 | 2014-11-06 | Atul Puri | Content adaptive quality restoration filtering for next generation video coding |
US9258563B2 (en) | 2011-02-23 | 2016-02-09 | Qualcomm Incorporated | Multi-metric filtering |
US20160330485A1 (en) * | 2011-04-21 | 2016-11-10 | Intellectual Discovery Co., Ltd. | Method and apparatus for encoding/decoding images using a prediction method adopting in-loop filtering |
US10045028B2 (en) | 2015-08-17 | 2018-08-07 | Nxp Usa, Inc. | Media display system that evaluates and scores macro-blocks of media stream |
CN111107367A (en) * | 2018-10-26 | 2020-05-05 | 北京字节跳动网络技术有限公司 | Block division method and device |
CN114598902A (en) * | 2022-03-09 | 2022-06-07 | 安徽文香科技有限公司 | Video frame processing method and device and electronic equipment |
CN115134608A (en) * | 2015-06-11 | 2022-09-30 | 杜比实验室特许公司 | Method for encoding and decoding image using adaptive deblocking filtering and apparatus therefor |
CN116260973A (en) * | 2023-03-31 | 2023-06-13 | 北京百度网讯科技有限公司 | Time domain filtering method and device, electronic equipment and storage medium |
Families Citing this family (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008082277A2 (en) * | 2007-01-05 | 2008-07-10 | Lg Electronics Inc. | Layer mapping method and data transmission metho for mimo system |
US10123050B2 (en) | 2008-07-11 | 2018-11-06 | Qualcomm Incorporated | Filtering video data using a plurality of filters |
US9143803B2 (en) | 2009-01-15 | 2015-09-22 | Qualcomm Incorporated | Filter prediction based on activity metrics in video coding |
US9344742B2 (en) * | 2012-08-10 | 2016-05-17 | Google Inc. | Transform-domain intra prediction |
FR3011429A1 (en) * | 2013-09-27 | 2015-04-03 | Orange | VIDEO CODING AND DECODING BY HERITAGE OF A FIELD OF MOTION VECTORS |
KR20150037371A (en) * | 2013-09-30 | 2015-04-08 | 삼성전자주식회사 | Method and apparatus for image processing using DMA |
US9628822B2 (en) * | 2014-01-30 | 2017-04-18 | Qualcomm Incorporated | Low complexity sample adaptive offset encoding |
JP2017513312A (en) * | 2014-03-14 | 2017-05-25 | シャープ株式会社 | Video compression using color space scalability |
CN104023241B (en) * | 2014-05-29 | 2017-08-04 | 华为技术有限公司 | The method for video coding and video coding apparatus of intraframe predictive coding |
US10057574B2 (en) | 2015-02-11 | 2018-08-21 | Qualcomm Incorporated | Coding tree unit (CTU) level adaptive loop filter (ALF) |
CN104918057B (en) * | 2015-05-08 | 2018-07-13 | 上海交通大学 | A kind of motion vector after-treatment system using neighborhood motion information |
JP6593934B2 (en) * | 2015-05-21 | 2019-10-23 | ホアウェイ・テクノロジーズ・カンパニー・リミテッド | Apparatus and method for video motion compensation |
EP3313078B1 (en) * | 2015-06-18 | 2020-12-23 | LG Electronics Inc. | Image properties-based adaptive filtering method and device in image coding system |
EP3313079B1 (en) * | 2015-06-18 | 2021-09-01 | LG Electronics Inc. | Image filtering method in image coding system |
CN105049846B (en) * | 2015-08-14 | 2019-05-21 | 广东中星微电子有限公司 | The method and apparatus of image and coding and decoding video |
US9883183B2 (en) * | 2015-11-23 | 2018-01-30 | Qualcomm Incorporated | Determining neighborhood video attribute values for video data |
KR101788183B1 (en) * | 2015-12-28 | 2017-10-20 | 현대자동차주식회사 | Vehicle and controlling method for the vehicle |
US11064195B2 (en) * | 2016-02-15 | 2021-07-13 | Qualcomm Incorporated | Merging filters for multiple classes of blocks for video coding |
US10382766B2 (en) | 2016-05-09 | 2019-08-13 | Qualcomm Incorporated | Signalling of filtering information |
US10419755B2 (en) * | 2016-05-16 | 2019-09-17 | Qualcomm Incorporated | Confusion of multiple filters in adaptive loop filtering in video coding |
WO2018066241A1 (en) * | 2016-10-03 | 2018-04-12 | Sharp Kabushiki Kaisha | Systems and methods for applying deblocking filters to reconstructed video data |
US10572978B2 (en) * | 2016-12-05 | 2020-02-25 | Kennesaw State University Research And Service Foundation, Inc. | Moran's / for impulse noise detection and removal in color images |
JP2018182444A (en) * | 2017-04-07 | 2018-11-15 | 株式会社Jvcケンウッド | Image encoding device, image encoding method and image encoding program, as well as image decoding device, image decoding method and image decoding program |
CN108737841B (en) * | 2017-04-21 | 2020-11-24 | 腾讯科技(深圳)有限公司 | Coding unit depth determination method and device |
US10992939B2 (en) | 2017-10-23 | 2021-04-27 | Google Llc | Directional intra-prediction coding |
US10225578B2 (en) | 2017-05-09 | 2019-03-05 | Google Llc | Intra-prediction edge filtering |
WO2019065261A1 (en) * | 2017-09-27 | 2019-04-04 | ソニー株式会社 | Coding device, coding method, decoding device, and decoding method |
US20190116359A1 (en) * | 2017-10-12 | 2019-04-18 | Qualcomm Incorporated | Guided filter for video coding and processing |
SG11202003927TA (en) | 2017-11-01 | 2020-05-28 | Vid Scale Inc | Methods for simplifying adaptive loop filter in video coding |
CN108122268B (en) * | 2017-12-19 | 2021-07-23 | 网易(杭州)网络有限公司 | Mapping processing method and device |
WO2019182159A1 (en) * | 2018-03-23 | 2019-09-26 | シャープ株式会社 | Image filtering device, image decoding device, and image encoding device |
US20190297603A1 (en) * | 2018-03-23 | 2019-09-26 | Samsung Electronics Co., Ltd. | Method and apparatus for beam management for multi-stream transmission |
US11451773B2 (en) | 2018-06-01 | 2022-09-20 | Qualcomm Incorporated | Block-based adaptive loop filter (ALF) design and signaling |
CN112272951B (en) * | 2018-06-13 | 2024-08-09 | 华为技术有限公司 | Intra-frame sharpening and/or de-ringing filters for video coding |
KR102622950B1 (en) * | 2018-11-12 | 2024-01-10 | 삼성전자주식회사 | Display apparatus, method for controlling thereof and recording media thereof |
EP3881542A4 (en) * | 2018-11-14 | 2022-12-28 | Sharp Kabushiki Kaisha | Systems and methods for applying deblocking filters to reconstructed video data |
US11051017B2 (en) | 2018-12-20 | 2021-06-29 | Qualcomm Incorporated | Adaptive loop filter (ALF) index signaling |
RU2737343C2 (en) * | 2019-01-10 | 2020-11-27 | Федеральное государственное казенное военное образовательное учреждение высшего образования "Военный учебно-научный центр Военно-воздушных сил "Военно-воздушная академия имени профессора Н.Е. Жуковского и Ю.А. Гагарина" (г. Воронеж) Министерства обороны Российской Федерации | Method of determining object movement pattern on frames of video sequence |
US11070848B2 (en) * | 2019-06-24 | 2021-07-20 | Tencent America LLC | Method for efficient signaling of virtual boundary for loop filtering control |
CN113727116B (en) * | 2021-07-21 | 2024-04-23 | 天津津航计算技术研究所 | Video decoding method based on filtering mechanism |
CN113747171B (en) * | 2021-08-06 | 2024-04-19 | 天津津航计算技术研究所 | Self-recovery video decoding method |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6983079B2 (en) * | 2001-09-20 | 2006-01-03 | Seiko Epson Corporation | Reducing blocking and ringing artifacts in low-bit-rate coding |
US7289154B2 (en) * | 2000-05-10 | 2007-10-30 | Eastman Kodak Company | Digital image processing method and apparatus for brightness adjustment of digital images |
US7391812B2 (en) * | 2002-07-14 | 2008-06-24 | Apple Inc. | Adaptively post filtering encoded video |
US20080240559A1 (en) * | 2004-03-15 | 2008-10-02 | Microsoft Corporation | Adaptive interpolation with artifact reduction of images |
US20100284458A1 (en) * | 2008-01-08 | 2010-11-11 | Telefonaktiebolaget L M Ericsson (Publ) | Adaptive filtering |
Family Cites Families (81)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS612482A (en) | 1984-06-15 | 1986-01-08 | Mitsubishi Electric Corp | Sampling filter of sub-nyquist |
CA1270322A (en) | 1983-12-22 | 1990-06-12 | Kotaro Asai | Adaptive comb filter |
JP2673778B2 (en) | 1994-02-22 | 1997-11-05 | 国際電信電話株式会社 | Noise reduction device for video decoding |
JPH0970044A (en) | 1995-08-31 | 1997-03-11 | Sony Corp | Image signal processor and method therefor |
US5798795A (en) | 1996-03-01 | 1998-08-25 | Florida Atlantic University | Method and apparatus for encoding and decoding video signals |
US5844613A (en) * | 1997-03-17 | 1998-12-01 | Microsoft Corporation | Global motion estimator for motion video signal encoding |
KR100265722B1 (en) | 1997-04-10 | 2000-09-15 | 백준기 | Image processing method and apparatus based on block |
KR20010032337A (en) | 1998-09-22 | 2001-04-16 | 마츠시타 덴끼 산교 가부시키가이샤 | Video signal encoding method, video signal encoder, and program recorded medium |
US6421720B2 (en) | 1998-10-28 | 2002-07-16 | Cisco Technology, Inc. | Codec-independent technique for modulating bandwidth in packet network |
US6529638B1 (en) | 1999-02-01 | 2003-03-04 | Sharp Laboratories Of America, Inc. | Block boundary artifact reduction for block-based image compression |
US7003038B2 (en) | 1999-09-27 | 2006-02-21 | Mitsubishi Electric Research Labs., Inc. | Activity descriptor for video sequences |
FI117533B (en) | 2000-01-20 | 2006-11-15 | Nokia Corp | Procedure for filtering digital video images |
US7203234B1 (en) | 2000-03-31 | 2007-04-10 | Sharp Laboratories Of America, Inc. | Method of directional filtering for post-processing compressed video |
US6504872B1 (en) | 2000-07-28 | 2003-01-07 | Zenith Electronics Corporation | Down-conversion decoder for interlaced video |
US20030026495A1 (en) | 2001-03-07 | 2003-02-06 | Gondek Jay Stephen | Parameterized sharpening and smoothing method and apparatus |
DE10120395A1 (en) | 2001-04-25 | 2002-10-31 | Bosch Gmbh Robert | Device for the interpolation of samples as well as image encoder and image decoder |
US7266150B2 (en) | 2001-07-11 | 2007-09-04 | Dolby Laboratories, Inc. | Interpolation of video compression frames |
EP1432249A4 (en) | 2001-09-18 | 2007-12-05 | Matsushita Electric Ind Co Ltd | Image encoding method and image decoding method |
CA2433455C (en) * | 2001-11-29 | 2012-03-06 | Matsushita Electric Industrial Co., Ltd. | Coding distortion removal method, video encoding method, video decoding method, and apparatus and program for the same |
KR100418437B1 (en) | 2001-12-24 | 2004-02-14 | (주)씨앤에스 테크놀로지 | A moving picture decoding processor for multimedia signal processing |
US7379501B2 (en) | 2002-01-14 | 2008-05-27 | Nokia Corporation | Differential coding of interpolation filters |
EP1333681A3 (en) | 2002-01-31 | 2004-12-08 | Samsung Electronics Co., Ltd. | Filtering method and apparatus for reducing block artifacts or ringing noise |
JP4102973B2 (en) | 2002-04-24 | 2008-06-18 | 日本電気株式会社 | Encoding method and decoding method of moving image, apparatus and program using the same |
ES2277174T3 (en) | 2002-05-02 | 2007-07-01 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | ARITHMETIC CODING OF TRANSFORMATION COEFFICIENTS. |
CN100566379C (en) | 2002-06-25 | 2009-12-02 | 松下电器产业株式会社 | Motion detection device and utilize the sound attenuation of this device |
EP2164261A3 (en) | 2002-07-11 | 2011-08-17 | Panasonic Corporation | Filtering strength determination method, moving picture coding and decoding method |
WO2004082290A1 (en) | 2003-03-10 | 2004-09-23 | Mitsubishi Denki Kabushiki Kaisha | Video signal encoding device and video signal encoding method |
US7430335B2 (en) | 2003-08-13 | 2008-09-30 | Apple Inc | Pre-processing method and system for data reduction of video sequences and bit rate reduction of compressed video sequences using spatial filtering |
US7599438B2 (en) * | 2003-09-07 | 2009-10-06 | Microsoft Corporation | Motion vector block pattern coding and decoding |
US8625680B2 (en) | 2003-09-07 | 2014-01-07 | Microsoft Corporation | Bitstream-controlled post-processing filtering |
US8094711B2 (en) | 2003-09-17 | 2012-01-10 | Thomson Licensing | Adaptive reference picture generation |
US7822286B2 (en) | 2003-11-07 | 2010-10-26 | Mitsubishi Electric Research Laboratories, Inc. | Filtering artifacts in images with 3D spatio-temporal fuzzy filters |
US7437013B2 (en) | 2003-12-23 | 2008-10-14 | General Instrument Corporation | Directional spatial video noise reduction |
US7453938B2 (en) | 2004-02-06 | 2008-11-18 | Apple Inc. | Target bitrate estimator, picture activity and buffer management in rate control for video coder |
JP4468734B2 (en) | 2004-04-27 | 2010-05-26 | オリンパス株式会社 | Video signal processing apparatus and video signal processing program |
US7460596B2 (en) | 2004-04-29 | 2008-12-02 | Mediatek Incorporation | Adaptive de-blocking filtering apparatus and method for MPEG video decoder |
US20070230565A1 (en) | 2004-06-18 | 2007-10-04 | Tourapis Alexandros M | Method and Apparatus for Video Encoding Optimization |
JP2008507915A (en) | 2004-07-20 | 2008-03-13 | クゥアルコム・インコーポレイテッド | Method and apparatus for encoder-assisted frame rate upconversion for video compression |
US20060028562A1 (en) | 2004-08-09 | 2006-02-09 | Martin Schmitz | Fast area-selected filtering for pixel-noise and analog artifacts reduction |
US7370126B2 (en) | 2004-11-03 | 2008-05-06 | Cisco Technology, Inc. | System and method for implementing a demand paging jitter buffer algorithm |
US7634148B2 (en) | 2005-01-07 | 2009-12-15 | Ntt Docomo, Inc. | Image signal transforming and inverse-transforming method and computer program product with pre-encoding filtering features |
US20090022220A1 (en) | 2005-04-13 | 2009-01-22 | Universitaet Hannover | Method and apparatus for enhanced video coding |
US7680355B2 (en) | 2005-05-02 | 2010-03-16 | Intel Corporation | Detection of artifacts resulting from image signal decompression |
US8422546B2 (en) | 2005-05-25 | 2013-04-16 | Microsoft Corporation | Adaptive video encoding using a perceptual model |
US20060285597A1 (en) | 2005-06-20 | 2006-12-21 | Flextronics International Usa, Inc. | Reusing interpolated values in advanced video encoders |
US8208564B2 (en) | 2005-06-24 | 2012-06-26 | Ntt Docomo, Inc. | Method and apparatus for video encoding and decoding using adaptive interpolation |
US7778169B2 (en) | 2005-09-02 | 2010-08-17 | Cisco Technology, Inc. | Packetizing media for a time slotted communication system |
US7894522B2 (en) | 2005-09-16 | 2011-02-22 | Sony Corporation | Classified filtering for temporal prediction |
JP4455487B2 (en) | 2005-12-16 | 2010-04-21 | 株式会社東芝 | Decoding device, decoding method, and program |
WO2007111292A1 (en) | 2006-03-27 | 2007-10-04 | Matsushita Electric Industrial Co., Ltd. | Picture coding apparatus and picture decoding apparatus |
BRPI0714233A2 (en) | 2006-07-18 | 2013-01-15 | Thomson Licensing | Methods and apparatus for adaptive reference filtering |
US8253752B2 (en) | 2006-07-20 | 2012-08-28 | Qualcomm Incorporated | Method and apparatus for encoder assisted pre-processing |
US8731064B2 (en) | 2006-09-11 | 2014-05-20 | Apple Inc. | Post-processing for decoder complexity scalability |
US20080075165A1 (en) | 2006-09-26 | 2008-03-27 | Nokia Corporation | Adaptive interpolation filters for video coding |
TWI368443B (en) * | 2006-11-09 | 2012-07-11 | Lg Electronics Inc | Method and apparatus for decoding/encoding a video signal |
ATE488096T1 (en) | 2006-12-18 | 2010-11-15 | Koninkl Philips Electronics Nv | IMAGE COMPRESSION AND DECOMPRESSION |
US8509316B2 (en) | 2007-01-09 | 2013-08-13 | Core Wireless Licensing, S.a.r.l. | Adaptive interpolation filters for video coding |
KR100856551B1 (en) * | 2007-05-31 | 2008-09-04 | 한국과학기술원 | Deblock filter and deblock filtering method in h.264/avc |
WO2008148272A1 (en) | 2007-06-04 | 2008-12-11 | France Telecom Research & Development Beijing Company Limited | Method and apparatus for sub-pixel motion-compensated video coding |
US7965900B2 (en) | 2007-09-26 | 2011-06-21 | Hewlett-Packard Development Company, L.P. | Processing an input image to reduce compression-related artifacts |
WO2009045683A1 (en) | 2007-09-28 | 2009-04-09 | Athanasios Leontaris | Video compression and tranmission techniques |
EP2048886A1 (en) | 2007-10-11 | 2009-04-15 | Panasonic Corporation | Coding of adaptive interpolation filter coefficients |
CN101184221A (en) * | 2007-12-06 | 2008-05-21 | 上海大学 | Vision attention based video encoding method |
WO2009091521A2 (en) | 2008-01-14 | 2009-07-23 | Thomson Licensing | Methods and apparatus for de-artifact filtering using multi-lattice sparsity-based filtering |
US8831086B2 (en) | 2008-04-10 | 2014-09-09 | Qualcomm Incorporated | Prediction techniques for interpolation in video coding |
US8451902B2 (en) | 2008-04-23 | 2013-05-28 | Telefonaktiebolaget L M Ericsson (Publ) | Template-based pixel block processing |
US10123050B2 (en) | 2008-07-11 | 2018-11-06 | Qualcomm Incorporated | Filtering video data using a plurality of filters |
US8290782B2 (en) | 2008-07-24 | 2012-10-16 | Dts, Inc. | Compression of audio scale-factors by two-dimensional transformation |
US8736751B2 (en) | 2008-08-26 | 2014-05-27 | Empire Technology Development Llc | Digital presenter for displaying image captured by camera with illumination system |
US8150191B2 (en) | 2008-10-14 | 2012-04-03 | Interra Systems Inc. | Method and system for calculating blur artifacts in videos using user perception threshold |
US8792564B2 (en) | 2008-10-28 | 2014-07-29 | Sony Corporation | Adaptive preprocessing method using feature-extracted video maps |
US8761538B2 (en) | 2008-12-10 | 2014-06-24 | Nvidia Corporation | Measurement-based and scalable deblock filtering of image data |
US9143803B2 (en) | 2009-01-15 | 2015-09-22 | Qualcomm Incorporated | Filter prediction based on activity metrics in video coding |
WO2010102935A1 (en) * | 2009-03-09 | 2010-09-16 | Thomson Licensing | Estimation of the prediction mode for the intra coding mode |
CN101854540B (en) * | 2009-04-01 | 2014-07-02 | 辉达公司 | Intra prediction method and device for employing H.264 video coding standard |
EP2262267A1 (en) * | 2009-06-10 | 2010-12-15 | Panasonic Corporation | Filter coefficient coding scheme for video coding |
WO2011126759A1 (en) | 2010-04-09 | 2011-10-13 | Sony Corporation | Optimal separable adaptive loop filter |
US9094658B2 (en) | 2010-05-10 | 2015-07-28 | Mediatek Inc. | Method and apparatus of adaptive loop filtering |
CN101945281B (en) * | 2010-09-10 | 2014-09-10 | 中兴通讯股份有限公司 | Method and device for filtering video codes |
US8964853B2 (en) | 2011-02-23 | 2015-02-24 | Qualcomm Incorporated | Multi-metric filtering |
JP5818755B2 (en) | 2012-08-23 | 2015-11-18 | 有限会社イザキ | Incineration ash storage method and incineration ash storage container used therefor |
2012
- 2012-02-21 US US13/401,685 patent/US8964853B2/en active Active
- 2012-02-21 US US13/401,552 patent/US8964852B2/en active Active
- 2012-02-21 US US13/401,573 patent/US8989261B2/en active Active
- 2012-02-21 US US13/401,548 patent/US8982960B2/en active Active
- 2012-02-22 RU RU2013142925/08A patent/RU2584961C2/en active
- 2012-02-22 EP EP19216610.6A patent/EP3687170A1/en active Pending
- 2012-02-22 DK DK12706180.2T patent/DK2679009T3/en active
- 2012-02-22 WO PCT/US2012/026154 patent/WO2012116088A1/en active Application Filing
- 2012-02-22 SG SG2013056502A patent/SG192123A1/en unknown
- 2012-02-22 RU RU2013143011/08A patent/RU2579688C2/en active
- 2012-02-22 CN CN201280015663.XA patent/CN103477639B/en active Active
- 2012-02-22 CN CN201710243718.7A patent/CN107277525B/en active Active
- 2012-02-22 KR KR1020137024783A patent/KR101552031B1/en active IP Right Grant
- 2012-02-22 JP JP2013555528A patent/JP5752812B2/en active Active
- 2012-02-22 EP EP19216619.7A patent/EP3687171A1/en active Pending
- 2012-02-22 WO PCT/US2012/026160 patent/WO2012116090A1/en active Application Filing
- 2012-02-22 PT PT127061802T patent/PT2679009T/en unknown
- 2012-02-22 HU HUE12706179A patent/HUE051433T2/en unknown
- 2012-02-22 WO PCT/US2012/026165 patent/WO2012116094A1/en active Application Filing
- 2012-02-22 EP EP12706178.6A patent/EP2679007A1/en not_active Ceased
- 2012-02-22 JP JP2013555529A patent/JP5897609B2/en active Active
- 2012-02-22 HU HUE12706180A patent/HUE051435T2/en unknown
- 2012-02-22 PL PL20183884.4T patent/PL3796653T3/en unknown
- 2012-02-22 BR BR112013021617-4A patent/BR112013021617A2/en not_active Application Discontinuation
- 2012-02-22 WO PCT/US2012/026166 patent/WO2012116095A1/en active Application Filing
- 2012-02-22 DK DK12706179.4T patent/DK2679008T3/en active
- 2012-02-22 MY MYPI2013002787A patent/MY166573A/en unknown
- 2012-02-22 EP EP12706181.0A patent/EP2679010A1/en not_active Ceased
- 2012-02-22 MX MX2013009722A patent/MX2013009722A/en active IP Right Grant
- 2012-02-22 SG SG2013061338A patent/SG192743A1/en unknown
- 2012-02-22 KR KR1020137024770A patent/KR101581098B1/en active IP Right Grant
- 2012-02-22 CA CA2828406A patent/CA2828406C/en active Active
- 2012-02-22 UA UAA201311226A patent/UA110637C2/en unknown
- 2012-02-22 AU AU2012220639A patent/AU2012220639B2/en active Active
- 2012-02-22 EP EP12706180.2A patent/EP2679009B1/en active Active
- 2012-02-22 PL PL12706180T patent/PL2679009T3/en unknown
- 2012-02-22 KR KR1020137024825A patent/KR101578986B1/en active IP Right Grant
- 2012-02-22 BR BR112013021476A patent/BR112013021476A2/en not_active Application Discontinuation
- 2012-02-22 CN CN201280010232.4A patent/CN103404142B/en active Active
- 2012-02-22 AU AU2012220632A patent/AU2012220632B2/en active Active
- 2012-02-22 JP JP2013555527A patent/JP5815755B2/en active Active
- 2012-02-22 TW TW101105899A patent/TWI499267B/en active
- 2012-02-22 SI SI201231846T patent/SI2679009T1/en unknown
- 2012-02-22 CN CN201710670800.8A patent/CN107396114B/en active Active
- 2012-02-22 ES ES12706179T patent/ES2816423T3/en active Active
- 2012-02-22 ES ES12706180T patent/ES2824831T3/en active Active
- 2012-02-22 CA CA2830381A patent/CA2830381C/en active Active
- 2012-02-22 EP EP20183884.4A patent/EP3796653B1/en active Active
- 2012-02-22 KR KR1020157011601A patent/KR20150056663A/en not_active Application Discontinuation
- 2012-02-22 ES ES20183884T patent/ES2966390T3/en active Active
- 2012-02-22 CN CN201280009765.0A patent/CN103380621B/en active Active
- 2012-02-22 KR KR1020157011552A patent/KR101788948B1/en active IP Right Grant
- 2012-02-22 EP EP20152795.9A patent/EP3700203A1/en not_active Withdrawn
- 2012-02-22 CN CN201280010179.8A patent/CN103392339B/en active Active
- 2012-02-22 MY MYPI2013003111A patent/MY167114A/en unknown
- 2012-02-22 EP EP12706179.4A patent/EP2679008B1/en active Active
- 2012-02-22 JP JP2013555530A patent/JP5815756B2/en active Active
- 2012-02-22 KR KR1020137024804A patent/KR101552032B1/en active IP Right Grant
2013
- 2013-07-24 IL IL227636A patent/IL227636A/en active IP Right Grant
- 2013-09-20 ZA ZA2013/07111A patent/ZA201307111B/en unknown
- 2013-09-20 ZA ZA2013/07110A patent/ZA201307110B/en unknown
2015
- 2015-01-08 US US14/592,826 patent/US9877023B2/en active Active
- 2015-01-08 US US14/592,841 patent/US9258563B2/en active Active
- 2015-07-28 JP JP2015148880A patent/JP6141917B2/en active Active
- 2015-09-24 JP JP2015187236A patent/JP6105011B2/en active Active
2016
- 2016-02-08 US US15/018,403 patent/US9819936B2/en active Active
2017
- 2017-05-08 JP JP2017092448A patent/JP6370960B2/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7289154B2 (en) * | 2000-05-10 | 2007-10-30 | Eastman Kodak Company | Digital image processing method and apparatus for brightness adjustment of digital images |
US6983079B2 (en) * | 2001-09-20 | 2006-01-03 | Seiko Epson Corporation | Reducing blocking and ringing artifacts in low-bit-rate coding |
US7391812B2 (en) * | 2002-07-14 | 2008-06-24 | Apple Inc. | Adaptively post filtering encoded video |
US20080240559A1 (en) * | 2004-03-15 | 2008-10-02 | Microsoft Corporation | Adaptive interpolation with artifact reduction of images |
US20100284458A1 (en) * | 2008-01-08 | 2010-11-11 | Telefonaktiebolaget L M Ericsson (Publ) | Adaptive filtering |
Non-Patent Citations (2)
Title |
---|
List et al., "Adaptive Deblocking Filter", July 2003, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 13, No. 7, pp. 614-619 * |
Shin et al., "Variable block-based deblocking filter for H.264/AVC on low-end and low-bit rates terminals", April 2010, Signal Processing: Image Communication, Vol. 25, Issue 4, pp. 255-267 * |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9819936B2 (en) | 2011-02-23 | 2017-11-14 | Qualcomm Incorporated | Multi-metric filtering |
US9877023B2 (en) | 2011-02-23 | 2018-01-23 | Qualcomm Incorporated | Multi-metric filtering |
US9258563B2 (en) | 2011-02-23 | 2016-02-09 | Qualcomm Incorporated | Multi-metric filtering |
US10785503B2 (en) | 2011-04-21 | 2020-09-22 | Intellectual Discovery Co., Ltd. | Method and apparatus for encoding/decoding images using a prediction method adopting in-loop filtering |
US11381844B2 (en) | 2011-04-21 | 2022-07-05 | Dolby Laboratories Licensing Corporation | Method and apparatus for encoding/decoding images using a prediction method adopting in-loop filtering |
US20160330485A1 (en) * | 2011-04-21 | 2016-11-10 | Intellectual Discovery Co., Ltd. | Method and apparatus for encoding/decoding images using a prediction method adopting in-loop filtering |
US10129567B2 (en) * | 2011-04-21 | 2018-11-13 | Intellectual Discovery Co., Ltd. | Method and apparatus for encoding/decoding images using a prediction method adopting in-loop filtering |
US10237577B2 (en) * | 2011-04-21 | 2019-03-19 | Intellectual Discovery Co., Ltd. | Method and apparatus for encoding/decoding images using a prediction method adopting in-loop filtering |
US20140056363A1 (en) * | 2012-08-23 | 2014-02-27 | Yedong He | Method and system for deblock filtering coded macroblocks |
US20140328414A1 (en) * | 2012-11-13 | 2014-11-06 | Atul Puri | Content adaptive quality restoration filtering for next generation video coding |
US9800899B2 (en) * | 2012-11-13 | 2017-10-24 | Intel Corporation | Content adaptive quality restoration filtering for next generation video coding |
US10182245B2 (en) | 2012-11-13 | 2019-01-15 | Intel Corporation | Content adaptive quality restoration filtering for next generation video coding |
CN115134609A (en) * | 2015-06-11 | 2022-09-30 | 杜比实验室特许公司 | Method for encoding and decoding image using adaptive deblocking filtering and apparatus therefor |
CN115134608A (en) * | 2015-06-11 | 2022-09-30 | 杜比实验室特许公司 | Method for encoding and decoding image using adaptive deblocking filtering and apparatus therefor |
US10045028B2 (en) | 2015-08-17 | 2018-08-07 | Nxp Usa, Inc. | Media display system that evaluates and scores macro-blocks of media stream |
CN111107367A (en) * | 2018-10-26 | 2020-05-05 | 北京字节跳动网络技术有限公司 | Block division method and device |
CN114598902A (en) * | 2022-03-09 | 2022-06-07 | 安徽文香科技有限公司 | Video frame processing method and device and electronic equipment |
CN116260973A (en) * | 2023-03-31 | 2023-06-13 | 北京百度网讯科技有限公司 | Time domain filtering method and device, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9819936B2 (en) | | Multi-metric filtering |
US9819966B2 (en) | | Filter description signaling for multi-filter adaptive filtering |
IL227994A (en) | | Multi-metric filtering |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | AS | Assignment | Owner name: QUALCOMM INCORPORATED, CALIFORNIA. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: CHONG, IN SUK; KARCZEWICZ, MARTA; REEL/FRAME: 027843/0726. Effective date: 20120229 |
 | FEPP | Fee payment procedure | Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
 | STCF | Information on status: patent grant | Free format text: PATENTED CASE |
 | MAFP | Maintenance fee payment | Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551). Year of fee payment: 4 |
 | MAFP | Maintenance fee payment | Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY. Year of fee payment: 8 |