Nothing Special »
Address
:
[go:
up one dir
,
main page
]
Include Form
Remove Scripts
Accept Cookies
Show Images
Show Referer
Rotate13
Base64
Strip Meta
Strip Title
Session Cookies
Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
Intra prediction with 3-tap filters for lossless and lossy video coding
Date
2016
Author
Ranjbar Alvar, Saeed
Metadata
Show full item record
Item Usage Stats
211
views
0
downloads
Cite This
Video coders are primarily designed for lossy compression. The basic steps in modern lossy video compression are block-based spatial or temporal prediction, transformation of the prediction error block, quantization of the transform coefficients and entropy coding of the quantized coefficients together with other side information. In some cases, this lossy coding architecture may not be efficient for compression. For example, when lossless video compression is desirable, the transform and quantization steps are skipped. Or in lossy compression of synthetic video content (such as animations), the transform may be skipped for some of the blocks and the prediction error is quantized and entropy coded in those blocks. In these cases, the block-based spatial prediction (called intra prediction) cannot sufficiently decorrelate the pixels by itself and large prediction errors become more frequent. For the cases where the transform is skipped, the block-based prediction can be replaced with a more accurate pixel-by-pixel prediction since the original/reconstructed neighboring pixels inside the block will be readily available due to the lack of transform. This thesis explores pixel-by-pixel prediction methods based on 3-tap filtering which use three neighboring pixels for prediction according to a two-dimensional correlation model. Two of the proposed methods are designed for lossless intra coding, one with offline determined prediction weights and the other with online determined adaptive weights. The third proposed method uses the 3-tap filtering method for the transform skipped blocks in lossy intra coding. The proposed methods are implemented within the HEVC reference software and the experimental results indicate that the pixel-by-pixel spatial prediction method based on 3-tap filtering can improve the compression efficiency for both lossless and lossy coding.
Subject Keywords
Digital video.
,
Image processing.
,
Image processing
URI
http://etd.lib.metu.edu.tr/upload/12620387/index.pdf
https://hdl.handle.net/11511/25938
Collections
Graduate School of Natural and Applied Sciences, Thesis
Suggestions
OpenMETU
Core
Lossless Image and Intra-Frame Compression With Integer-to-Integer DST
Kamışlı, Fatih (2019-02-01)
Video coding standards are primarily designed for efficient lossy compression, but it is also desirable to support efficient lossless compression within video coding standards using small modifications to the lossy coding architecture. A simple approach is to skip transform and quantization, and simply entropy code the prediction residual. However, this approach is inefficient at compression. A more efficient and popular approach is to skip transform and quantization but also process the residual block in s...
Visibility grid method for efficient crowd rendering wirh shadows
Koçdemir, Şahin Serdar; İşler, Veysi; Department of Modeling and Simulation (2012)
Virtual crowd rendering have been used in film industry with offine rendering methods for a long time. But its existence in interactive real-time applications such as video games is not so common due to the limited rendering power of current graphics hardware. This thesis describes a novel method to improve shadow mapping performance of a crowded scene by taking into account the screen space visibility of the casted shadow of a crowd instance when rendering the shadow maps. A grid-based visibility mask crea...
Joint utilization of fixed and variable-length codes for improving synchronization immunity for image transmission
Alatan, Abdullah Aydın (1998-01-01)
Robust transmission of images is achieved by using fixed and variable-length coding together without much loss in compression efficiency. The probability distribution function of a DCT coefficient can be divided into two regions using a threshold; so that one portion contains roughly equiprobable transform coefficients. While fixed-length coding, which is a powerful solution to the synchronization problem, is used in this inner equiprobable region without sacrificing compression, the outer (saturating) regi...
End-to-end learned image compression with normalizing flows for latent space enhancement
Yavuz, Fatih; Kamışlı, Fatih; Department of Electrical and Electronics Engineering (2022-9)
Learning based methods for image compression recently received considerable attention and demonstrated promising performance, surpassing many commonly used codecs. Architectures of learning based methodologies are typically comprised of a nonlinear analysis transform, which maps the input image to a latent representation, a synthesis transform that maps the quantized latent representation back to the image domain and a model for the probability distribution of the latent representation. Successful modelling...
Video segmentation using partially decoded MPEG bitstream
Kayaalp, Işıl Burcun; Akar, Gözde; Department of Electrical and Electronics Engineering (2003)
In this thesis, a mixed type video segmentation algorithm is implemented to find the scene cuts in MPEG compressed video data. The main aim is to have a computationally efficient algorithm for real time applications. Due to this reason partial decoding of the bitstream is used in segmentation. As a result of partial decoding, features such as bitrate, motion vector type, and DC images are implemented to find both continuous and discontinuous scene cuts on a MPEG-2 coded general TV broadcast data. The result...
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
S. Ranjbar Alvar, “Intra prediction with 3-tap filters for lossless and lossy video coding,” M.S. - Master of Science, Middle East Technical University, 2016.