Multi-resolution extreme learning machine-based side information estimation in distributed video coding

Bodhisattva Dash¹, Suvendu Rup¹, Anjali Mohapatra¹, Banshidhar Majhi² & M. N. S. Swamy³

Abstract

Context: In a traditional video coding architecture, encoding video frames is computationally exhaustive because of the motion estimation (ME) task, and it therefore demands considerable computing power, battery power, and memory. Such codecs are neither effective nor reliable for applications such as surveillance systems, wireless sensor networks, and wireless camcorders, where resources and computing ability are scarce. In these scenarios, distributed video coding (DVC) is a viable solution for power-constrained hand-held devices, because it allows the complexity to be distributed flexibly between the encoder and the decoder.

Objective: The decoder-driven side information (SI) generation module plays a key role in a DVC codec, whose efficacy relies heavily on the quality of the SI generated at the decoder. SI is regarded as a facsimile of the original Wyner-Ziv (WZ) frame; hence, the higher the quality of the SI, the better the efficiency of the codec. The primary objective of the present work is to enhance the quality of the SI frame so that the overall performance of the DVC codec improves. To this end, this article presents a hybrid SI generation scheme that uses the principles of the discrete wavelet transform (DWT) and the extreme learning machine (ELM) algorithm in a transform domain-based DVC framework.

Results: Exhaustive simulations have been carried out on some standard video sequences with the proposed and benchmark schemes. The proposed scheme is evaluated with respect to performance metrics such as rate-distortion (RD), SI peak signal-to-noise ratio (PSNR) versus frame number, and the number of parity requests per SI frame. The experimental results and their analysis corroborate that the proposed technique outperforms the benchmark schemes.
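
As an illustration of the SI PSNR metric named in the Results, the following is a minimal sketch of how the PSNR between an SI frame and the original WZ frame could be computed. It assumes 8-bit frames (peak value 255) stored as nested lists of pixel values; the function name `psnr` and the toy frames are hypothetical and are not taken from the article.

```python
import math


def psnr(reference, estimate, peak=255.0):
    """Return the PSNR in dB between two equally sized 2-D frames.

    `reference` and `estimate` are lists of rows of pixel values.
    `peak` is the maximum possible pixel value (255 assumed for 8-bit video).
    """
    if len(reference) != len(estimate):
        raise ValueError("frames must have the same number of rows")

    squared_error = 0.0
    count = 0
    for ref_row, est_row in zip(reference, estimate):
        if len(ref_row) != len(est_row):
            raise ValueError("rows must have the same length")
        for ref_pixel, est_pixel in zip(ref_row, est_row):
            squared_error += (ref_pixel - est_pixel) ** 2
            count += 1

    if count == 0:
        raise ValueError("frames must not be empty")

    mse = squared_error / count
    if mse == 0:
        # Identical frames: the PSNR is unbounded.
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)


if __name__ == "__main__":
    # Toy 2x2 frames standing in for an original WZ frame and its SI estimate.
    wz_frame = [[100, 102], [98, 101]]
    si_frame = [[101, 102], [98, 100]]
    print(f"SI PSNR: {psnr(wz_frame, si_frame):.2f} dB")
```

Computing this value for every SI frame of a sequence gives the kind of "SI PSNR versus frame number" curve mentioned above; a higher value indicates an SI frame closer to the original WZ frame.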

Author information

Authors and Affiliations

Image and Video Processing Laboratory, Department of Computer Science and Engineering, International Institute of Information Technology, Bhubaneswar-751003, Odisha, India
Bodhisattva Dash, Suvendu Rup & Anjali Mohapatra
Pattern Recognition Research Laboratory, Department of Computer Science and Engineering, National Institute of Technology, Rourkela-769004, Odisha, India
Banshidhar Majhi
Department of Electrical and Computer Engineering, Concordia University, Montreal, QC, H3G 1M8, Canada
M. N. S. Swamy

Corresponding author

Correspondence to Bodhisattva Dash.

About this article

Cite this article

Dash, B., Rup, S., Mohapatra, A. et al. Multi-resolution extreme learning machine-based side information estimation in distributed video coding. Multimed Tools Appl 77, 27301–27335 (2018). https://doi.org/10.1007/s11042-018-5921-9

Received: 21 September 2017
Revised: 17 March 2018
Accepted: 21 March 2018
Published: 27 March 2018
Issue Date: October 2018
DOI: https://doi.org/10.1007/s11042-018-5921-9
