Abstract
Context: Encoding of video frames in a traditional video coding architecture involves exhaustive computations due to the motion estimation (ME) task. Hence, it requires a considerable amount of computing aid, battery power, and resource memory. These codecs are not effective and reliable for applications like surveillance systems, wireless sensor networks, wireless camcorders, having scarcity in the availability of resources and computing ability. Therefore, in such scenarios, distributed video coding (DVC) represents a viable solution for power-constrained hand-held devices. DVC empowers the adaptability in distributing the complexity between the encoder and the decoder. Objective: Like any other building block, the decoder driven side information (SI) generation module plays a key role in a DVC codec. The efficacy of a DVC codec firmly relies on the quality of the SI generated at the decoder. SI is considered to be the facsimile of the original Wyner-Ziv (WZ) frame. Hence, the superior the quality of SI, improved is the efficiency of the codec. The primary objective of the present work is to enhance the quality of the SI frame so that the overall performance of the DVC is improved. To achieve this objective, this article deals with a hybrid SI generation scheme utilizing the principles of discrete wavelet transform (DWT) and extreme learning machine (ELM) algorithm in a transform domain-based DVC framework. Results: Exhaustive simulations have been carried out for some standard video sequences with the proposed and benchmark schemes. The proposed scheme is evaluated with respect to different performance metrics such as rate-distortion (RD), SI peak-signal-to-noise-ratio (PSNR) vs frame number, number of parity requests per SI frame, and so on. Experimental results and its analyses corroborate that the performance of the proposed technique surpasses as that of the benchmark schemes.
Similar content being viewed by others
References
Aaron A, Rane SD, Setton E, Girod B et al (2004) Transform-domain wyner-ziv codec for video. In: Proceedings of SPIE, vol 5308, pp 520–528
Abou-Elailah A, Dufaux F, Farah J, Cagnazzo M, Pesquet-Popescu B (2013) Fusion of global and local motion estimation for distributed video coding. IEEE Trans Circuits Syst Video Technol 23(1):158–172
Artigas X, Ascenso J, Dalai M, Klomp S, Kubasov D, Ouaret M (2007) The discover codec: architecture, techniques and evaluation. In: Picture Coding Symposium (PCS” 07), MMSPL-CONF-2009-014
Ascenso J, Brites C, Pereira F (2006) Content adaptive wyner-ziv video coding driven by motion activity. In: Image processing, 2006 IEEE International Conference on, IEEE, pp605-608
Ascenso J, Brites C, Pereira F (2010) A flexible side information generation framework for distributed video coding. Multimedia Tools and Applications 48(3):381–409
Brites C, Ascenso J, Pedro JQ, Pereira F (2008) Evaluating a feedback channel based transform domain wyner–ziv video codec. Signal Process Image Commun 23(4):269–297
Ciuti G, Menciassi A, Dario P (2011) Capsule endoscopy: from current achievements to open challenges. IEEE Rev Biomed Eng 4:59–72
Dash B, Rup S, Mohapatra A, Majhi B, Swamy M (2017) Decoder driven side information generation using ensemble of mlp networks for distributed video coding. Multimedia Tools and Applications pp1–30
Deligiannis N, Verbist F, Slowack J, Rvd Walle, Schelkens P, Munteanu A (2014) Progressively refined wyner-ziv video coding for visual sensors. ACM Transactions on Sensor Networks (TOSN) 10(2):21
DISCOVER-Project ((accessed May 11, 2017)) Discover project page. http://www.img.lx.it.pt/discover/home.html
Dufaux F, Gao W, Tubaro S, Vetro A (2010) Distributed video coding: trends and perspectives. EURASIP Journal on Image and Video Processing 2009(1):508,167
El-Dahshan ESA, Hosny T, Salem ABM (2010) Hybrid intelligent techniques for mri brain images classification. Digital Signal Process 20(2):433–441
Girod B, Aaron AM, Rane S, Rebollo-Monedero D (2005) Distributed video coding. Proc IEEE 93(1):71–83
Gurav P, Patil G (2016) Full-reference video quality assessment using structural similarity (SSIM) index. J Electr Commun Sys 1(2)
Huang GB (2003) Learning capability and storage capacity of two-hidden-layer feedforward networks. IEEE Trans Neural Netw 14(2):274–281
Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In: Neural Networks, 2004. Proceedings. 2004 IEEE International Joint Conference on, IEEE, vol 2, pp 985–990
Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70(1):489–501
Huang GB, Wang DH, Lan Y (2011) Extreme learning machines: a survey. Int J Mach Learn Cybern 2(2):107–122
Huang X, Rakêt LL, Van Luong H, Nielsen M, Lauze F et al. (2011) Multi-hypothesis transform domain wyner-ziv video coding including optical flow. In: Multimedia Signal Processing (MMSP), 2011 IEEE 13th International Workshop on, IEEE, pp 1–6
Jia Y, Wang Y, Song R, Li J (2015) Decoder side information generation techniques in wyner-ziv video coding: a review. Multimedia Tools and Applications 74(6):1777–1803
Kubasov D, Nayak J, Guillemot C (2007) Optimal reconstruction in wyner-ziv video coding with multiple side information. In: Multimedia Signal Processing, 2007. MMSP 2007. IEEE 9th Workshop on, IEEE, pp 183–186
Li R, Liu H, Chen J, Gan Z (2016) Wavelet pyramid based multi-resolution bilateral motion estimation for frame rate up-conversion. IEICE Trans Info Sys 99 (1):208–218
Liu W, Dong L, Zeng W (2010) Motion refinement based progressive side-information estimation for wyner-ziv video coding. IEEE Trans Circuits Syst Video Technol 20(12):1863–1875
Mallat S, Hwang WL (1992) Singularity detection and processing with wavelets. IEEE Trans Inf Theory 38(2):617–643
Mallat S G (1989) A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Mach Intell 11(7):674–693
Mallt S (1989) Multifrequency channel decomposition of image and wavelet modals. IEEE Trans, Acoust, Speech, Signal Process 37:2091–2110
Martins R, Brites C, Ascenso J, Pereira F (2009) Refining side information for improved transform domain wyner-ziv video coding. IEEE Trans Circuits Syst Video Technol 19(9):1327–1341
Martins R, Brites C, Ascenso J, Pereira F (2010) Statistical motion learning for improved transform domain wyner–ziv video coding. IET image processing 4(1):28–41
Ortega JM (1987) Matrix theory. the university series in mathematics
Pereira F, Brites C, Ascenso J (2009) Distributed video coding: basics, codecs and performance. Distributed Source Coding pp 189–245
Petrazzuoli G, Cagnazzo M, Pesquet-Popescu B (2010) High order motion interpolation for side information improvement in dvc. In: Acoustics speech and signal processing (ICASSP), 2010 IEEE International Conference on, IEEE, pp 2342–2345
Puri R, Majumdar A, Ramchandran K (2007) Prism: a video coding paradigm with motion estimation at the decoder. IEEE Trans. Image Process. 16(10):2436–2448
Qing L, Zeng W (2014) Context-adaptive modeling for wavelet-domain distributed video coding. IEEE MultiMedia 21(4):84–93
Rencher AC (2003) Methods of multivariate analysis, vol 492. John Wiley & Sons
Rup S, Majhi B (2013) A mixed framework for transform domain wyner–ziv video coding. Optik-International Journal for Light and Electron Optics 124(21):4929–4938
Rup S, Majhi B, Padhy S (2014) An improved side information generation for distributed video coding. AEU-International Journal of Electronics and Communications 68(3):201–209
Said A, Pearlman WA (1996) A new, fast, and efficient image codec based on set partitioning in hierarchical trees. IEEE Trans Circuits Syst Video Technol 6(3):243–250
Shapiro JM (1993) Embedded image coding using zerotrees of wavelet coefficients. IEEE Trans Signal Process 41(12):3445–3462
Slepian D, Wolf J (1973) Noiseless coding of correlated information sources. IEEE Trans Inf Theory 19(4):471–480
Tagliasacchi M, Tubaro S, Sarti A (2006) On the modeling of motion in wyner-ziv video coding. In: Image processing, 2006 IEEE International Conference on, IEEE, pp 593-596
Taieb MH, Chouinard JY, Wang D (2013) Spatial correlation-based side information refinement for distributed video coding. EURASIP J Adv Signal Process 2013(1):168
Thao NTH, Tien VH, Van Xiem H, Duong DT et al (2016) Side information creation using adaptive block size for distributed video coding. In: Advanced technologies for communications (ATC), 2016 International Conference on, IEEE, pp 339–343
Van Luong H, Raket LL, Huang X, Forchhammer S (2012) Side information and noise learning for distributed video coding using optical flow and clustering. IEEE Trans Image Process 21(12):4782–4796
Van Luong H, Raket LL, Forchhammer S (2014) Re-estimation of motion and reconstruction for distributed video coding. IEEE Trans Image Process 23(7):2804–2819
Varodayan D, Chen D, Flierl M, Girod B (2008) Wyner–ziv coding of video with unsupervised motion vector learning. Signal Process Image Commun 23(5):369–378
Vetterli M, Herley C (1992) Wavelets and filter banks: Theory and design. IEEE Trans Signal Process 40(9):2207–2232
Wiegand T, Sullivan GJ, Bjontegaard G, Luthra A (2003) Overview of the H.264/AVC video coding standard. IEEE Trans Circuits Syst Video Technol 13(7):560–576
Wyner A, Ziv J (1976) The rate-distortion function for source coding with side information at the decoder. IEEE Trans Inf Theory 22(1):1–10
Yan C, Zhang Y, Xu J, Dai F, Li L, Dai Q, Wu F (2014a) A highly parallel framework for HEVC coding unit partitioning tree decision on many-core processors. IEEE Signal Process Lett 21(5):573–576
Yan C, Zhang Y, Xu J, Dai F, Zhang J, Dai Q, Wu F (2014b) Efficient parallel framework for HEVC motion estimation on many-core processors. IEEE Trans Circuits Syst Video Technol 24(12):2077–2089
Zhang Y, Zhao D, Liu H, Li Y, Ma S, Gao W (2012) Side information generation with auto regressive model for low-delay distributed video coding. J Vis Commun Image Represent 23(1):229–236
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Dash, B., Rup, S., Mohapatra, A. et al. Multi-resolution extreme learning machine-based side information estimation in distributed video coding. Multimed Tools Appl 77, 27301–27335 (2018). https://doi.org/10.1007/s11042-018-5921-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-5921-9