Abstract
Multiview video coding (MVC) exploits mode decision, motion estimation and disparity estimation to achieve high compression ratio, which results in an extensive computational complexity. This paper presents an efficient mode decision approach for MVC using a macroblock (MB) position constraint model (MPCM). The proposed approach reduces the number of candidate modes by utilizing the mode correlation and rate distortion cost (RD cost) in the previously encoded frames/views. Specifically, the mode correlations both in the temporal-spatial domain and the inter-view are modeled with MPCM. Then, MPCM is exploited to select the optimal prediction direction for the current encoding MB. Finally, the inter mode is early determined in the optimal prediction direction. Experimental results show that the proposed method can save 86.03 % of encoding time compared with the exhaustive mode decision used in the reference software of joint multiview video coding, with only 0.077 dB loss in Bjontegaard delta peak signal-to-noise ratio (BDPSNR) and 2.29 % increment of the total Bjontegaard delta bit rate (BDBR), which is superior to the performances of state-of-the-art approaches.
Similar content being viewed by others
References
Vetro, A., Wiegand, T., Sullivan, G.J.: Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard. Proc. IEEE 99(4), 626–642 (2011)
Vetro, A., Tourapis, A.M., Muller, K., Chen, T.: 3D-TV content storage and transimission. IEEE Trans. Broadcast. 572, 384–394 (2011)
Smolic, A., Muller, K., Stefanoski, N., Ostermann, J., Gotchev, A., Akar, G.B., Triantafyllidis, G., Koz, A.: Coding algorithms for 3DTV-A survey. IEEE Trans. Circ. Syst. Video Technol. 17(11), 1606–1621 (2007)
Vetro, A., Pandit, P., Kimata, H.: Joint multiview video model (JMVM) 7.0. ISO/IEC JTC1/SC29/WG11 and ITU-T Q6/SG16, Doc. JVT-Z207 (2008)
Chen, Y., Pandit, P., Yea, S.: WD 4 reference software for MVC (JMVC). ISO/IEC JTC1/SC29/WG11 and ITU-T Q6/SG16, Doc. JVT-AD207 (2009)
Zhang, Y., Kwong, S., Xu, L., Jiang, G.Y.: Direct mode early decision optimization based on rate distortion cost property and inter-view correlation. IEEE Trans. Broadcast. 59(2), 390–398 (2013)
Wang, F.S., Zeng, H.Q., Shen, Q.H., Du, S.D.: Efficient early direct mode decision for multiview video coding. Signal Process. Image Commun. 28(7), 736–744 (2013)
Shen, L.Q., Liu, Z., Yan, T., Zhang, Z.Y., An, P.: Early SKIP mode decision for MVC using inter-view correlation. Signal Process. Image Commun. 25(2), 88–93 (2010)
Zhu, W., Tian, X., Zhou, F., Chen, Y.W.: Fast inter mode decision based on textural segmentation and correlations for multiview video coding. IEEE Trans. Consum. Electron. 56(3), 1696–1704 (2010)
Yeh, C.H., Li, M.F., Chen, M.J., Chi, M.C., Huang, X.X., Chi, H.W.: Fast mode decision algorithm through inter-view rate-distortion prediction for multiview video coding system. IEEE Trans. Ind. Inform. 10(1), 594–603 (2014)
Zeng, H.Q., Ma, K.K., Cai, C.H.: Fast multiview video coding using adaptive prediction structure and hierarchical mode decision. IEEE Trans. Circ. Syst. Video Technol. 24(9), 1566–1578 (2014)
Chan, C.C., Tang, C.W.: Coding statistics based fast mode decision for multiview video coding. J. Vis. Commun. Image Represent. 24(6), 686–699 (2013)
Shen, L.Q., Liu, Z., An, P., Ma, R., Zhang, Z.Y.: Low-complexity mode decision for MVC. IEEE Trans. Circ. Syst. Video Technol. 21(6), 837–843 (2011)
Zeng, H.Q., Ma, K.K., Cai, C.H.: Fast mode decision for multiview video coding using mode correlation. IEEE Trans. Circ. Syst. Video Technol. 21(11), 1659–1666 (2011)
Zhao, T.S., Kwong, S., Wang, H.L., Wang, Z., Pan, Z., Kou, C.-C.J.: Multiview coding mode decision with hybrid optimal stopping model. IEEE Trans. Image Process. 22(4), 1598–1609 (2013)
G. Bjontegaard. Calculation of average PSNR difference between RD-curves. Doc. VCEG-M33, Austin, TX (2001)
X. Li, M. Wien, J. R. Ohm. Rate-complexity-distortion evaluation for hybrid video coding. In: IEEE International Conference on Multimedia and Expo (ICME), pp. 685–190, Suntec City (2010)
Acknowledgments
This work is supported in part by the National Natural Science Foundation of China (61379143, 61232016, U1405254), the Specialized Research Fund for the Doctoral Program of Higher Education (SRFDP) under Grant 20120161110014 and the S&T Program of Xuzhou City (XM13B119) and the PAPD fund. The authors greatly appreciate Mr Moses Odero for his nice help in improving the English usages in this paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, Y., Yang, G., Zhu, Y. et al. Adaptive mode decision for multiview video coding based on macroblock position constraint model. J Real-Time Image Proc 12, 575–582 (2016). https://doi.org/10.1007/s11554-015-0527-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-015-0527-1