Abstract
New video applications, such as 3D video and free viewpoint video, require efficient compression of multi-view video. In addition to temporal redundancy, exploiting inter-view redundancy is crucial to improving the performance of multi-view video coding. In this paper, we present a novel method for constructing the optimal inter-view prediction structure for multi-view video coding using simulated annealing. In the proposed model, the design of the prediction structure is recast as the arrangement of the coding order. A simulated annealing algorithm is then employed to minimize the total cost and thereby obtain the best coding order. The method is applicable to arbitrary irregular camera arrangements. Experimental results show that the annealing process converges rapidly to satisfactory results and that the generated optimal prediction structure outperforms the reference prediction structure of the joint multi-view video model (JMVM), with PSNR gains of 0.1–0.8 dB.
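The idea of searching over coding orders with simulated annealing can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the cost model, so the sketch assumes a hypothetical one in which the first view is intra-coded and each later view is predicted from the cheapest already-coded view. The names `total_cost`, `anneal_order`, `intra`, and `inter`, the swap neighbourhood, and the geometric cooling schedule are all illustrative assumptions.

```python
import math
import random


def total_cost(order, intra, inter):
    """Cost of a coding order under an ASSUMED model (not from the paper):
    the first view is intra-coded, and every later view is predicted from
    the cheapest view that has already been coded."""
    cost = intra[order[0]]
    for k in range(1, len(order)):
        view = order[k]
        cost += min(inter[ref][view] for ref in order[:k])
    return cost


def anneal_order(intra, inter, t0=10.0, alpha=0.95, steps_per_temp=50,
                 t_min=1e-3, seed=0):
    """Search for a low-cost coding order with simulated annealing.

    Neighbour move: swap two positions in the order (assumption).
    Acceptance: Metropolis criterion. Cooling: geometric, t <- alpha * t.
    Requires at least two views.
    """
    rng = random.Random(seed)
    n = len(intra)
    current = list(range(n))
    cur_cost = total_cost(current, intra, inter)
    best, best_cost = current[:], cur_cost
    t = t0
    while t > t_min:
        for _ in range(steps_per_temp):
            i, j = rng.sample(range(n), 2)
            cand = current[:]
            cand[i], cand[j] = cand[j], cand[i]
            cand_cost = total_cost(cand, intra, inter)
            delta = cand_cost - cur_cost
            # Always accept improvements; accept worse orders with
            # probability exp(-delta / t) so the search can escape minima.
            if delta <= 0 or rng.random() < math.exp(-delta / t):
                current, cur_cost = cand, cand_cost
                if cur_cost < best_cost:
                    best, best_cost = current[:], cur_cost
        t *= alpha
    return best, best_cost


# Hypothetical costs for four views on a line: adjacent views predict
# each other cheaply, distant views poorly. These numbers are made up.
intra = [10.0, 10.0, 10.0, 10.0]
inter = [[0, 2, 5, 9],
         [2, 0, 2, 5],
         [5, 2, 0, 2],
         [9, 5, 2, 0]]
order, cost = anneal_order(intra, inter)
```

Because the cost depends only on which views precede each view in the order, any camera layout can be handled by supplying a different `inter` matrix, which is in the spirit of the abstract's claim about irregular camera arrangements. In practice the costs would have to be estimated from the video data; how the paper does so is not stated in the abstract.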
Additional information
Project supported by the National Natural Science Foundation of China (No. 60802013) and the Zhejiang Provincial Natural Science Foundation of China (No. Y106574)
Cite this article
Zhu, Z., Li, Dx. & Zhang, M. Optimizing inter-view prediction structures for multi-view video coding using simulated annealing. J. Zhejiang Univ. - Sci. C 12, 155–162 (2011). https://doi.org/10.1631/jzus.C1000016
DOI: https://doi.org/10.1631/jzus.C1000016