Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1180639.1180844acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Very low complexity MPEG-2 to H.264 transcoding using machine learning

Published: 23 October 2006 Publication History

Abstract

This paper presents a novel macroblock mode decision algorithm for inter-frame prediction based on machine learning techniques to be used as part of a very low complexity MPEG-2 to H.264 video transcoder. Since coding mode decisions take up the most resources in video transcoding, a fast macro block (MB) mode estimation would lead to reduced complexity. The proposed approach is based on the hypothesis that MB coding mode decisions in H.264 video have a correlation with the distribution of the motion compensated residual in MPEG-2 video. We use machine learning tools to exploit the correlation and derive decision trees to classify the incoming MPEG-2 MBs into one of the 11 coding modes in H.264. The proposed approach reduces the H.264 MB mode computation process into a decision tree lookup with very low complexity. The proposed transcoder is compared with a reference transcoder comprised of a MPEG-2 decoder and an H.264 encoder. Our results show that the proposed transcoder reduces the H.264 encoding time by over 95% with negligible loss in quality and bitrate.

References

[1]
ITU-T RECOMMENDATION H.264 "Advanced Video Coding for Generic Audiovisual Services". May 2003.
[2]
Implementation Studies Group, "Main Results of the AVC Complexity Analysis". MPEG Document N4964, ISO/IEC JTC11/SC29/WG11, July 2002.
[3]
T. Shanableh and M. Ghanbari, "Heterogeneous Video Transcoding to Lower Spatio-Temporal Resolutions and Different Encoding Formats," IEEE Transactions on Multimedia, vol.2, no.2, June 2000.
[4]
A. Vetro, C. Christopoulos, and H. Sun "Video Transcoding Architectures and Techniques: An Overview". IEEE Signal Processing Magazine, vol. 20, no. 2, pp.18--29, March. 2003.
[5]
H. Kalva, A. Vetro, and H. Sun, "Performance Optimization of the MPEG-2 to MPEG-4 Video Transcoder". Proceeding of SPIE Conference on Microtechnologies for the New Millennium, VLSI Circuits and Systems, May 2003.
[6]
S. Dogan, A.H. Sadka and A.M. Kondoz, "Efficient MPEG-4/ H.263 Video Transcoder for Interoperability of Heterogeneous Multimedia Networks," IEE Electronics Letters, Vol. 35, No.11. pp. 863--864.
[7]
H. Kalva. "Issues in H.264/MPEG-2 Video Transcoding". Proceedings of Consumer Communications and Networking Conference, January 2004.
[8]
Y. Su, J. Xin, A. Vetro, and H. Sun, "Efficient MPEG-2 to H.264/AVC Intra Transcoding in Transform-Domain," IEEE International Symposium on Circuits and Systems, 2005. ISCAS 2005. pp. 1234--1237 Vol. 2, 23-26 May 2005.
[9]
B. Petljanski and H. Kalva, "DCT Domain Intra MB Mode Decision for MPEG-2 to H.264 Transcoding" Proceedings of the ICCE 2006. January 2006. pp. 419--420.
[10]
Y.-K. Lee, S.-S. Lee, and Y.-L. Lee, "MPEG-4 to H.264 Transcoding using Macroblock Statistics," Proceedings of the ICME 2006, Toronto, Canada, July 2006.
[11]
X. Lu, A.M. Tourapis, P. Yin, and J. Boyce, "Fast Mode Decision and Motion Estimation for H.264 with a Focus on MPEG-/H.264 Transcoding," Proceedings of 2005 IEEE International Symposium on Circuits and Systems (ISCAS), Kobe, Japan, May 2005.
[12]
Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, Reference Software to Committee Draft. JVT-F100 JM10.2. 2006.
[13]
G. Sullivan and T. Wiegand, "Rate-Distortion Optimization for Video Compression," IEEE Signal Processing Magazine, vol. 15, no. 6, pp. 74--90, November. 1998.
[14]
T. Wiegand et al., "Rate-Constrained Coder Control and Comparison of Video Coding Standards," IEEE Transactions on Circuits Systems and Video Technology, vol. 13, no. 7, pp. 688--703, July 2003.
[15]
A.M. Tourapis, O.C. Au, M.L. Liou, "Highly Efficient Predictive Zonal Algorithms for Fast Block-Matching Motion Estimation," IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, Issue 10, Oct. 2002.
[16]
Z. Chen, P. Zhou, and Y. He, "Fast Integer Pel and Fractional Pel Motion Estimation for JVT", 6th Meeting. Awaji, December 2002
[17]
M. Yang, H. Cui, K. Tang, "Efficient Tree Structured Motion Estimation using Successive Elimination," IEE Proceedings-Vision, Image and Signal Processing, Vol. 151, Issue 5, Oct. 2004.
[18]
Ian H. Witten and Eibe Frank, "Data Mining: Practical Machine Learning Tools and Techniques", 2nd Edition, Morgan Kaufmann, San Francisco, 2005.
[19]
J.R. Quinlan, "C4.5: Programs for Machine Learning", Morgan Kaufmann, 1993.

Cited By

View all
  • (2024)A systematic literature review on video transcoding acceleration: challenges, solutions, and trendsMultimedia Tools and Applications10.1007/s11042-023-17862-wOnline publication date: 12-Jan-2024
  • (2021)Knowledge formation of MPEG: Analysis using bibliographic clustering of citation networksSynthesiology10.5571/synth.2021.1_12021:1(1-17)Online publication date: 2021
  • (2018)Fast Inter Prediction Mode Decision Algorithm Based on Data MiningMachine Learning and Intelligent Communications10.1007/978-3-030-00557-3_10(95-102)Online publication date: 12-Oct-2018
  • Show More Cited By

Index Terms

  1. Very low complexity MPEG-2 to H.264 transcoding using machine learning

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '06: Proceedings of the 14th ACM international conference on Multimedia
    October 2006
    1072 pages
    ISBN:1595934472
    DOI:10.1145/1180639
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 23 October 2006

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. H.264
    2. MPEG-2
    3. inter-frame
    4. machine learning
    5. transcoding

    Qualifiers

    • Article

    Conference

    MM06
    MM06: The 14th ACM International Conference on Multimedia 2006
    October 23 - 27, 2006
    CA, Santa Barbara, USA

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 20 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)A systematic literature review on video transcoding acceleration: challenges, solutions, and trendsMultimedia Tools and Applications10.1007/s11042-023-17862-wOnline publication date: 12-Jan-2024
    • (2021)Knowledge formation of MPEG: Analysis using bibliographic clustering of citation networksSynthesiology10.5571/synth.2021.1_12021:1(1-17)Online publication date: 2021
    • (2018)Fast Inter Prediction Mode Decision Algorithm Based on Data MiningMachine Learning and Intelligent Communications10.1007/978-3-030-00557-3_10(95-102)Online publication date: 12-Oct-2018
    • (2013)An H.264/AVC to HEVC video transcoder based on mode mapping2013 IEEE International Conference on Image Processing10.1109/ICIP.2013.6738406(1972-1976)Online publication date: Sep-2013
    • (2009)Low complexity intra MB encoding in AVC/H.264IEEE Transactions on Consumer Electronics10.1109/TCE.2009.481444655:1(277-285)Online publication date: 1-Feb-2009
    • (2009)Study of Video Conference System Based on RTI and Floor TransmissionProceedings of the 2009 International Conference on Future Networks10.1109/ICFN.2009.14(108-112)Online publication date: 7-Mar-2009
    • (2009)Research and Realization of Improved Algorithm for H.264/AVC Oriented to Video Conference under the RTI FrameworkProceedings of the 2009 Third International Conference on Digital Society10.1109/ICDS.2009.27(128-132)Online publication date: 1-Feb-2009
    • (2009)Improved machine learning techniques for low complexity MPEG-2 to H.264 transcoding using optimized codecs2009 Digest of Technical Papers International Conference on Consumer Electronics10.1109/ICCE.2009.5012345(1-2)Online publication date: Jan-2009
    • (2008)Dynamic motion estimation for transcoding P frames in H.264 to MPEG-2 transcodersIEEE Transactions on Consumer Electronics10.1109/TCE.2008.456014354:2(657-662)Online publication date: 1-May-2008
    • (2006)The H.264 Video Coding StandardIEEE MultiMedia10.1109/MMUL.2006.9313:4(86-90)Online publication date: 1-Oct-2006

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media