Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1991996.1992007acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

TV program segmentation using multi-modal information fusion

Published: 18 April 2011 Publication History

Abstract

A TV program segmentation algorithm is presented by the fusion of the multi-modal information in the large-scale videos. As "Inter-Programs" are generally inserted into the TV videos repeatedly, the macro structures of the videos can be effectively and automatically generated by identifying the video-audio features of the special sequences. The Electronic Program Guide (EPG) is used to organize the structures into the programs. Three sections are included in the algorithm, namely, the video-based non-supervised duplicate sequence detection, the audio-based special clip retrieval and the EPG-based 24-hour program segmentation. The algorithm has been tested in 60-day different-type TV videos. The F-measures of the multi-modal fusion and video-based duplicated sequence detection achieve the rates of over 98% and 96% respectively. These results show that the proposed method is highly efficient and effective for the TV Program segmentation.

References

[1]
H. Bai, C. Dong, L. Wang, G. Qin, K. Tao, X. Chang, and Y. Dong. Non-supervised macro segmentation of large-scale tv videos. In Multimedia Content Access: Algorithms and Systems V, 2011.
[2]
H. Bai, W. Hu, T. Wang, X. Tong, C. Liu, and Y. Zhang. A novel sports video logo detector based on motion analysis. In International Conference of Neural Information Processing, pages 448--457, 2006.
[3]
J. L. Bentley. Multidimensional binary search trees used for associative searching. Commun. ACM, 18(9):509--517, 1975.
[4]
S.-A. Berrani, G. Manson, and P. Lechat. A non-supervised approach for repeated sequence detection in tv broadcast streams. Image Commun., 23(7):525--537, 2008.
[5]
M. Datar, N. Immorlica, P. Indyk, and V. S. Mirrokni. Locality-sensitive hashing scheme based on p-stable distributions. In Annual symposium on Computational geometry, pages 253--262, 2004.
[6]
I. Döhring and R. Lienhart. Fast and effective features for recognizing recurring video clips in very large databases. In International Conference of Image Analysis and Processing, pages 65--70, 2007.
[7]
I. Döhring and R. Lienhart. Mining tv broadcasts for recurring video sequences. In International Conference on Image and Video Retrieval, pages 1--8, 2009.
[8]
A. Ekin and A. M. Tekalp. Automatic soccer video analysis and summarization. IEEE Trans. on Image Processing, 12:796--807, 2003.
[9]
A. Gionis, P. Indyk, and R. Motwani. Similarity search in high dimensions via hashing. In International Conference on Very Large Data Bases, pages 518--529, 1999.
[10]
J. Haitsma, T. Kalker, and J. Oostveen. An efficient database search strategy for audio fingerprinting. In IEEE Workshop on Multimedia Signal Processing, pages 178--181, 2002.
[11]
P. Indyk and R. Motwani. Approximate nearest neighbors: towards removing the curse of dimensionality. In ACM symposium on Theory of computing, pages 604--613, 1998.
[12]
Y. Ke, D. Hoiem, and R. Sukthankar. Computer vision for music identification: Video demonstration. In CVPR, 2005.
[13]
R. Lienhart, C. Kuhmünch, and W. Effelsberg. On the detection and recognition of television commercials. In International Conference on Multimedia Computing and Systems, pages 509--516, 1997.
[14]
G. Manson, X. Naturel, and S.-A. Berrani. Online macro-segmentation of television streams. In International Conference on Advances in Multimedia Modeling, pages 220--221, 2009.
[15]
X. Naturel and P. Gros. A fast shot matching strategy for detecting duplicate sequences in a television stream. In International workshop on Computer vision meets databases, pages 21--27, 2005.
[16]
X. Naturel and P. Gros. Detecting repeats for video structuring. Multimedia Tools Appl., 38(2):233--252, 2008.
[17]
S. B. Needleman and C. D. Wunsch. An efficient method applicable to the search for similarities in the amino acid sequences of two proteins. Journal of Molecular Biology, 48:444--453, 1970.
[18]
L. Rabiner and B. H. Juang. Fundamentals of Speech Recognition. Prentice-Hall, Inc., 1993.
[19]
P. H. Sellers. An algorithm for the distance between two finite sequences. Journal of Combinatorial Theory, A16:253--258, 1974.
[20]
A. Shivadas and J. M. Gauch. Real-time commercial recognition using color moments and hashing. In Canadian Conference on Computer and Robot Vision, pages 465--472, 2007.
[21]
A. F. Smeaton, P. Over, and A. R. Doherty. Video shot boundary detection: Seven years of trecvid activity. Computer Vision and Image Understanding, 114(4):411--418, 2010.
[22]
A. F. Smeaton, P. Over, and W. Kraaij. Evaluation campaigns and trecvid. In International Workshop on Multimedia Information Retrieval, pages 321--330, 2006.
[23]
X. Yang, Q. Tian, and P. Xue. Efficient short video repeat identification with application to news video structure analysis. IEEE Transactions on Multimedia, 9(3):600--609, 2007.
[24]
J. Yuan, W. Wang, J. Meng, Y. Wu, and D. Li. Mining repetitive clips through finding continuous paths. In International conference on Multimedia, pages 289--292, 2007.

Cited By

View all

Index Terms

  1. TV program segmentation using multi-modal information fusion

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '11: Proceedings of the 1st ACM International Conference on Multimedia Retrieval
    April 2011
    512 pages
    ISBN:9781450303361
    DOI:10.1145/1991996
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 April 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. EPG
    2. duplicate sequence detection
    3. fusion
    4. locality-sensitive hashing
    5. non-supervised
    6. program segmentation
    7. video-audio

    Qualifiers

    • Research-article

    Conference

    ICMR'11
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2015)A novel feature fusion based framework for efficient shot indexing to massive web videosTelecommunications Systems10.1007/s11235-014-9945-959:3(401-413)Online publication date: 1-Jul-2015
    • (2015)SAPTEMultimedia Tools and Applications10.1007/s11042-014-2311-974:23(10923-10963)Online publication date: 1-Dec-2015
    • (2013)Macro Segmentation and Content Analysis of TV Broadcast StreamApplied Mechanics and Materials10.4028/www.scientific.net/AMM.284-287.3194284-287(3194-3198)Online publication date: Jan-2013
    • (2011)A word-based approach for duplicate picture in picture sequence detection2011 4th IEEE International Conference on Broadband Network and Multimedia Technology10.1109/ICBNMT.2011.6155942(286-290)Online publication date: Oct-2011

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media