Abstract
The temporal aggregation method applied for sports news videos detects and aggregates two kinds of shots: sequences of long shots, mainly studio shots unsuitable for content-based indexing, and sports player shots adequate for sports categorization. Hereby, it significantly reduces the number of frames analyzed in content-based indexing of TV sports news. The tests have shown that applying the temporal aggregation method it was possible to reject about half of video frames and despite this almost all sports scenes reported in TV sports news have been indexed. The paper examines the influence of the temporal aggregation on the detection of anchorperson shots in news videos. The TV news video editing is similar to that of TV sports news although news shots are longer in average than sports player shots. The interviews, statements, and commentaries are more significant in news than in sports news for content-based analyses because these statements are not necessarily spoken by an anchorman, so they are usually important informative parts of TV news. The experiments carried out on TV news and described in the paper have shown that anchorperson shots as well as interview shots may be more easily and faster selected when TV news videos are temporally aggregated. These experiments were performed in the Automatic Video Indexer AVI.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Choroś, K.: Temporal aggregation of video shots in TV sports news for detection and categorization of player scenes. In: Bădică, C., Nguyen, N.T., Brezovan, M. (eds.) ICCCI 2013. LNCS, vol. 8083, pp. 487–497. Springer, Heidelberg (2013)
Choroś, K., Gonet, M.: Effectiveness of video segmentation techniques for different categories of videos. In: New Trends in Multimedia and Network Information Systems, pp. 34–45. IOS Press, Amsterdam (2008)
Money, A.G., Agius, H.: Video summarisation: a conceptual framework and survey of the state of the art. J. of Visual Communication and Image Representation 19, 121–143 (2008)
Hu, W., Xie, N., Li, L., Zeng, X., Maybank, S.: A survey on visual content-based video indexing and retrieval. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 41(6), 797–819 (2011)
Del Fabro, M., Böszörmenyi, L.: State-of-the-art and future challenges in video scene detection: a survey. Multimedia Syst. 19(5), 427–454 (2013)
Asghar, M.N., Hussain, F., Manton, R.: Video indexing: a survey. Int. J. of Computer and Information Technology 3(1), 148–169 (2014)
Kompatsiaris, Y., Mérialdo, B., Lian, S. (eds.): TV Content Analysis: Techniques and Applications. CRC Press, Boca Raton (2012)
Ji, P., Cao, L., Zhang, X., Zhang, L., Wu, W.: News videos anchor person detection by shot clustering. Neurocomputing 123, 86–99 (2014)
Zheng, F., Li, S., Wu, H., Feng, J.: Anchor shot detection with diverse style backgrounds based on spatial-temporal slice analysis. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, Y.-P.P. (eds.) MMM 2010. LNCS, vol. 5916, pp. 676–682. Springer, Heidelberg (2010)
Broilo, M., Basso, A., De Natale, F.G.: Unsupervised anchorpersons differentiation in news video. In: Proc. of the 9th International Workshop on Content-Based Multimedia Indexing (CBMI), pp. 115–120. IEEE (2011)
Lee, H., Yu, J., Im, Y., Gil, J.M., Park, D.: A unified scheme of shot boundary detection and anchor shot detection in news video story parsing. Multimedia Tools and Applications 51(3), 1127–1145 (2011)
Montagnuolo, M., Messina, A., Borgotallo, R.: Automatic segmentation, aggregation and indexing of multimodal news information from television and the Internet. Int. J. of Information Studies 1(3), 200–211 (2010)
Dong, Y., Qin, G., Xiao, G., Lian, S., Chang, X.: Advanced news video parsing via visual characteristics of anchorperson scenes. Telecommunication Systems 54(3), 247–263 (2013)
El Khoury, E., Sénac, C., Joly, P.: Audiovisual diarization of people in video content. Multimedia Tools and Applications 68(3), 747–775 (2014)
Qu, B., Vallet, F., Carrive, J., Gravier, G.: Content-based inference of hierarchical structural grammar for recurrent TV programs using multiple sequence alignment. In: Proc. of the IEEE International Conference on Multimedia and Expo, ICME, pp. 1–6 (2014)
Choroś, K.: Video structure analysis and content-based indexing in the automatic video indexer AVI. In: Nguyen, N.T., Zgrzywa, A., Czyżewski, A. (eds.) Advances in Multimedia and Network Information System Technologies. AISC, vol. 80, pp. 79–90. Springer, Heidelberg (2010)
Choroś, K.: Video structure analysis for content-based indexing and categorisation of TV sports news. Int. J. of Intelligent Information and Database Systems 6(5), 451–465 (2012)
Choroś, K.: Automatic detection of headlines in temporally aggregated TV sports news videos. In: Proc. of the 8th International Symposium on Image and Signal Processing and Analysis (ISPA), pp. 147–152. IEEE (2013)
Choroś, K.: False and miss detections in temporal segmentation of TV sports news videos – causes and remedies. In: Zgrzywa, A., Choroś, K., Siemiński, A. (eds.) New Research in Multimedia and Internet Systems. AISC, vol. 314, pp. 35–46. Springer, Heidelberg (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Choroś, K. (2015). Automatic Fast Detection of Anchorperson Shots in Temporally Aggregated TV News Videos. In: Nguyen, N., Trawiński, B., Kosala, R. (eds) Intelligent Information and Database Systems. ACIIDS 2015. Lecture Notes in Computer Science(), vol 9012. Springer, Cham. https://doi.org/10.1007/978-3-319-15705-4_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-15705-4_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15704-7
Online ISBN: 978-3-319-15705-4
eBook Packages: Computer ScienceComputer Science (R0)