Abstract
With the rapid growing amount of multimedia, content-based information retrieval has become more and more important. As a significant clue for video indexing and retrieval, audio detection and classification attracts much more attention and becomes a hot topic. On the basis of the priori model of news video structure, a selective ensemble support vector machines (SE-SVMs) is proposed to detect and classify the news audio into 4 types, i.e., silence, music, speech, and speech with music background. Experiments with news audio clips of 8514 seconds in total length illustrate that the average accuracy rate of the proposed audio classification method reaches to 98.9%, which is much better than that of the available SVM-based or traditional threshold-based method.
This work was supported by National Natural Science Foundation of China (No.60202004) and the Key Project of Chinese Ministry of Education (No.104173).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Kimber, D., Wilcox, L.: Acoustic Segmentation for Audio Browsers. In: Proc. of Interface Conference, Sydney, Australia (1996)
Scheirer, E., Slaney, M.: Construction and Evaluation of a Robust Multifeature Speech/Music Discriminator. In: ICASSP 1997, Munich, Germany, vol. 2, pp. 1331–1334 (1997)
Lu, L., Li, S.Z., Zhang, H.J.: Content-based Audio Segmentation Using Support Vector Machine. ICME 1, 749–752 (2001)
Vapnik, V.N.: The Nature of Statistical Learning Theory, 2nd edn. Springer, Heidelberg (1995)
Li, S.Z., Guo, G.D.: Content-based Audio Classification and Retrieval Using SVM Learning. In: Proc. of the Special Session on Multimedia Information Indexing and Retrieval, pp. 156–163. University of Sydney, Sydney (2000)
Hansen, L.K., Salamon, P.: Neural Network Ensembles. IEEE Trans. Pattern Analysis and Machine Intelligence 12, 993–1001 (1990)
Maclin, R., Shavlik, J.W.: Combining the Predictions of Multiple Classifiers: Using Competitive Learning to Initialize Neural Networks. In: Proc. of the 14th International Joint Conference on Artificial Intelligence, Montreal, Canada, pp. 524–530 (1995)
Wang, Y., Liu, Z., Huang, J.: Multimedia Content Analysis Using Both Audio and Visual Cues. IEEE Signal Processing Magazine, 12–36 (2000)
Zhang, T., Kuo, C.C.: Audio Content Analysis for Online Audiovisual Data Segmentation. IEEE Trans. on Speech and Audio Processing 9, 441–457 (2001)
Li, Y., Dorai, C.: SVM-based Audio Classification for Instructional Video Analysis. In: ICASSP, Montreal, Quebec, Canada, pp. 897–900 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Han, B., Gao, X., Ji, H. (2005). Automatic News Audio Classification Based on Selective Ensemble SVMs. In: Wang, J., Liao, XF., Yi, Z. (eds) Advances in Neural Networks – ISNN 2005. ISNN 2005. Lecture Notes in Computer Science, vol 3497. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11427445_59
Download citation
DOI: https://doi.org/10.1007/11427445_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25913-8
Online ISBN: 978-3-540-32067-8
eBook Packages: Computer ScienceComputer Science (R0)