Abstract
Prior work in audio retrieval needs to generate audio templates by supervised learning and find similar audio clip based on pre-trained templates. This paper presents a new and efficient audio retrieval algorithm by unsupervised fuzzy clustering: first, audio features are extracted from compressed domain; second, these features are processed by temporal-spatial constrained fuzzy clustering, and the relevant audio clips can be represented by the clustering centroids; third, we use triangle tree to speedup the similarity measure. Relevance feedback is also implemented during retrieval. Therefore, the result can be adjusted according to users’ taste and is consistent with human perception.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Y. Wang, Z. Liu and J. Huang, “Multimedia content analysis using audio and visual information,” IEEE Signal Processing Magazine. vol. 17, no. 6, pp. 12–36, Nov. 2000. Invited paper in the special issue on joint audio-visual processing.
Foote J T, An overview of audio information retrieval, Multimedia Systems, 1999 7(1)
Fei Wu, Yueting Zhuang, Yin Zhang, and Yunhe Pan, “Hidden Markovia Model based Audio Semantic Retrieval”, Pattern Recognition and Artificial Intelligence, 14 (1):104–108, 2001
Jonathan T. Foote, “Content-Based Retrieval of Music and Audio”, In C.-C. J. Kuo et al., editor, Multimedia Storage and Archiving Systems II,Proc. of SPIE, Vol. 3229, pp. 138–147, 1997
Stan Z. Li and GuoDong Guo, “Content-based Audio Classification and Retrieval using SVM Learning”, the special session on Multimedia Information Indexing and Retrieval. The First IEEE Pacific-Rim Conference on Multimedia December 13-15, 2000, University of Sydney, Australia.
ISO/IEC JTC1/SC29, Information Technology-Generic Coding of Moving Pictures an Associate Audio Information-IS 13818 (Part 3, Audio), 1994.
Slaney M, Lyon R F, “A perceptual pitch detector”, In: Proc. Int. Conf. Acoustic, Speech, and Signal Processing 1990 (ICASSP 90). Albuquerque.
ISO/IEC JTC1/SC29, Information Technology-Coding of Moving Pictures and Associate Audio for Digital Storage Media at up to about 1.5Mbit/s-IS 11172 (Part 3,Audio), 1992.
Rui. Y, Huang, T. S., Ortega, M., Mehrotra, S., “Relevance Feedback: A Power Tool for Interactive Content-based Image Retrieval”, IEEE Trans. on Circuits and VideoTechnology, 1998.
JR.N. Dave and R. Krishnapuram, “ Robust clustering method: a unified view”, IEEE Transactions on Fuzzy systems, vol.5, no.2, pp.270–293, 1997
N.B. Karayiannis, J.C. Bezdek, “ an integrated approach to fuzzy learning vector quantization and fuzzy c-means clustering “, IEEE Trans. on Fuzzy systems, vol 5, no.4, pp 622–628, 1997
Andrew P. Berman, Linda G. Shapiro, “ Efficient Content-Based Retrieval: Experimental Results “, http://www.cs.washington.edu/research/imagedatabase/reportfin.htm
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhao, X., Zhuang, Y., Liu, J., Wu, F. (2002). Audio Retrieval with Fast Relevance Feedback Based on Constrained Fuzzy Clustering and Stored Index Table. In: Chen, YC., Chang, LW., Hsu, CT. (eds) Advances in Multimedia Information Processing — PCM 2002. PCM 2002. Lecture Notes in Computer Science, vol 2532. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36228-2_30
Download citation
DOI: https://doi.org/10.1007/3-540-36228-2_30
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00262-8
Online ISBN: 978-3-540-36228-9
eBook Packages: Springer Book Archive