Audio Retrieval with Fast Relevance Feedback Based on Constrained Fuzzy Clustering and Stored Index Table

Xueyan Zhao³,
Yueting Zhuang³,
Junwei Liu³ &
…
Fei Wu³

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2532))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

412 Accesses
6 Citations

Abstract

Prior work in audio retrieval needs to generate audio templates by supervised learning and find similar audio clip based on pre-trained templates. This paper presents a new and efficient audio retrieval algorithm by unsupervised fuzzy clustering: first, audio features are extracted from compressed domain; second, these features are processed by temporal-spatial constrained fuzzy clustering, and the relevant audio clips can be represented by the clustering centroids; third, we use triangle tree to speedup the similarity measure. Relevance feedback is also implemented during retrieval. Therefore, the result can be adjusted according to users’ taste and is consistent with human perception.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Integrating fuzzy C-means clustering and fuzzy inference system for audiovisual quality of experience

Article 07 November 2023

A Hierarchical Retrieval Method Based on Hash Table for Audio Fingerprinting

Efficient audio-driven multimedia indexing through similarity-based speech / music discrimination

Article 10 January 2017

References

Y. Wang, Z. Liu and J. Huang, “Multimedia content analysis using audio and visual information,” IEEE Signal Processing Magazine. vol. 17, no. 6, pp. 12–36, Nov. 2000. Invited paper in the special issue on joint audio-visual processing.
Google Scholar
Foote J T, An overview of audio information retrieval, Multimedia Systems, 1999 7(1)
Google Scholar
Fei Wu, Yueting Zhuang, Yin Zhang, and Yunhe Pan, “Hidden Markovia Model based Audio Semantic Retrieval”, Pattern Recognition and Artificial Intelligence, 14 (1):104–108, 2001
Google Scholar
Jonathan T. Foote, “Content-Based Retrieval of Music and Audio”, In C.-C. J. Kuo et al., editor, Multimedia Storage and Archiving Systems II,Proc. of SPIE, Vol. 3229, pp. 138–147, 1997
Google Scholar
Stan Z. Li and GuoDong Guo, “Content-based Audio Classification and Retrieval using SVM Learning”, the special session on Multimedia Information Indexing and Retrieval. The First IEEE Pacific-Rim Conference on Multimedia December 13-15, 2000, University of Sydney, Australia.
Google Scholar
ISO/IEC JTC1/SC29, Information Technology-Generic Coding of Moving Pictures an Associate Audio Information-IS 13818 (Part 3, Audio), 1994.
Google Scholar
Slaney M, Lyon R F, “A perceptual pitch detector”, In: Proc. Int. Conf. Acoustic, Speech, and Signal Processing 1990 (ICASSP 90). Albuquerque.
Google Scholar
ISO/IEC JTC1/SC29, Information Technology-Coding of Moving Pictures and Associate Audio for Digital Storage Media at up to about 1.5Mbit/s-IS 11172 (Part 3,Audio), 1992.
Google Scholar
Rui. Y, Huang, T. S., Ortega, M., Mehrotra, S., “Relevance Feedback: A Power Tool for Interactive Content-based Image Retrieval”, IEEE Trans. on Circuits and VideoTechnology, 1998.
Google Scholar
JR.N. Dave and R. Krishnapuram, “ Robust clustering method: a unified view”, IEEE Transactions on Fuzzy systems, vol.5, no.2, pp.270–293, 1997
Article Google Scholar
N.B. Karayiannis, J.C. Bezdek, “ an integrated approach to fuzzy learning vector quantization and fuzzy c-means clustering “, IEEE Trans. on Fuzzy systems, vol 5, no.4, pp 622–628, 1997
Article Google Scholar
Andrew P. Berman, Linda G. Shapiro, “ Efficient Content-Based Retrieval: Experimental Results “, http://www.cs.washington.edu/research/imagedatabase/reportfin.htm

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Microsoft Visual Perception laboratory of Zhejiang University Zhejiang University, Hangzhou, China
Xueyan Zhao, Yueting Zhuang, Junwei Liu & Fei Wu

Authors

Xueyan Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Yueting Zhuang
View author publications
You can also search for this author in PubMed Google Scholar
Junwei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Fei Wu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Electrical Engineering, National Tsing Hua University, Hsinchu, Taiwan
Yung-Chang Chen
Department of Computer Science, National Tsing Hua University, Hsinchu, Taiwan
Long-Wen Chang & Chiou-Ting Hsu &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhao, X., Zhuang, Y., Liu, J., Wu, F. (2002). Audio Retrieval with Fast Relevance Feedback Based on Constrained Fuzzy Clustering and Stored Index Table. In: Chen, YC., Chang, LW., Hsu, CT. (eds) Advances in Multimedia Information Processing — PCM 2002. PCM 2002. Lecture Notes in Computer Science, vol 2532. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36228-2_30

Download citation

DOI: https://doi.org/10.1007/3-540-36228-2_30
Published: 16 December 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00262-8
Online ISBN: 978-3-540-36228-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Audio Retrieval with Fast Relevance Feedback Based on Constrained Fuzzy Clustering and Stored Index Table

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Integrating fuzzy C-means clustering and fuzzy inference system for audiovisual quality of experience

A Hierarchical Retrieval Method Based on Hash Table for Audio Fingerprinting

Efficient audio-driven multimedia indexing through similarity-based speech / music discrimination

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Audio Retrieval with Fast Relevance Feedback Based on Constrained Fuzzy Clustering and Stored Index Table

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Integrating fuzzy C-means clustering and fuzzy inference system for audiovisual quality of experience

A Hierarchical Retrieval Method Based on Hash Table for Audio Fingerprinting

Efficient audio-driven multimedia indexing through similarity-based speech / music discrimination

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation