Computer Graphics Brazil: Content-based icons for music files

Published: 01 October 2008

Abstract

Content-based icons are in widespread use for image and video files, for which icons can easily be created as thumbnail-size pictures. We present a method to create content-based icons for music files as well, allowing visual data mining of music collections directly within the file listings of a standard operating system. The icons are generated automatically: a neural net determines the graphical parameters from acoustic features of the waveform stored in the audio files. A brief initial training ensures that the icons fit the user's visual and aural perception and musical preferences. User studies have been conducted to examine the quality of the subjective relation between music and icons. We also present extensions such as a space-saving graphical user interface for mobile applications and automatic one- and two-dimensional layouts.
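The mapping described in the abstract, from acoustic features to graphical parameters via a briefly trained neural net, can be illustrated with a minimal sketch. This is not the authors' implementation: the feature names, the icon parameters, the network size, and the training pairs below are all hypothetical, chosen only to show the general idea of fitting a small feed-forward net to a handful of user-supplied examples and then asking it for the icon parameters of an unseen track.

```python
import math
import random

# Hypothetical names: these features and icon parameters are illustrative
# assumptions, not the ones used in the paper.
FEATURES = ["tempo", "brightness", "loudness", "noisiness"]  # inputs in [0, 1]
ICON_PARAMS = ["hue", "saturation", "spikiness"]              # outputs in (0, 1)


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyNet:
    """One-hidden-layer feed-forward net with sigmoid units."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        # Each row holds the weights of one unit; the last entry is its bias.
        self.w1 = [[rng.uniform(-1, 1) for _ in range(n_in + 1)]
                   for _ in range(n_hidden)]
        self.w2 = [[rng.uniform(-1, 1) for _ in range(n_hidden + 1)]
                   for _ in range(n_out)]

    def forward(self, x):
        h = [sigmoid(sum(w * v for w, v in zip(row, x + [1.0])))
             for row in self.w1]
        y = [sigmoid(sum(w * v for w, v in zip(row, h + [1.0])))
             for row in self.w2]
        return h, y

    def predict(self, x):
        return self.forward(x)[1]

    def train_step(self, x, target, lr=0.5):
        """One gradient-descent step on the squared error of one example."""
        h, y = self.forward(x)
        # Output-layer deltas for sigmoid units with squared error.
        d_out = [(yi - ti) * yi * (1 - yi) for yi, ti in zip(y, target)]
        # Hidden-layer deltas, back-propagated through w2 before it changes.
        d_hid = [hj * (1 - hj) * sum(d_out[k] * self.w2[k][j]
                                     for k in range(len(y)))
                 for j, hj in enumerate(h)]
        for k, row in enumerate(self.w2):
            for j, v in enumerate(h + [1.0]):
                row[j] -= lr * d_out[k] * v
        for j, row in enumerate(self.w1):
            for i, v in enumerate(x + [1.0]):
                row[i] -= lr * d_hid[j] * v


def mean_squared_error(net, examples):
    total = 0.0
    for x, t in examples:
        total += sum((yi - ti) ** 2 for yi, ti in zip(net.predict(x), t))
    return total / len(examples)


# Made-up training pairs standing in for the user's initial choices.
EXAMPLES = [
    ([0.9, 0.8, 0.9, 0.7], [0.05, 0.9, 0.9]),  # fast, bright, loud track
    ([0.2, 0.3, 0.2, 0.1], [0.60, 0.3, 0.1]),  # slow, dark, quiet track
    ([0.5, 0.6, 0.5, 0.3], [0.30, 0.6, 0.4]),
]

net = TinyNet(len(FEATURES), 5, len(ICON_PARAMS))
error_before = mean_squared_error(net, EXAMPLES)
for _ in range(2000):
    for x, t in EXAMPLES:
        net.train_step(x, t)
error_after = mean_squared_error(net, EXAMPLES)

# Icon parameters for an unseen track, keyed by (hypothetical) parameter name.
icon = dict(zip(ICON_PARAMS, net.predict([0.85, 0.7, 0.8, 0.6])))
```

In a real system the inputs would come from an audio feature extractor and the outputs would drive a procedural icon renderer; both are outside the scope of this sketch.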

Published In

Computers and Graphics, Volume 32, Issue 5
October 2008
128 pages

Publisher

Pergamon Press, Inc.

United States

Author Tags

  1. Human-computer interfaces
  2. Music information retrieval
  3. Visual data mining

Qualifiers

  • Article

Cited By

  • (2023) TimToShape: Supporting Practice of Musical Instruments by Visualizing Timbre with 2D Shapes based on Crossmodal Correspondences. Proceedings of the 28th International Conference on Intelligent User Interfaces, pp. 850-865. DOI: 10.1145/3581641.3584053. Online publication date: 27-Mar-2023.
  • (2020) Introducing time series snippets: a new primitive for summarizing long time series. Data Mining and Knowledge Discovery 34:6, pp. 1713-1743. DOI: 10.1007/s10618-020-00702-y. Online publication date: 2-Jul-2020.
  • (2016) Icon set selection via human computation. Proceedings of the 24th Pacific Conference on Computer Graphics and Applications: Short Papers, pp. 1-6. DOI: 10.5555/3061425.3061426. Online publication date: 11-Oct-2016.
  • (2011) The art of metaphor. Proceedings of the 10th International Conference on Virtual Reality Continuum and Its Applications in Industry, pp. 171-178. DOI: 10.1145/2087756.2087780. Online publication date: 11-Dec-2011.
