Abstract
With the advancement of technology and computational power, crafting a chart-topping song has become more effortless than before, achievable from the convenience of our residences with just a computer at hand. This has led to the emergence of vast arrays of catalogs of music, containing a variety of genres and styles from different music makers with different ethnicities and backgrounds, resulting in a large database that clogs most music streaming platforms with little automated categorization. Based on the GTZAN audio dataset, this paper revisits the use of Convolution Neural Networks (CNN) for classifying different types of music genres. Using Mel-frequency cepstral coefficients (MFCC) features, the CNN model achieved an accuracy of 85%. As a result of the careful design of the CNN model, it is on par with many latest and greatest CNN frameworks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Negro, G., Kovács, B., Carroll, G.R.: Bustin’ out: the evolution of novelty and diversity in recorded music. In: Cattani, G., Deichmann, D., Ferriani, S. (ed.) The Generation, Recognition and Legitimation of Novelty (Research in the Sociology of Organizations, vol. 77, Emerald Publishing Limited, Bingley, pp. 51–87 (2022)
Sturm, B.L.: The GTZAN dataset: Its contents, its faults, their effects on evaluation, and its future use. arXiv preprint arXiv:1306.1461 (1999)
Retta, Ephrem A., et al.: Kinit classification in ethiopian chants, Azmaris and modern music: a new dataset and CNN benchmark. arXiv preprint arXiv:2201.08448 (2022)
Bora, K., Barman, M.P., Patowary, A.N., Bora, T.: Classification of assamese folk songs’ melody using supervised learning techniques. Indian J. Sci. Technol. 16(2), 89–96 (2023)
Falola, B., Alabi, E., Ogunajo, F., Fasae, O.: Music genre classification using machine and deep learning techniques: a review 03, 35–50 (2022). https://doi.org/10.17605/OSF.IO/FZQXW
Farajzadeh, N., Sadeghzadeh, N., Hashemzadeh, M.: PMG-Net: Persian music genre classification using deep neural networks. Entert. Comput. 44, 100518 (2023)
Hung, Y.N., Yang, C.H.H., Chen, P.Y., Lerch, A.: Low-resource music genre classification with cross-modal neural model reprogramming. In: ICASSP 2023–2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1–5. IEEE (2023)
Pelchat, N., Gelowitz, C.M.: Neural network music genre classification. Can. J. Electr. Comput. Eng. 43(3), 170–173 (2020)
Tang, C.P., Chui, K.L., Yu, Y.K., Zeng, Z., Wong, K.H.:Music genre classification using a hierarchical long short term memory (LSTM) model. In: Proceedings of SPIE 10828, Third International Workshop on Pattern Recognition, 108281B, 26 July 2018
Vishnupriya, S., Meenakshi, K.:Automatic music genre classification using convolution neural network. In: 2018 International Conference on Computer Communication and Informatics (ICCCI), pp. 1–4 (2018). https://doi.org/10.1109/ICCCI.2018.8441340
Chillara, S., et al.: Music genre classification using machine learning algorithms: a comparison. Int. Res. J. Eng. Technol. (IRJET) 6(05), 851–858 (2019)
Chathuranga, D., Jayaratne, L.: Automatic music genre classification of audio signals with machine learning approaches. GSTF J Comput 3, 14 (2013)
Yi, Y., Zhu, X., Yue, Y., Wang, W.: Music genre classification with LSTM based on time and frequency domain features. In: 2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS), pp. 678–682 (2021)
Dhall, A., Srinivasa Murthy, Y.V., Koolagudi, S.G.: Music genre classification with convolutional neural networks and comparison with f, q, and mel spectrogram-based images. In: Biswas, A., Wennekes, E., Hong, TP., Wieczorkowska, A. (eds.) Advances in Speech and Music Technology. Advances in Intelligent Systems and Computing, vol. 1320, pp. 235–248. Springer, Singapore (2021). https://doi.org/10.1007/978-981-33-6881-1_20
Li, T., Ogihara, M., Qi, L.: A comparative study on content-based music genre classification. In: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR ‘03). Association for Computing Machinery, New York, NY, USA, pp. 282–289 (2003)
Meng, A., Ahrendt, P., Larsen, J., Hansen, L.K.: Temporal feature integration for music genre classification. IEEE Trans. Audio Speech Lang. Process. 15(5), 1654–1664 (2007)
Li, T., Ogihara, M.: Music genre classification with taxonomy. In: Proceedings. (ICASSP ‘05). IEEE International Conference on Acoustics, Speech, and Signal Processing (2005)
Costa, Y.M.G., Oliveira, L.S., Koerich, A.L., Gouyon, F., Martins, J.G.: Music genre classification using LBP textural features. Sig. Process. 92(11) 2723–2737 (2012)
Rao, K.S., Manjunath, K.E.: Speech Recognition using Articulatory and Excitation Source Features, SpringerBriefs in Speech Technology (2017)
Wen, Z., et al.: Parallel attention of representation global time–frequency correlation for music genre classification. Multimedia Tools Appl. 1–21 (2023)
Prabhakar, S.K., Lee, S.-W.: Holistic approaches to music genre classification using efficient transfer and deep learning techniques. Expert Syst. Appl. 211, 118636 (2023)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Bawitlung, A., Dash, S.K. (2024). Genre Classification in Music using Convolutional Neural Networks. In: Badioze Zaman, H., et al. Advances in Visual Informatics. IVIC 2023. Lecture Notes in Computer Science, vol 14322. Springer, Singapore. https://doi.org/10.1007/978-981-99-7339-2_33
Download citation
DOI: https://doi.org/10.1007/978-981-99-7339-2_33
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-7338-5
Online ISBN: 978-981-99-7339-2
eBook Packages: Computer ScienceComputer Science (R0)