A Comparative Study of Convolutional Neural Networks and Recurrent Neural Networks for Chord Recognition

Authors

  • Hania Nawaz Khan Department of Software Engineering, Balochistan University of Information, Technology, Engineering and Management Sciences, Quetta, 87650 Pakistan
  • Sibghatullah Bazai Department of Software Engineering, Balochistan University of Information, Technology, Engineering and Management Sciences, Quetta, 87650 Pakistan
  • Zubair Zaland Department of Software Engineering, Balochistan University of Information, Technology, Engineering and Management Sciences, Quetta, 87650 Pakistan
  • Sibghatullah Durrani Department of Software Engineering, Balochistan University of Information, Technology, Engineering and Management Sciences, Quetta, 87650 Pakistan
  • Saad Aslam Smart Photonics Research Laboratory and Department of Computing and Information Systems, School of Engineering and Technology, Sunway University, 47500 Petaling Jaya, Malaysia.
  • Angela Amphawan Smart Photonics Research Laboratory and Department of Computing and Information Systems, School of Engineering and Technology, Sunway University, 47500 Petaling Jaya, Malaysia.
  • Fatima Ali Department of Software Engineering, Balochistan University of Information, Technology, Engineering and Management Sciences, Quetta, 87650 Pakistan
  • Tse-Kian Neo CAMELOT, Faculty of Creative Multimedia, Multimedia University, Cyberjaya 63100, Selangor, Malaysia

DOI:

https://doi.org/10.15379/ijmst.v10i2.1837

Keywords:

Chord, Pitch, Timbre, Features, Music, Deep Learning

Abstract

Using Mel-spectrograms, this study evaluates the effectiveness of Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNNs). Mel-spectrograms are justified by their non- linearity and similarity to the human hearing system. This study uses over 200 tracks by The Beatles and Queen collected through the Music Information Retrieval Evaluation Exchange. Data augmentation approaches are used to increase accuracy on unusual chords. This paper presents a 3-layer 2D CNN model trained on major and minor chords and then expanded to different types of chords. The dataset demonstrates that both models can recognize musical chords across various genres. We compare the proposed results to the existing literature and demonstrate the effectiveness of the proposed methodology. As a result of our analysis, we found that the CNN and RNN models were 79% and 76% accurate, respectively. The presented findings suggest that CNNs and RNNs are suitable models for chord recognition using Mel-spectrograms. Data augmentation can be an effective technique for improving accuracy on rare chords.

Downloads

Download data is not yet available.

Downloads

Published

2023-09-05

How to Cite

[1]
H. N. . Khan, “A Comparative Study of Convolutional Neural Networks and Recurrent Neural Networks for Chord Recognition”, ijmst, vol. 10, no. 2, pp. 1617-1630, Sep. 2023.