Assessing Efficacy of High and Low Level Features in Niche Music Genre Classification

This is a summary. For a detailed overview, please read the pdf inside the files.

Music genre classification is one of the more prominent tasks in Machine Learning. The GTZAN dataset has been heavily utilized in order to train and test new models. However, there are very few cases where genres from languages other than English are considered. In this project, we looked at genres that differed from traditional English genres like Pop and Rock and deepdived into the more vocal genres like Ghazal and Qawwali from the Urdu language. In essence, what we wanted to asses was not the models themselves but whether commonly used features used in models perform equally well for different genres from different languages. For comparison, we decided to compare high and low level features and their efficacy in classification of several genres, and used two different models - Support Vector Machines and Convolutional Neural Networks. Both SVM and CNN models have shown great applicability in classification of music.

The music genres we considered were:

Blues
Classical
EDM
Hip-Hop
Metal
Pop
Rap
Rock
Ghazal (Urdu)
Qawwali (Urdu)

For the high level features, Spotify was utilized as it can easily be accessed through Spotipy, a python library. Spotify automatically generates high-level features such as Acousticness and Instrumentalness which can easily be discerened by the human ear. For the low-level features, we extracted features such as MFCCs and Zero-Crossing Rate among others.

The notebooks are self-explanatory and each notebook documents the steps we took which were - extracting song previews from spotify, extracting and storing their features in a dataframe, conducting rudimentary exploratory analysis to discern any key features, training an SVM mdodel on low-level features, training an SVM model on high level features and finally, training a CNN model on MFCCs. For replicability, a lot of notebooks can simply be skipped as the data has already been extracted. You need only run the models.

In conclusion, we discovered that low-level features are excellent at classifying genres such as Pop. However, these features struggle immensely with classification of the genres from the Urdu language. High-level features performed better but were not completely accurate in classification of niche genres either.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
.cache		.cache
1. Scraping_Spotify.ipynb		1. Scraping_Spotify.ipynb
2. EDA.ipynb		2. EDA.ipynb
3. SVM_Spotify_Features.ipynb		3. SVM_Spotify_Features.ipynb
4. Custom_Feature_Extraction.ipynb		4. Custom_Feature_Extraction.ipynb
5. SVM_Extracted_Features.ipynb		5. SVM_Extracted_Features.ipynb
6. CNN_Using_MFCC.ipynb		6. CNN_Using_MFCC.ipynb
Assessing Efficacy of High and Low Level Features in Niche Music Classification - Bashir, Rossi.pdf		Assessing Efficacy of High and Low Level Features in Niche Music Classification - Bashir, Rossi.pdf
README.md		README.md
musicdata.csv		musicdata.csv
singers.csv		singers.csv
subsampled.csv		subsampled.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Assessing Efficacy of High and Low Level Features in Niche Music Genre Classification

This is a summary. For a detailed overview, please read the pdf inside the files.

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

nomanbash/Niche_Music_Genre_Classification

Folders and files

Latest commit

History

Repository files navigation

Assessing Efficacy of High and Low Level Features in Niche Music Genre Classification

This is a summary. For a detailed overview, please read the pdf inside the files.

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages