Skip to content

Exploring Music Recommendation Systems - Content Based Filtering and Collaborative Filtering

Notifications You must be signed in to change notification settings

psukh28/MusicRecSystem

Repository files navigation


Music Recommendation System

Introduction

This project explores music recommendation systems leveraging the Million Song Dataset (MSD) and EchoNest user play history. It focuses on algorithms like Alternating Least Squares (ALS) for matrix factorization and Word2Vec, TF-IDF, and Latent Dirichlet Allocation (LDA) for lyrics-based song recommendations.

Features

  • Collaborative Filtering: Using ALS algorithm for matrix factorization based on user listening data.
  • Content-Based Filtering: Employing TF-IDF, Word2Vec, and LDA for song recommendation based on lyrics data.
  • Scalable Architecture: Utilization of Amazon EMR clusters and Amazon S3 for handling large datasets.
Screenshot 2567-01-30 at 10 50 31

Dataset

The project utilizes the Million Song Dataset, enriched with additional data like lyrics from musiXmatch, user-generated tags from Last.fm, and listening data from EchoNest.

Methodology

  1. ALS Model for Collaborative Filtering: Matrix factorization using the ALS algorithm.
Screenshot 2567-01-30 at 10 49 55
  1. Content-Based Filtering Techniques:
    • TF-IDF for emphasizing the uniqueness of song lyrics. [Include TF-IDF formula]
    • Word2Vec for capturing contextual relationships in lyrics.
    • LDA for topic modeling and discovering thematic essence in songs.

Results

  • Implementation of ALS and content-based filtering models.
  • Analysis of model performance using metrics like Root Mean Square Error (RMSE).
  • Insights into thematic patterns and user preferences. [Include screenshot of thematic visualization]

About

Exploring Music Recommendation Systems - Content Based Filtering and Collaborative Filtering

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published