This is the codebase corresponding to my summer project report, titled "A deep embedding system for melodic plagiarism detection", completed for the course MSc Data Science and Machine Learning at UCL. It is roughly broken down into three parts, corresponding to the experiments in the report.
-
Clone this repository:
git clone https://github.com/lcunn/plagdet.git cd plagdet
-
Create and activate the Conda environment:
conda env create -f environment.yml conda activate plagdet
-
Unzip the data folder:
unzip data.zip
This experiment involves training a Siamese network