Academic Project: developed for a Deep Learning course (completed with a 4.0 GPA). This project demonstrates the application of Convolutional Neural Networks (CNNs) to audio classification.
This project implements a deep learning system that classifies orchestral musical instruments from audio recordings. It uses a Convolutional Neural Network (CNN) trained on Mel-frequency cepstral coefficients (MFCCs) and spectrograms extracted from audio samples.
The system is deployed as a full-stack web application with a FastAPI backend for inference and a modern React frontend for user interaction.
- Deep Learning Model: Custom CNN architecture optimized for audio feature classification.
- Audio Processing: Real-time spectrogram generation using Librosa.
- Interactive UI: Drag-and-drop interface built with React and Framer Motion.
- Production Ready: Modular architecture with automated deployment configuration (Railway/Nixpacks).
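The spectrogram front end can be sketched with plain NumPy (the project itself uses Librosa for this step). The frame parameters below — `n_fft=2048`, `hop_length=256`, 22050 Hz sample rate — are assumptions chosen because they reproduce the 1025×87 input shape the model section describes for a 1-second clip:

```python
import numpy as np

def stft_magnitude(y, n_fft=2048, hop_length=256):
    """Magnitude spectrogram via a centered short-time Fourier transform.

    NumPy-only sketch of what Librosa's stft does by default; n_fft=2048
    and hop_length=256 are assumptions matching the CNN's 1025x87 input.
    """
    # Center-pad so the first frame is centered on sample 0, as Librosa does
    y = np.pad(y, n_fft // 2, mode="reflect")
    n_frames = 1 + (len(y) - n_fft) // hop_length
    window = np.hanning(n_fft)
    frames = np.stack([
        y[i * hop_length : i * hop_length + n_fft] * window
        for i in range(n_frames)
    ])
    # rfft keeps n_fft // 2 + 1 = 1025 non-negative frequency bins
    return np.abs(np.fft.rfft(frames, axis=1)).T

one_second = np.random.default_rng(0).standard_normal(22050)  # 1 s at 22050 Hz
spec = stft_magnitude(one_second)
print(spec.shape)  # (1025, 87) -- matches the CNN input dimensions
```

In practice `librosa.stft` (plus a log/dB scaling) replaces this helper; the sketch only shows where the 1025 frequency bins and 87 time frames come from.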
The project follows a modular microservices-inspired architecture, separating the Machine Learning engine, Backend API, and Frontend UI.
```mermaid
graph TD
    User[User] -->|Uploads Audio| Frontend[React Frontend]
    Frontend -->|POST /predict| Backend[FastAPI Backend]
    Backend -->|Raw Audio| Preprocessing[Librosa Preprocessing]
    Preprocessing -->|Spectrogram| Model[CNN Model]
    Model -->|Prediction| Backend
    Backend -->|JSON Result| Frontend
    Frontend -->|Display| User
```
The codebase is organized using industry-standard engineering practices:
```
.
├── backend/                  # FastAPI Application
│   └── main.py               # API Entry point & Routes
├── frontend/                 # React Vite Application
│   ├── src/                  # React Components
│   └── dist/                 # Production Build
├── ml/                       # Machine Learning Engine
│   ├── model.py              # CNN Architecture Definition
│   ├── data_loader.py        # Audio Processing & Data Pipeline
│   ├── train.py              # Training Loop & Callbacks
│   └── predict.py            # Inference Logic
├── notebooks/                # Jupyter Notebooks
│   └── experimentation.ipynb # Initial Research & Experiments
├── data/                     # Dataset
│   └── raw/                  # Raw Audio Files (organized by class)
├── models/                   # Saved Model Artifacts
│   └── best_model.keras      # Best performing model
├── tests/                    # Unit & Integration Tests
├── nixpacks.toml             # Deployment Configuration
└── requirements.txt          # Python Dependencies
```
Prerequisites:

- Python 3.8+
- Node.js & npm (for the frontend build)

1. Clone the repository

```bash
git clone https://github.com/SanketBaviskar/Orchestral-Music-Instrument-Detector-using-CNN.git
cd Orchestral-Music-Instrument-Detector-using-CNN
```

2. Install Python Dependencies

```bash
pip install -r requirements.txt
```

3. Install Frontend Dependencies

```bash
cd frontend
npm install
cd ..
```

4. Build the Frontend

```bash
cd frontend
npm run build
cd ..
```

5. Start the Backend Server

```bash
uvicorn backend.main:app --reload
```

6. Access the App: open `http://127.0.0.1:8000` in your browser.
The model is a Sequential CNN designed to process 2D spectrograms:
- Input: 1025×87×1 spectrograms (1-second audio clips).
- Layers:
- 2x Convolutional Blocks (Conv2D + BatchNorm + MaxPool + Dropout).
- Flatten Layer.
- Dense Layer (64 units, ReLU).
- Output Layer (8 units, Softmax).
- Optimization: Adam Optimizer, Categorical Crossentropy Loss.
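A Keras sketch of this architecture follows. The layer sequence mirrors the list above, but the filter counts, kernel sizes, and dropout rates are assumptions — the authoritative definition is in `ml/model.py`:

```python
# Sketch of the architecture described above; hyperparameters such as
# filter counts (32/64), 3x3 kernels, and 0.25 dropout are assumptions.
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    layers.Input(shape=(1025, 87, 1)),          # 1-second spectrogram
    # Convolutional block 1: Conv2D + BatchNorm + MaxPool + Dropout
    layers.Conv2D(32, (3, 3), activation="relu", padding="same"),
    layers.BatchNormalization(),
    layers.MaxPooling2D((2, 2)),
    layers.Dropout(0.25),
    # Convolutional block 2
    layers.Conv2D(64, (3, 3), activation="relu", padding="same"),
    layers.BatchNormalization(),
    layers.MaxPooling2D((2, 2)),
    layers.Dropout(0.25),
    # Classification head
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(8, activation="softmax"),      # one unit per instrument
])
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```

The 8-unit softmax output corresponds to the eight instrument classes listed in the dataset section.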
The dataset consists of audio samples for the following instruments:
- Cello
- Contrabassoon
- Flute
- Mandolin
- Oboe
- Saxophone
- Trumpet
- Viola
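Decoding the model's 8-unit softmax output back to one of these instrument names is a one-liner with `argmax`. The alphabetical class ordering below is an assumption (directory-based data loaders typically sort class folders); verify it against the project's data pipeline before relying on it:

```python
import numpy as np

# Assumed alphabetical class order -- check against ml/data_loader.py.
CLASSES = ["Cello", "Contrabassoon", "Flute", "Mandolin",
           "Oboe", "Saxophone", "Trumpet", "Viola"]

def decode_prediction(probs):
    """Map an 8-unit softmax vector to (instrument, confidence)."""
    probs = np.asarray(probs)
    idx = int(np.argmax(probs))
    return CLASSES[idx], float(probs[idx])

example = np.array([0.02, 0.01, 0.05, 0.03, 0.04, 0.05, 0.75, 0.05])
print(decode_prediction(example))  # ('Trumpet', 0.75)
```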
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License.