Audio Feature Extraction & Visualization Toolkit

Librosa-powered utilities for consistent audio feature extraction, batch processing, and quick visualization. Designed to standardize parameters across datasets while keeping the workflow Pythonic and notebook-friendly.

Highlights

Standardized sample rate, frame, and hop settings for reproducible experiments
Single-file and batch pipelines with optional preemphasis, bandpass filtering, and onset trimming
Rich feature coverage (spectral, chroma, MFCC, RMS, ZCR, tonnetz, polynomial features)
Pandas-friendly outputs for downstream ML tasks
Lightweight matplotlib hooks to inspect extracted features

Project Layout

afe/AudioFeatureExtractor.py: Core wrappers around librosa feature extractors
afe/BatchExtractor.py: Batch pipeline with metadata indexing and merge/flatten helpers
afe/FeatureVisualizer.py: Minimal plots for spectrograms, Mel spectrograms, and chromagrams
afe/__init__.py: Module exports
example.wav: Sample audio for quick smoke tests
README.md, LICENSE: Documentation and licensing

Installation

Create and activate a virtual environment (recommended):
```
python -m venv .venv
.\.venv\Scripts\activate
```

Install runtime dependencies:

pip install numpy pandas librosa matplotlib

Quickstart

Extract features for a single file

from afe import AudioFeatureExtractor

afe = AudioFeatureExtractor(sr=22050, frame_length=1024, hop_ratio=4)
audio = afe.get_audio("example.wav")

mfcc = afe.extract_mfcc(audio, n_mfcc=12)
rms = afe.extract_rms(audio)
stft = afe.extract_stft(audio)

Batch extraction with metadata

BatchExtractor expects an index with a file_name column relative to audio_folder.

import pandas as pd
from afe import BatchExtractor

index = pd.DataFrame({"file_name": ["example.wav"]})
batch = BatchExtractor(
    audio_folder=".",
    audio_index=index,
    preemphasis=True,
    pre_coef=0.97,
    bp_filter=False,
)

batch.batch_extract_and_merge(
    extraction_methods=["melspec", "mfcc", "zcr"],
    results_folder="feature_extraction/",
)

Merge, flatten, and label

flat = batch.merge_and_flatten_features(
    extraction_methods=["mfcc", "zcr"],
    results_folder="feature_extraction/",
    label=False,
)
print(flat.head())

Visualize saved features

from afe import FeatureVisualizer

viz = FeatureVisualizer(feature_folder="feature_extraction/")
viz.plot_melspec("example")       # expects example_melspec_features.csv
viz.plot_chromagram("example")    # expects example_cstft_features.csv
viz.plot_spectrogram("example")   # expects example_stft_features.csv

Feature Abbreviations

Abbrev	Method
stft	`AudioFeatureExtractor.extract_stft`
cqt	`AudioFeatureExtractor.extract_cqt`
cstft	`AudioFeatureExtractor.extract_chroma_stft`
ccqt	`AudioFeatureExtractor.extract_chroma_cqt`
ccens	`AudioFeatureExtractor.extract_chroma_cens`
melspec	`AudioFeatureExtractor.extract_melspectrogram`
mfcc	`AudioFeatureExtractor.extract_mfcc`
rms	`AudioFeatureExtractor.extract_rms`
zcr	`AudioFeatureExtractor.extract_zero_crossing_rate`
centroid	`AudioFeatureExtractor.extract_spectral_centroid`
bandwidth	`AudioFeatureExtractor.extract_spectral_bandwidth`
contrast	`AudioFeatureExtractor.extract_spectral_contrast`
flatness	`AudioFeatureExtractor.extract_spectral_flatness`
rolloff	`AudioFeatureExtractor.extract_spectral_rolloff`
tonnetz	`AudioFeatureExtractor.extract_tonnetz`
poly	`AudioFeatureExtractor.extract_poly_features`

Preprocessing Flags

preemphasis: First-order differencing to emphasize higher frequencies
bp_filter: Hard bandpass on STFT using fmin/fmax
trim: Onset-based trimming of the leading silence

Roadmap

Tempo-related feature extractors and deltas
Feature inversion utilities (MFCC/Mel back to audio)
Additional visualization presets and notebooks

License

See LICENSE for details. This project is released under a permissive license to encourage reuse and extension.

Maintained by Apurba Kumar Show, IIT Kharagpur.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
afe		afe
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Audio Feature Extraction & Visualization Toolkit

Highlights

Project Layout

Installation

Quickstart

Extract features for a single file

Batch extraction with metadata

Merge, flatten, and label

Visualize saved features

Feature Abbreviations

Preprocessing Flags

Roadmap

License

About

Uh oh!

Releases

Packages

Languages

License

IITApurba/Audio-Feature-Lab

Folders and files

Latest commit

History

Repository files navigation

Audio Feature Extraction & Visualization Toolkit

Highlights

Project Layout

Installation

Quickstart

Extract features for a single file

Batch extraction with metadata

Merge, flatten, and label

Visualize saved features

Feature Abbreviations

Preprocessing Flags

Roadmap

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages