Skip to content

NeMo Mel Spectogram#1025

Draft
nenad1002 wants to merge 3 commits intomainfrom
nebanfic/nemo-mel-spec
Draft

NeMo Mel Spectogram#1025
nenad1002 wants to merge 3 commits intomainfrom
nebanfic/nemo-mel-spec

Conversation

@nenad1002
Copy link
Contributor

WORK IN PROGRESS

NeMo mel spectrogram implements the NeMo/librosa-specific feature extraction pipeline, Slaney mel scale with area normalization, global pre-emphasis, natural log, and a streaming mode with overlap state. Nemotron and Parakeet ASR models will use this functionality.

This differs fundamentally from the Whisper path (log10 + clamp + affine normalization, no pre-emphasis, no streaming), hence created as a separate shared API as there is not much common functionality, and merging would make it much harder to understand.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant