Jayem-11/Swahili_speech_to_text

Swahili Speech to Text

Fine-tuning whisper-small for Swahili speech-to-text.

car photo credits: Devonyu

Description:

This project fine-tunes the small version of Whisper, a general-purpose speech recognition model created by OpenAI, to convert Swahili audio to text. Whisper is pretrained for automatic speech recognition (ASR) and speech translation on 680k hours of labelled data.
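To illustrate what "converting Swahili audio to text" looks like in practice, here is a minimal sketch that assumes the Hugging Face `transformers` library and its `automatic-speech-recognition` pipeline. The checkpoint name `openai/whisper-small` is the public base model, used as a placeholder because this README does not name the fine-tuned checkpoint; the function names `build_asr` and `transcribe_swahili` and the file `sample.wav` are hypothetical. This is not necessarily how the project's own notebook runs inference.

```python
from typing import Any, Callable, Optional


def build_asr(model_name: str = "openai/whisper-small") -> Callable[..., Any]:
    """Create a speech recognition pipeline.

    Assumption: `model_name` is the public base checkpoint; swap in the
    fine-tuned Swahili checkpoint if you have one.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    return pipeline("automatic-speech-recognition", model=model_name)


def transcribe_swahili(audio_path: str, asr: Optional[Callable[..., Any]] = None) -> str:
    """Transcribe one audio file to Swahili text."""
    if asr is None:
        asr = build_asr()
    # Ask Whisper to transcribe (not translate) and to decode as Swahili.
    result = asr(
        audio_path,
        generate_kwargs={"language": "swahili", "task": "transcribe"},
    )
    return result["text"].strip()
```

Typical usage would be `print(transcribe_swahili("sample.wav"))`. The `asr` argument is optional so that a pipeline can be built once and reused across many files instead of reloading the model on every call.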

Author

Table of Contents

A. Data
B. Machine Learning
C. Deploying

Design

A. Data

The data consists of about 82K instances of Swahili audio from Mozilla Common Voice. I got the dataset by participating in a Zindi competition. Join the completed competition to get access to the data.

B. Machine Learning

Evaluation

The model reached a word error rate (WER) of 8.365, as logged on wandb.
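For readers unfamiliar with the metric, WER is the word-level edit distance (substitutions, deletions, and insertions) between the model's transcript and the reference transcript, divided by the number of words in the reference. The sketch below is a plain-Python illustration of that definition, not the evaluation code used in this project; the function name `word_error_rate` and the example sentences are made up for the illustration. It returns a fraction, and this README does not state whether 8.365 is a fraction-based or percentage-based figure.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


# One substituted word out of three reference words.
wer = word_error_rate("habari ya asubuhi", "habari za asubuhi")
print(f"{100 * wer:.2f}")  # prints 33.33
```

Lower is better: a score of 0 means the transcript matches the reference word for word. In practice, both strings are usually normalized (lowercased, punctuation stripped) before scoring, which this sketch leaves to the caller.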

C. Deploying

  • Hit the text button. Jupyter notebook example
