Skip to content

ziyuanzhao2000/Listen_and_Rearrange

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

As a cochlear implant user, I need to practice my listening to get used to the sound from the implant. This is called auditory rehabilitation. Inspired by another app called AngleSound, which contains an extensive collection of practices, but most of them are more for pediatric users, I had this idea to design an app that could benefit deaf adults with existing language abilities (yeah I know how that could sound like an oxymoron). In this app, the user is asked to reshuffle spoken sentence fragments by listening to a short audio clip and then reorder the provided words to match what is spoken in the audio clip.

As a demo, I have preprocessed the first chapter of one of my favorite books, Alice in Wonderland, to produce 174 spoken fragments with their corresponding aligned transcripts. These resource files are deposited in the repo and will be loaded by the python app once started. The requirements for installing the app is minimal. Please just use pip or conda to install the playsound package or replace its occurence in my demo code by any other package that can load and play mp3 files.

The raw text for the transcript is generously provided by Project Gutenberg. Raw audio files are generously provided by LibriVox. The app is smashed together in hours using tkinter, Python's standard GUI toolkit. The preprocessing of raw audio and transcript, including automatic segmentation at pauses and alignment of transcript to audio clips, are done with librosa and TorchAudio's pretrained wav2vec model. Code may be released upon sufficient interest. The preprocessing pipeline can be applied to any spoken material in English with minimal background noise. Effectiveness in case of speaker diarization has not been tested.

Please reach out to me at [email protected] if you are interested in the project, thanks!

A picture is worth a thousand words - here's a screenshot: Demo screenshot

About

A demo app built with tkinter for practice listening

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages