Audio / Youtube to Noun Word-Cloud

Generate a word cloud from nouns extracted from speech audio

This is a Python 3 program that generates a word cloud of the most frequently used nouns and proper nouns spoken in a YouTube video or an audio file containing speech. It uses pocketsphinx for speech-to-text and nltk for word tokenizing.
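The noun-counting step can be sketched as follows. This is a minimal illustration, not the script's actual code: it assumes the transcript has already been tokenized and tagged into (word, tag) pairs using Penn Treebank tags, which is the format nltk.pos_tag returns by default. The function name count_nouns and the hand-tagged sample are hypothetical.

```python
from collections import Counter


def count_nouns(tagged_tokens):
    """Count nouns in (word, tag) pairs that use Penn Treebank tags."""
    counts = Counter()
    for word, tag in tagged_tokens:
        # NN, NNS (common nouns) and NNP, NNPS (proper nouns) all start with "NN".
        if tag.startswith("NN"):
            counts[word] += 1
    return counts


# Hand-tagged sample standing in for real tagger output (assumption).
sample = [
    ("Alice", "NNP"), ("likes", "VBZ"), ("cats", "NNS"), ("and", "CC"),
    ("Alice", "NNP"), ("feeds", "VBZ"), ("cats", "NNS"), ("daily", "RB"),
]
noun_counts = count_nouns(sample)
print(noun_counts.most_common(2))
```

A frequency mapping like noun_counts is the kind of input a word-cloud renderer can size words by; whether the script builds its cloud this way is not stated here.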

To generate a word cloud from a YouTube video:

python3 audio_noun_wordcloud.py --url=<youtube_video_url>

To generate a word cloud from a .wav audio file on disk:

python3 audio_noun_wordcloud.py --path=<path_to_wav>

Options

Use the --save flag with a destination path to save the generated word cloud as a .png image:

python3 audio_noun_wordcloud.py --url=<youtube_video_url> --save path/
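For readers who want to see how the three flags fit together, here is a rough argparse sketch. It is an assumption about how such an interface could be declared, not the script's own parsing code: in particular, treating --url and --path as mutually exclusive and the help strings are guesses, and build_parser is a hypothetical name.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the flags shown in this README."""
    parser = argparse.ArgumentParser(
        prog="audio_noun_wordcloud.py",
        description="Generate a noun word cloud from speech audio.",
    )
    # Assumption: exactly one input source is given per run.
    source = parser.add_mutually_exclusive_group(required=True)
    source.add_argument("--url", help="URL of a YouTube video")
    source.add_argument("--path", help="path to a .wav file on disk")
    # Optional: where to write the .png image.
    parser.add_argument("--save", default=None, help="path to save the .png to")
    return parser


# Parse the same argument shapes used in the examples above.
args = build_parser().parse_args(["--url=https://example.com/watch", "--save", "path/"])
print(args.url, args.save)
```

argparse accepts both the --url=<value> and the --save <value> spellings, so the two styles used in the commands above parse the same way.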

Dependencies

SpeechRecognition
pocketsphinx
ffmpeg
youtube_dl
nltk
matplotlib
wordcloud
pydub
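Before running the script, it can help to check which of these dependencies are available. The sketch below uses only the standard library. The import names are assumptions based on how these packages are usually imported (SpeechRecognition is imported as speech_recognition), and ffmpeg is assumed to refer to the command-line tool on PATH rather than a Python package. The function name missing_dependencies is hypothetical.

```python
import importlib.util
import shutil

# Assumed import names for the Python packages listed above.
MODULES = (
    "speech_recognition",
    "pocketsphinx",
    "youtube_dl",
    "nltk",
    "matplotlib",
    "wordcloud",
    "pydub",
)


def missing_dependencies(modules=MODULES, binaries=("ffmpeg",)):
    """Return the names of modules and executables that cannot be found."""
    # find_spec returns None when a top-level module is not installed.
    missing = [name for name in modules if importlib.util.find_spec(name) is None]
    # shutil.which returns None when an executable is not on PATH.
    missing += [name for name in binaries if shutil.which(name) is None]
    return missing


print(missing_dependencies())
```

An empty list means every listed module and the ffmpeg executable were found.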
