The input file is received in .mp3 format
The file is converted to .wav format
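The conversion step can be sketched with ffmpeg (assuming ffmpeg is available on the system; the file names and the 16 kHz mono output settings are illustrative choices, not taken from the original):

```python
import os
import subprocess

def ffmpeg_cmd(mp3_path, wav_path):
    # Build an ffmpeg command that converts the mp3 to a
    # 16 kHz mono wav, a common input format for speech models.
    return ["ffmpeg", "-y", "-i", mp3_path, "-ar", "16000", "-ac", "1", wav_path]

if __name__ == "__main__":
    # Hypothetical file names; only runs if the input actually exists.
    if os.path.exists("meeting.mp3"):
        subprocess.run(ffmpeg_cmd("meeting.mp3", "meeting.wav"), check=True)
```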
An .rttm file containing the timestamps of each speaker's turns is generated using the 'pyannote/[email protected]' pipeline from Hugging Face
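A minimal sketch of the diarization step, assuming pyannote.audio is installed and a Hugging Face access token is available (the token placeholder and file names are assumptions, not from the original):

```python
import os

def turns_to_rows(turns):
    # Flatten (segment, speaker) pairs into (start, end, speaker) rows,
    # rounding the timestamps to milliseconds.
    return [(round(seg.start, 3), round(seg.end, 3), spk) for seg, spk in turns]

if __name__ == "__main__":
    try:
        from pyannote.audio import Pipeline
    except ImportError:
        Pipeline = None
    if Pipeline is not None and os.path.exists("meeting.wav"):
        pipeline = Pipeline.from_pretrained(
            "pyannote/[email protected]", use_auth_token="YOUR_HF_TOKEN"
        )
        diarization = pipeline("meeting.wav")
        # write_rttm serialises the annotation in standard RTTM format
        with open("meeting.rttm", "w") as f:
            diarization.write_rttm(f)
        rows = turns_to_rows(
            (seg, spk) for seg, _, spk in diarization.itertracks(yield_label=True)
        )
```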
This is then converted to CSV format
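The RTTM-to-CSV conversion is plain text parsing; a sketch (column names are illustrative, since the original does not specify the CSV layout):

```python
import csv

def rttm_to_rows(rttm_lines):
    # Each RTTM SPEAKER line has the form:
    # SPEAKER <file> <chan> <start> <duration> <NA> <NA> <speaker> <NA> <NA>
    rows = []
    for line in rttm_lines:
        fields = line.split()
        if not fields or fields[0] != "SPEAKER":
            continue
        start = float(fields[3])
        duration = float(fields[4])
        rows.append({"start": start, "end": start + duration, "speaker": fields[7]})
    return rows

def write_csv(rows, csv_path):
    # Persist the parsed rows with a header for the next step.
    with open(csv_path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["start", "end", "speaker"])
        writer.writeheader()
        writer.writerows(rows)
```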
Separate audio files are then generated for those timestamps
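Slicing the audio at those timestamps can be sketched with pydub (an assumed library choice; the original does not say how the clips are cut, and the file-naming scheme here is hypothetical):

```python
def to_ms(start_s, end_s):
    # pydub slices AudioSegment objects by millisecond index.
    return int(start_s * 1000), int(end_s * 1000)

if __name__ == "__main__":
    import os
    if os.path.exists("meeting.wav"):
        from pydub import AudioSegment
        audio = AudioSegment.from_wav("meeting.wav")
        # Placeholder rows, as produced from the CSV in the previous step.
        rows = [{"start": 0.5, "end": 2.75, "speaker": "SPEAKER_00"}]
        for i, row in enumerate(rows):
            a, b = to_ms(row["start"], row["end"])
            audio[a:b].export(f"{row['speaker']}_{i}.wav", format="wav")
```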
The text is extracted from each audio file, speaker-wise
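The original does not name the speech-to-text model, so Whisper below is purely an illustrative stand-in; the segment file pattern is also hypothetical. The speaker-wise grouping itself is simple bookkeeping:

```python
def merge_by_speaker(segment_texts):
    # Combine (speaker, text) pairs into one transcript per speaker,
    # preserving the order in which segments were transcribed.
    merged = {}
    for speaker, text in segment_texts:
        merged.setdefault(speaker, []).append(text.strip())
    return {spk: " ".join(parts) for spk, parts in merged.items()}

if __name__ == "__main__":
    import glob, os
    try:
        import whisper  # assumption: openai-whisper; the original names no ASR
    except ImportError:
        whisper = None
    if whisper is not None and glob.glob("SPEAKER_*.wav"):
        model = whisper.load_model("base")
        pairs = []
        for path in sorted(glob.glob("SPEAKER_*.wav")):
            speaker = os.path.basename(path).rsplit("_", 1)[0]
            pairs.append((speaker, model.transcribe(path)["text"]))
        by_speaker = merge_by_speaker(pairs)
```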
The overall summary of the file is then obtained using Hugging Face's "knkarthick/MEETING_SUMMARY" model
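The summarization step can be sketched with the transformers pipeline API; the word-based chunking below is an assumed workaround for the model's input-length limit, and the transcript file name is a placeholder:

```python
def chunk_words(text, max_words=400):
    # BART-based summarizers accept a limited input length, so long
    # transcripts are split into fixed-size word chunks first.
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]

if __name__ == "__main__":
    import os
    try:
        from transformers import pipeline
    except ImportError:
        pipeline = None
    if pipeline is not None and os.path.exists("transcript.txt"):
        summarizer = pipeline("summarization", model="knkarthick/MEETING_SUMMARY")
        with open("transcript.txt") as f:
            transcript = f.read()
        parts = [summarizer(c)[0]["summary_text"] for c in chunk_words(transcript)]
        summary = " ".join(parts)
```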
The code is easy to run in Google Colab: simply upload the .mp3 file and run all the cells
On CPU, it took 45 minutes to execute the entire pipeline for the uploaded audio
On GPU, it took less than a minute