PDF to Podcast Using Gemini and Elevenlabs

This project provides a tool to convert any PDF document into a engaging podcast! Using Google's Gemini for dialogue generation and Elevenlabs for text-to-speech.

App Screenshot

Here's what the app looks like:

Features

Understand PDFs: Understand PDFs rather than extract text. (It's can understand figure, table, images....)
Dialogue Generation: Uses Gemini to generate conversational dialogues based on the input PDF content.
Text-to-Speech: Converts the generated dialogues into audio using Elevenlabs' text-to-speech service.
Streamlit UI: Provides an easy-to-use interface to upload PDFs and generate podcasts.

Installation

Clone the Repository
Clone the project to your local machine:

git clone https://github.com/chiragjoshi12/pdf-to-podcast.git
cd pdf-to-podcast

Install Dependencies
Install the required Python dependencies:
```
pip install -r requirements.txt
```

Set Up API Keys
Create a .env file and add your Gemini and Elevenlabs API keys:

GEMINI_API_KEY="YOUR_GEMINI_API_KEY"
ELEVENLABS_API_KEY="YOUR_ELEVENLABS_API_KEY"

Run the Application
Start the application with Streamlit:
```
streamlit run main.py
```
The app will be available in your browser for use.

Usage

Upload a PDF file to the app.
Set your podcast prompt and click "Generate Podcast."
The app will generate the dialogue and convert it into an audio file, which you can listen to.

Blog Post

For detailed instructions and insights, check out the blog post:
NotebookLM with Gemini and Elevenlabs (Detailed Documentation)

Readme made with 💖 using README Generator by Chirag Joshi

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
img		img
services		services
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PDF to Podcast Using Gemini and Elevenlabs

App Screenshot

Features

Installation

Usage

Blog Post

About

Uh oh!

Releases

Packages

Languages

chiragjoshi12/pdf-to-podcast

Folders and files

Latest commit

History

Repository files navigation

PDF to Podcast Using Gemini and Elevenlabs

App Screenshot

Features

Installation

Usage

Blog Post

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages