Podnexus - An Interactive Podcast Application

Author: Suryateja Duvvuri

Motivation

Listening to a podcast is like listening to a 90 min lecture where the content is only delivered by the host or the professor themselves. This is one way of delivering content but it does not always engage the audience to interact with the content especially in a podcast where audience cannot directly interact with the host. With the power of AI, we can make the podcast interactive which improves audience engagement with the content by asking questions and providing feedback in which the AI can give responses in real time. This application also allows the user to explore a topic in context as much as they want just like in any conversational interaction.

General Description:

The application offers a simple, intuitive user interface that prompts the user to insert the YouTube podcast link. The backend processes the link and converts it to an audio file that the user can play. Meanwhile, it uses speech to text transcription as well as Ollama to customize the AI to our needs by giving the content of the podcast as if they were the host. Once the audio player shows up, the user can ask questions by clicking the "Start Recording" button. Once they're done asking their question, it will be prompted to the AI where the AI will produce a response and be delivered through text to speech using ElevenLabs API.

Demo

Languages/Tools/Technologies used:

Frontend: React.js, Tailwind CSS, Radix UI(for customized components) Backend: Spring Boot, Spring MVC, Spring Web, Spring AI(Ollama LLM), Local Whisper.cpp(For Speech to Text), ElevenLabs API(For text to speech) API: Youtube Data API(For extracting audio from a link), REST API for communicating between frontend and backend

The features in this project would be:

Youtube Podcast Integration

Users can give a Youtube Link which is then converted into an audio file locally.

Real-time AI-Interaction

Users can get AI-generated responses based on the context of the podcast

Speech to Text and Text to Speech

Speech to Text converts user's voice or podcast's audio to text using Whisper Text To Speech converts AI's textual response into voice using ElevenLabs

Screenshots

Installation and Usage

Prerequisites

Before you can start running this project, make sure you have the following tools installed:

Java 11 or higher (for backend)
Node.js/NPM and React.js (for frontend development)
Python (for diarization and speech-to-text)

1. Clone the Repository

First, clone the project repository to your local machine:

git clone https://github.com/SuryatejaDuvvuri/podnexus.git

2. Backend Setup (Spring Boot)

Run the following in separate terminal.
cd PodnexusBackend
Do mvn clean install and mvn spring-boot::run

3. Frontend Setup

cd podnexus npm install npm run start

4. Ollama Setup

Refer to Ollama Documentation on how to install Ollama and run Llama 3.2: https://github.com/ollama/ollama

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.vscode		.vscode
PodnexusBackend		PodnexusBackend
public		public
src		src
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Podnexus - An Interactive Podcast Application

Motivation

General Description:

Demo

Languages/Tools/Technologies used:

The features in this project would be:

Youtube Podcast Integration

Real-time AI-Interaction

Speech to Text and Text to Speech

Screenshots

Installation and Usage

Prerequisites

1. Clone the Repository

2. Backend Setup (Spring Boot)

3. Frontend Setup

4. Ollama Setup

About

Uh oh!

Releases

Packages

Languages

SuryatejaDuvvuri/PodNexus

Folders and files

Latest commit

History

Repository files navigation

Podnexus - An Interactive Podcast Application

Motivation

General Description:

Demo

Languages/Tools/Technologies used:

The features in this project would be:

Youtube Podcast Integration

Real-time AI-Interaction

Speech to Text and Text to Speech

Screenshots

Installation and Usage

Prerequisites

1. Clone the Repository

2. Backend Setup (Spring Boot)

3. Frontend Setup

4. Ollama Setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages