Voice AI HR Agent

An automated voice-based AI system for conducting initial candidate screening interviews. The system uses natural language processing and speech recognition to evaluate candidates' responses and provide structured feedback.

Features

Automated Phone Interviews: Conducts phone interviews using AI-generated voice
Real-time Transcription: Converts candidate responses to text using OpenAI Whisper
Sentiment Analysis: Evaluates candidate's tone and sentiment
Keyword Extraction: Identifies key skills and qualifications
Decision Engine: Provides structured recommendations based on responses

Technology Stack

Backend Framework: FastAPI
Voice Calls: Twilio
Speech-to-Text: OpenAI Whisper
Text-to-Speech: Google Cloud TTS
NLP: HuggingFace Transformers, spaCy
Sentiment Analysis: VADER

Prerequisites

Python 3.8 or higher
Twilio Account (for voice calls)
Google Cloud Account (for Text-to-Speech)
OpenAI API key (for Whisper, optional if using local model)

Installation

Clone the repository:

git clone <repository-url>
cd voice-ai-hr-agent

Create and activate a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: .\venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Install spaCy English model:

python -m spacy download en_core_web_sm

Configuration

Create a .env file in the project root with the following variables:

# Twilio Configuration
TWILIO_ACCOUNT_SID=your_account_sid
TWILIO_AUTH_TOKEN=your_auth_token
TWILIO_PHONE_NUMBER=your_twilio_phone_number

# Google Cloud Configuration
GOOGLE_CLOUD_PROJECT_ID=your_project_id
GOOGLE_APPLICATION_CREDENTIALS=path_to_credentials.json

# Application Configuration
APP_HOST=0.0.0.0
APP_PORT=8000
DEBUG=True
BASE_URL=http://localhost:8000

Running the Application

Start the FastAPI server:

uvicorn main:app --reload --host 0.0.0.0 --port 8000

The API will be available at http://localhost:8000

API Endpoints

GET /: Health check endpoint
POST /initiate-call: Start a new interview call
POST /process-response: Process candidate's audio response
GET /health: Application health check

Usage Example

Initiate a call:

import requests

response = requests.post(
    'http://localhost:8000/initiate-call',
    json={'phone_number': '+1234567890'}
)
print(response.json())

The system will:
- Call the candidate
- Ask screening questions
- Process responses in real-time
- Generate a structured evaluation

Sample Output

{
  "candidate_name": "John Doe",
  "skills": ["Python", "React", "AWS"],
  "experience": "5 years",
  "location": "New York",
  "sentiment": "Positive",
  "decision": "Recommend",
  "reason": "Strong candidate with good skills and positive interaction"
}

Testing

The system includes mock functions for testing without external API dependencies:

# Mock a call without Twilio
call_handler = CallHandler()
result = call_handler.mock_call()

# Mock transcription without Whisper
stt = SpeechToText()
result = stt.mock_transcribe("Sample response")

Error Handling

The system includes comprehensive error handling for:

Failed API calls
Audio processing issues
NLP analysis errors
Decision engine failures

Contributing

Fork the repository
Create a feature branch
Commit your changes
Push to the branch
Create a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.gitignore		.gitignore
README.md		README.md
call_handler.py		call_handler.py
config.py		config.py
decision_engine.py		decision_engine.py
jhb.py		jhb.py
main.py		main.py
mock_services.py		mock_services.py
nlp_analysis.py		nlp_analysis.py
requirements.txt		requirements.txt
speech_to_text.py		speech_to_text.py
text_to_speech.py		text_to_speech.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Voice AI HR Agent

Features

Technology Stack

Prerequisites

Installation

Configuration

Running the Application

API Endpoints

Usage Example

Sample Output

Testing

Error Handling

Contributing

License

About

Uh oh!

Releases

Packages

Languages

rishabhsingroha/automated_voice_based_AI

Folders and files

Latest commit

History

Repository files navigation

Voice AI HR Agent

Features

Technology Stack

Prerequisites

Installation

Configuration

Running the Application

API Endpoints

Usage Example

Sample Output

Testing

Error Handling

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages