We are building this to help people with disabilities navigate their surroundings, using face recognition, sign language gesture identification, and object detection when no faces or gestures are present.
On top of these capabilities sits a semantic router that decides which route to trigger. A Raspberry Pi with a built-in webcam runs a client that sends the captured image and audio frame to the semantic router, which then forwards the request to the appropriate API.
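A minimal sketch of what the Pi-side client could look like, assuming the router exposes an HTTP endpoint that accepts a JPEG frame (the URL, field name, and response shape below are illustrative, not taken from the repos):

```python
import cv2        # used here to grab frames from the Pi's webcam
import requests

ROUTER_URL = "http://<router-host>:8000/route"  # hypothetical router endpoint

def capture_frame() -> bytes:
    """Grab a single frame from the default camera and encode it as JPEG."""
    cap = cv2.VideoCapture(0)
    ok, frame = cap.read()
    cap.release()
    if not ok:
        raise RuntimeError("Could not read from webcam")
    ok, jpeg = cv2.imencode(".jpg", frame)
    return jpeg.tobytes()

def send_to_router(jpeg_bytes: bytes) -> dict:
    """POST the frame to the semantic router and return its JSON decision."""
    resp = requests.post(
        ROUTER_URL,
        files={"frame": ("frame.jpg", jpeg_bytes, "image/jpeg")},
        timeout=10,
    )
    resp.raise_for_status()
    return resp.json()

if __name__ == "__main__":
    result = send_to_router(capture_frame())
    print(result)  # e.g. {"route": "face_recognition", "output": "..."}
```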
The three APIs are deployed on three VMs provisioned in DigitalOcean. We also created a DigitalOcean Space that holds the hand gesture alphabet dataset, which was trained in the Gesture AI playground. We used a transfer learning strategy: a ResNet model pre-trained on ImageNet was fine-tuned on the dataset stored in the Space. After training, the model was dockerized and deployed behind an endpoint that consumes an image frame as the payload and returns the predicted gesture.
We deliberately take a hybrid approach, using pre-trained models where they fit and running our own fine-tuned model where the use case requires it.
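As a rough illustration of the transfer learning step (the ResNet-50 variant, dataset path, class count, and hyperparameters below are assumptions, not values from the actual training run):

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader
from torchvision import datasets, models, transforms

NUM_CLASSES = 26  # assumption: one class per sign-language alphabet letter

# Start from a ResNet pre-trained on ImageNet, freeze the backbone,
# and replace the classifier head with one sized for the gesture classes.
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
for param in model.parameters():
    param.requires_grad = False
model.fc = nn.Linear(model.fc.in_features, NUM_CLASSES)  # new head is trainable

# Hypothetical local copy of the gesture dataset pulled from the Space
transform = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])
train_ds = datasets.ImageFolder("data/gesture_alphabets/train", transform=transform)
train_dl = DataLoader(train_ds, batch_size=32, shuffle=True)

optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

model.train()
for epoch in range(5):  # small epoch count, purely illustrative
    for images, labels in train_dl:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
```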
The three APIs and the semantic router live in the following repos:
- Sign Language Detector - https://github.com/team-hopkins/sign_language_detection
- Face Recognition and TTS using ElevenLabs - https://github.com/team-hopkins/face_recognition
- Object Detection - https://github.com/team-hopkins/percepteye
- Semantic Router - https://github.com/team-hopkins/percepteye
We have also leveraged ElevenLabs for text-to-speech in the face recognition flow, which helps blind users tell known and unknown people apart by voice.
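A minimal sketch of the TTS step using the ElevenLabs REST text-to-speech endpoint; the API key, voice ID, and output path are placeholders:

```python
import requests

ELEVENLABS_API_KEY = "your-api-key"  # placeholder
VOICE_ID = "your-voice-id"           # placeholder voice ID

def speak(text: str, out_path: str = "announcement.mp3") -> str:
    """Turn a face-recognition result (e.g. 'Alice is in front of you')
    into speech via the ElevenLabs text-to-speech REST endpoint."""
    url = f"https://api.elevenlabs.io/v1/text-to-speech/{VOICE_ID}"
    headers = {
        "xi-api-key": ELEVENLABS_API_KEY,
        "Content-Type": "application/json",
    }
    resp = requests.post(url, headers=headers, json={"text": text}, timeout=30)
    resp.raise_for_status()
    with open(out_path, "wb") as f:
        f.write(resp.content)  # response body is the generated audio
    return out_path

# Example: announce a recognized person
# speak("Alice is in front of you")
```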
The semantic router supports three types of routes:
- Face Recognition + TTS API: Identifies faces and provides audio descriptions
- Sign Language Detection API: Recognizes hand gestures and interprets sign language
- Scene Description: When no faces or hand gestures are detected, uses Gemini 2.5 Flash to describe nearby objects, helping blind users understand their surroundings
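A simplified sketch of how this routing decision could be wired; the endpoint URLs, helper names, and the Haar-cascade face check are illustrative assumptions, and the actual logic lives in the percepteye repo:

```python
import cv2
import numpy as np
import requests

# Hypothetical endpoints of the three deployed services
FACE_API = "http://<face-vm>:8000/recognize"   # face recognition + ElevenLabs TTS
SIGN_API = "http://<sign-vm>:8000/predict"     # sign language detection
SCENE_API = "http://<scene-vm>:8000/describe"  # Gemini 2.5 Flash scene description

_face_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
)

def has_face(jpeg_bytes: bytes) -> bool:
    """Lightweight face presence check (illustrative Haar-cascade choice)."""
    gray = cv2.imdecode(np.frombuffer(jpeg_bytes, np.uint8), cv2.IMREAD_GRAYSCALE)
    faces = _face_cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
    return len(faces) > 0

def has_hand_gesture(jpeg_bytes: bytes) -> bool:
    """Placeholder: substitute the project's actual gesture presence check."""
    return False

def route_frame(jpeg_bytes: bytes) -> dict:
    """Pick one of the three routes based on what is visible in the frame."""
    if has_face(jpeg_bytes):
        url = FACE_API
    elif has_hand_gesture(jpeg_bytes):
        url = SIGN_API
    else:
        url = SCENE_API
    resp = requests.post(
        url,
        files={"frame": ("frame.jpg", jpeg_bytes, "image/jpeg")},
        timeout=30,
    )
    resp.raise_for_status()
    return resp.json()
```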