Smartphones have opened the door to a world of possibilities far beyond what we could have imagined a few years ago. They offer accessibility innovations that bring real functionality to people who are visually impaired: with the help of a smartphone, they can recognize their surroundings and become more independent.
Vision uses the smartphone's camera to continuously capture the scene and recognize the objects and people currently in view.
We use an SSD MobileNet v2 model trained on the Microsoft COCO dataset to recognize objects in the smartphone's live camera feed, and the Azure Face API to give details about a person on request; both steps are sketched below.
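Here is a minimal sketch of how the on-device detection step could be wired up with the TensorFlow Lite Java API, assuming a standard SSD MobileNet v2 `.tflite` model with the usual four-output detection layout; the model path, input size, labels array, and score threshold are all placeholders, not the app's actual values:

```java
import org.tensorflow.lite.Interpreter;

import java.io.FileInputStream;
import java.io.IOException;
import java.nio.MappedByteBuffer;
import java.nio.channels.FileChannel;
import java.util.HashMap;
import java.util.Map;

public class ObjectDetector {
    private static final int INPUT_SIZE = 300;      // SSD MobileNet v2 commonly takes 300x300 RGB
    private static final int MAX_DETECTIONS = 10;   // fixed by the model's postprocess op
    private static final float SCORE_THRESHOLD = 0.5f; // assumed cutoff for "relevant" objects

    private final Interpreter interpreter;

    public ObjectDetector(String modelPath) throws IOException {
        // Memory-map the .tflite file so the model is not copied onto the Java heap.
        try (FileInputStream fis = new FileInputStream(modelPath);
             FileChannel channel = fis.getChannel()) {
            MappedByteBuffer model =
                channel.map(FileChannel.MapMode.READ_ONLY, 0, channel.size());
            interpreter = new Interpreter(model);
        }
    }

    /** Runs detection on one frame; pixels must be normalized RGB of shape [1][300][300][3]. */
    public void detect(float[][][][] pixels, String[] cocoLabels) {
        // Standard output layout of TFLite detection models:
        float[][][] boxes = new float[1][MAX_DETECTIONS][4]; // [ymin, xmin, ymax, xmax]
        float[][] classes = new float[1][MAX_DETECTIONS];    // COCO class indices
        float[][] scores = new float[1][MAX_DETECTIONS];     // confidence per detection
        float[] count = new float[1];                        // number of valid detections

        Map<Integer, Object> outputs = new HashMap<>();
        outputs.put(0, boxes);
        outputs.put(1, classes);
        outputs.put(2, scores);
        outputs.put(3, count);

        interpreter.runForMultipleInputsOutputs(new Object[] {pixels}, outputs);

        for (int i = 0; i < (int) count[0]; i++) {
            if (scores[0][i] >= SCORE_THRESHOLD) {
                String label = cocoLabels[(int) classes[0][i]];
                // Hand the label to the app's speech layer (not shown here).
                System.out.println(label + " (" + scores[0][i] + ")");
            }
        }
    }
}
```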
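The person-description step can be a single REST call to the Face API's `detect` endpoint. The following is a sketch rather than the app's actual code: the endpoint and key are placeholders, and which `returnFaceAttributes` values are available depends on the Face API version offered in your region.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;

public class FaceClient {
    // Both values are placeholders: use your own Azure resource endpoint and key.
    private static final String ENDPOINT = "https://YOUR_REGION.api.cognitive.microsoft.com";
    private static final String KEY = "YOUR_FACE_API_KEY";

    /** Sends one JPEG frame to the Face API detect endpoint and returns the raw JSON. */
    public static String describeFaces(byte[] jpegBytes) throws IOException {
        URL url = new URL(ENDPOINT + "/face/v1.0/detect?returnFaceAttributes=age,gender");
        HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        conn.setRequestMethod("POST");
        conn.setRequestProperty("Ocp-Apim-Subscription-Key", KEY);
        conn.setRequestProperty("Content-Type", "application/octet-stream");
        conn.setDoOutput(true);

        try (OutputStream out = conn.getOutputStream()) {
            out.write(jpegBytes); // raw image bytes, no JSON wrapper needed
        }

        // The response is a JSON array: one object per detected face, each
        // carrying a faceRectangle and the requested faceAttributes.
        try (InputStream in = conn.getInputStream()) {
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            for (int n; (n = in.read(chunk)) != -1; ) buf.write(chunk, 0, n);
            return buf.toString("UTF-8");
        }
    }
}
```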
Challenges we ran into:
- Avoiding the detection of objects that are too far away to be relevant to the user.
- Stairs and walls could not be detected with the phone's camera alone.
What we learned:
- The Azure Face API
- TensorFlow
- Integration with Android
What's next for Vision:
- Store the faces of recognized people, so that the user can be notified when the app "sees" a person it already "knows" (see the sketch after this list).
- Assist the user with traffic signals.
- Assist the user in navigating indoors more accurately (detecting object distances, stairs, and walls).
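One possible way the face-memory idea could map onto the Face API is its PersonGroup workflow: enroll each known person once, train the group, then match live `faceIds` against it. This is only a sketch of the planned feature, not implemented code; the endpoint, key, and group id are hypothetical.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.HttpURLConnection;
import java.net.URL;
import java.nio.charset.StandardCharsets;

public class FaceMemory {
    private static final String ENDPOINT = "https://YOUR_REGION.api.cognitive.microsoft.com";
    private static final String KEY = "YOUR_FACE_API_KEY";
    private static final String GROUP = "known-people"; // hypothetical person-group id

    // One-time enrollment per person (Face API v1.0 REST endpoints):
    //   PUT  /face/v1.0/persongroups/{GROUP}                                  create the group
    //   POST /face/v1.0/persongroups/{GROUP}/persons                          returns a personId
    //   POST /face/v1.0/persongroups/{GROUP}/persons/{personId}/persistedFaces  enroll a photo
    //   POST /face/v1.0/persongroups/{GROUP}/train                            build the match index

    /** At runtime: a detect call yields faceIds; identify matches them against the group. */
    public static String identify(String faceId) throws IOException {
        String body = "{\"personGroupId\":\"" + GROUP + "\",\"faceIds\":[\"" + faceId + "\"]}";
        return postJson("/face/v1.0/identify", body);
    }

    private static String postJson(String path, String json) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) new URL(ENDPOINT + path).openConnection();
        conn.setRequestMethod("POST");
        conn.setRequestProperty("Ocp-Apim-Subscription-Key", KEY);
        conn.setRequestProperty("Content-Type", "application/json");
        conn.setDoOutput(true);
        try (OutputStream out = conn.getOutputStream()) {
            out.write(json.getBytes(StandardCharsets.UTF_8));
        }
        try (InputStream in = conn.getInputStream()) {
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            for (int n; (n = in.read(chunk)) != -1; ) buf.write(chunk, 0, n);
            // JSON array of candidates: personId plus a confidence score per faceId.
            return buf.toString("UTF-8");
        }
    }
}
```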
Built with:
- azure
- google-web-speech-api
- tensorflow
- android