- Aniket Ghumed
Email : aghumed@uncc.edu - Rimjhim Jain
Email : rjain19@uncc.edu - Sudhanshu Dalvi
Email : sdalvi2@uncc.edu - Pradip Nemane
Email : pnemane@uncc.edu
The goal of this project is to create a web application and an android application that uses the Mediapipe Holistic Model and WebRTC (Web Real-Time Communication) technology to perform pose estimation and action recognition in real time, which includes body, hand, and face pose estimation. Users of the proposed web application will be able to upload images, record video or use webcam through their web browser and perform pose estimation using the Mediapipe Model, which can quickly and accurately identify poses and actions in an image or video stream.
ML Model : https://github.com/google/mediapipe
Mediapipe can provide live perception of simultaneous human pose, face landmarks, and hand tracking with accuracy and speed. WebRTC is an open-source project that gives web browsers and mobile applications access to real-time communication features like peer-to-peer dialogue, audio and video streaming, and data transfer. Real-time apps benefit greatly from the ability of WebRTC to interact directly across web browsers without the use of a third-party server.
The scope of this project is to create a web-based application and also an android application that will use Mediapipe Holistic Model for pose estimation and WebRTC technology for real-time communication. The application will consist of two main components:
Users will be able to record video or use live webcam using this component and their web browser's camera. The pose estimation component will receive a real-time stream of the webcam or the recorded video.
This component will use Mediapipe Holistic Model to detect pose estimation in the captured video stream. The detected poses will be highlighted in real-time and the results will be displayed to the user.
Develop a web-based application and an android app that can perform pose estimation using Mediapipe Holistic Model and WebRTC technology. Provide users with an interactive and user-friendly interface that allows them to capture video in real-time through their web browser and perform pose estimation. Deploy Mediapipe Model for pose estimation and integrate it with WebRTC technology. Display the results of pose estimation in real-time to the user.
This project will be based on client-server architecture

Image source : https://google.github.io/mediapipe/solutions/holistic.html
| Duration | Goal |
|---|---|
| Week 1 and 2 |
|
| Week 3 and 4 |
|
| Week 5 and 6 |
|
| Week 7 and 8 |
|
The main project will be web-based. We will try to implement the project on android, but the android project is optional. *
mediapipe==0.9.1.0
streamlit==1.21.0
streamlit-webrtc==0.45.0
numpy==1.24.2
opencv-python-headless==4.7.0.72
By default the streamlit server will run on http and if we want to run to https then we should add the ssl certificates to config.toml file in ~/.streamlit/ folder.
add following configuration in ~/.streamlit/config.toml for https communication.
[server]
sslCertFile = path-to-pem-file
sslKeyFile = path-to-key-file
- Clone this repository
- Run the command
streamlit run main.py
