Documentation

Important

Some information below are incorrect. This documentation will need to be updated.

Introduction to the Project

Welcome to the CodeNetwork 2025 Semester 1 gesture recognition program. Here is all the necessary documentation you will need to operate and understand the gesture recognition program.

This project focuses on detecting distinct live hand gestures. A Machine Learning approach is used to distinguish gestures.

Data Collection

Step 1

Get a camera (any camera)

Step 2

Start taking photos of your gesture
Make sure to grab different angles (move the camera around and move a hand around)
Try to grab different lightings (time of day, different directional lighting)
Try different backgrounds
Try different distances from the camera
If you can try to capture other people, different skin tones, small hands, big hands, etc.

Step 3

Upload images to the Hugging face repository
Depending on the gestures you captured add to the respective folder

Data Annotation

Step 1

Clone the HuggingFace repository using the following command:

git clone https://huggingface.co/datasets/CNGR/CN_Gesture_Recognition --depth=1

This will download all the pictures from our dataset, organised into their respective folders.
It would be wise to just select one gesture (folder) to work on at a time

Step 2

Go to cvat.ai and log in with your GitHub account (or another way, but GitHub will be convenient since we've already been using it a lot).
On the landing page, select Projects, and press the blue plus button, then create new project.
Name the project after the gesture you picked in step 1.
Select 'add label' and give it an appropriate name (of your gesture) and press continue
Submit and open
On the next screen, press the blue button and this will create a new task
Name the task after the gesture you picked in step 1 and add _train to the end
For subset, select 'Train'
Now follow the prompt to upload the files and upload the photos associated with your gesture, however, do NOT upload all. Leave the last three images in your folder unuploaded to be used for validation.
Select submit and continue
After uploading is complete, go back to projects, and double click on your project. Press the blue button again and add another task.
Name the task after the gesture you picked in step 1 and add _val to the end
For subset, select 'Validation'
Upload the remaining files you have not yet uploaded.
Press submit and continue

Step 3

In the navbar up the top, select 'jobs' then select your job (please note that you'll have to do step 3 twice, one for test, another for val)
You should see one of the pictures of your gesture. On the sidebar to the left, select the rectangle icon and make sure the label is set to the one you created.
Draw a new rectangle that highlights the section of the image that your gesture is in. (Note that you can press 'N' to start and stop the rectangle drawing process, and 'F' to progress to the next image. Utilising this will drastically improve the speed at which you annotate

Step 4

Return to Projects on the navbar
Press on the three dots on your project and select Export dataset
Select export format as Ultralytics YOLO Detection 1.0 and export.
Once complete
Exported dataset can be found under the Requests navbar, and click on the three dots with the most recent requets, and press download.

File Structure

For the YOLO model, the following file structure is required:

For CNN:

Platform

Libraries

Hardware

This project requires a webcam.

Running the Program

Run the gesture-recog.py file in order to launch the main program.

Main Loop

Continuously capture the camera input. *Flip the input on the vertical axis so the output reflects like a mirror
Read the frames into variables
Loop ends upon user input (video capture stops asw)

Hand Recognition

Using MediaPipe and OpenCV

Using Contour Analysis

Objects to store and update hand data
Convert to greyscale and smoothen to facilitate optimal edge detection (ignore first 30 frames as cameras generally use this time to adjust to shadows and lighting)
Threshold the hand from the background
Use contour analysis to extract the contours, find the largest contour and compute the convex hull

Gesture Recognition

Neural Network

Use the keras convolutional networks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!