# Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation
This repository contains the code for the work "Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation", created for classifying calisthenics skills using raw and depth image patches.
- **Create a conda environment and clone this repository:**

  ```bash
  conda create -n calisthenics_env python=3.12
  conda activate calisthenics_env
  git clone https://github.com/antof27/rgb-based-pose-classification.git
  ```
- **Clone the dependency repositories:** the project leverages Depth Anything V2 and YOLOv10 for efficient depth estimation and person detection, so you need to clone both repositories into `src/`:

  ```bash
  cd src
  git clone https://github.com/DepthAnything/Depth-Anything-V2.git
  git clone https://github.com/THU-MIG/yolov10.git
  cd ..
  ```
- **Install dependencies:** run the following command to install all the dependencies needed for the project:

  ```bash
  pip install -r requirements.txt
  ```
- **Copy the YOLOv10 files:** the `src/yolo_files/` directory contains two files, `bbox_operations.py` and `image_inference.py`, that need to be copied into the main directory of the cloned YOLOv10 repository:

  ```bash
  cp -r src/yolo_files/* src/yolov10/
  ```
- **Download the YOLOv10 pre-trained weights:** for running inference with raw and depth patches, download the YOLOv10 pre-trained weights from the YOLOv10 weights link.
- **Download the CNN weights:** in order to fine-tune or perform inference with the pre-trained weights, you can download them from the following link: EfficientNetv2 weights.
After cloning the repositories and installing the dependencies, the project structure should look like this:
```
src/
├── yolo_files/
├── yolov10/
├── Depth-Anything-V2/
├── inference/
└── training_eval/
data/
├── csv_files/
├── input_inference_images/
└── output_inference_images/
Finocchiaro_ACSCtRDP.pdf
requirements.txt
```
**To train and test on your own dataset:**

- Place your depth images, raw image patches, or depth patches in the `data` directory. You should include two `.csv` files with a `filename` column and a `label` column; see the examples in the `data/csv_files` directory and the minimal layout sketched after this list. If you want to reproduce the same experiments, you can ask me to provide the image dataset.
- Modify the paths and hyperparameters in `src/training_eval/main_script.py` and run the script, specifying some parameters in the code or on the command line, for example:

  ```bash
  python3 model.py --weights pretrained --mode normal --n_gpu 1
  ```

  where `--weights` can be [non-pretrained, pretrained], `--mode` refers to the images and can be [normal, depth], and `--n_gpu` refers to the GPU id (in case of multiple GPUs); a sketch of these flag definitions also follows the list.
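A minimal sketch of the expected `.csv` layout, with placeholder filenames and labels (the real examples are in `data/csv_files`):

```
filename,label
patch_0001.jpg,planche
patch_0002.jpg,front_lever
patch_0003.jpg,back_lever
```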
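The flags above suggest argument definitions along the following lines. This is a hypothetical sketch for orientation only, not the actual parser used by the training script:

```python
import argparse

# Hypothetical reconstruction of the documented command-line flags.
parser = argparse.ArgumentParser(
    description="Train/evaluate the calisthenics skill classifier.")
parser.add_argument("--weights", choices=["non-pretrained", "pretrained"],
                    default="pretrained",
                    help="start from pre-trained weights or train from scratch")
parser.add_argument("--mode", choices=["normal", "depth"], default="normal",
                    help="train on raw (normal) or depth image patches")
parser.add_argument("--n_gpu", type=int, default=0,
                    help="GPU id to use when multiple GPUs are available")
args = parser.parse_args()
```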
**Perform inference:**

- Place your input images in the `data/input_inference_images` directory.
- Edit the script `full_inference.py` in `src/inference` to specify the input images, output images, and weights paths.
- Run inference (a conceptual sketch of the pipeline this script implements follows the list):

  ```bash
  python3 full_inference.py
  ```
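For orientation, below is a conceptual sketch of the detection → depth → classification pipeline the script implements. It is not the repository's actual code: the weight file names, the ViT-S depth configuration, the EfficientNetV2-S backbone, and the class count are all assumptions; only the public APIs of the cloned YOLOv10 and Depth Anything V2 repositories are used.

```python
import cv2
import torch
from torchvision import models, transforms
from ultralytics import YOLOv10                     # from the cloned yolov10 repo
from depth_anything_v2.dpt import DepthAnythingV2   # from the cloned Depth-Anything-V2 repo

device = "cuda" if torch.cuda.is_available() else "cpu"

# 1) Foreground instance selection: detect people with YOLOv10 and keep the
#    most confident detection (COCO class 0 is "person").
detector = YOLOv10("yolov10x.pt")                   # hypothetical weight file
image = cv2.imread("data/input_inference_images/example.jpg")
boxes = detector(image)[0].boxes
person = max((b for b in boxes if int(b.cls) == 0), key=lambda b: float(b.conf))
x1, y1, x2, y2 = map(int, person.xyxy[0].tolist())
raw_patch = image[y1:y2, x1:x2]

# 2) Depth estimation on the cropped patch (ViT-S configuration as an example).
depth_model = DepthAnythingV2(encoder="vits", features=64,
                              out_channels=[48, 96, 192, 384])
depth_model.load_state_dict(
    torch.load("checkpoints/depth_anything_v2_vits.pth", map_location="cpu"))
depth_model = depth_model.to(device).eval()
depth_patch = depth_model.infer_image(raw_patch)    # HxW depth map; in depth mode
                                                    # this would be rendered to an
                                                    # image and classified instead
                                                    # of the raw patch

# 3) Skill classification with an EfficientNetV2 (stand-in architecture,
#    placeholder class count; load the downloaded CNN weights here).
num_classes = 9                                     # placeholder
classifier = models.efficientnet_v2_s()
classifier.classifier[1] = torch.nn.Linear(
    classifier.classifier[1].in_features, num_classes)
classifier.load_state_dict(
    torch.load("efficientnetv2_weights.pth", map_location="cpu"))
classifier = classifier.to(device).eval()

preprocess = transforms.Compose([
    transforms.ToPILImage(),
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
])
rgb_patch = cv2.cvtColor(raw_patch, cv2.COLOR_BGR2RGB)  # OpenCV loads BGR
with torch.no_grad():
    logits = classifier(preprocess(rgb_patch).unsqueeze(0).to(device))
print("predicted class id:", logits.argmax(dim=1).item())
```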