JaxFlax-PointNet

JaxFlax-PointNet is an implementation of the PointNet architecture using JAX and Flax. This repository aims to provide a modern and efficient version of PointNet for researchers, students, and developers working on 3D point cloud data.

Features

PointNet Architecture: A deep learning model designed for processing 3D point cloud data.
JAX and Flax: Leverages the power of JAX for high-performance numerical computing and Flax for flexible neural network modeling.
Educational Resource: Ideal for university projects, research, or learning about 3D deep learning.

The architecture is composed by a first block INPUT TRANSFORM NET whose job is to align all input points clouds to an universal coordinates system, then, after some convolution layers, a second block, FEATURE TRANSFORM NET, align the features of our processed point clouds, and finally we MaxPool and then connect everything to a MLP for the classification part.

You may find all specs inside the models.py file, but for semplicity here there are some flowcharts of what we did:

Installation

Clone the repository :

git clone https://github.com/TheGlitch2701/JaxFlax-PointNet.git
cd JaxFlax-PointNet/

Anaconda Installation :

To run this first part of the project you need to install Anaconda following the instructions one can find on the Anaconda Official Site.

Moreover, be sure to set conda-forge as main channel for Anaconda.
Conda Environment Setup :

If you DON'T HAVE a NVIDIA GPU, please refer to Part 3.1 to prepare a CPU ONLY environment (notice that this will heavily affect the total training time).

3.1 CPU Only Environment Installation (Not Recommended) :
```
 conda env create -f cpu_environment.yml
 conda activate cpu_env
```
3.2 NVIDIA GPU Environment Installation (Recommended):
```
 conda env create -f gpu_environment.yml
 conda activate gpu_env2
```
Consider that I ran the code with the following specs:
- Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz
- 16.0 GB RAM
- NVIDIA GeForce GTX 1050 Ti
- NVIDIA-SMI 576.02
- Driver Version: 576.02
- CUDA Version: 12.9
- Ubuntu Version 22.04 installed with WSL2
hence, be careful, since the gpu_environment.yml works fine for my setup, but may not work at all (especially for the Jax library) if you have a different GPU setup.

In any case always refers to Jax Installation Guide to install the correct version for your machine setup. Moreover, inside the requirements.txt file you may find the version of the library I used so that you can install your own environment and not directly from the gpu_environment.yml file.

If you obtain the following error:
```
OSError: libGL.so.1: cannot open shared object file: No such file or directory
```
run the following script:
```
sudo apt update
sudo apt install libgl1
```

Usage

Training Phase:
```
cd src/
mkdir data
sh training.sh
```
Inside the training.sh file, one can find the full signature to run our presented simulation (note that since the algorithm is stochastic, the result might be different but the trend should be the same).

IMPORTANT

This error might occur if the ShapeNet website is down. This error might occur if the ShapeNet website is down.

When running sh training.sh, if the following error shows up:

--  https://shapenet.cs.stanford.edu/media/modelnet40_ply_hdf5_2048.zip
Resolving shapenet.cs.stanford.edu (shapenet.cs.stanford.edu)... 171.67.77.19
Connecting to shapenet.cs.stanford.edu (shapenet.cs.stanford.edu)|171.67.77.19|:443... failed: Connection refused.

Please download the data manually from the following link:

https://polimi365-my.sharepoint.com/:f:/g/personal/10787939_polimi_it/EiEMCNq6_udCtkbJ9lrBAFUBEHESb8Nu1LJU-BwDmLLrDw

and put it inside the following directory after you have unzip it:

$$ \text{src/data/} $$

In particular, if one want to customize some settings, here we listed all parameters that can be changed:

log_dir : the directory where training files and figures are saved
model : the selected model to use (can be either pointnet_basic or pointnet)
num_point : the number of point to consider of each point cloud can be either one of the following:

$$ \mathbf{n} \in [ \mathbf{256,512,1024,2048} ] $$

max_epoch : maximum number of epoch before the training ends:

$$ Default = \mathbf{250} $$

batch_size : maximum number of point clouds inside each batch:

$$ Default = \mathbf{32} $$

learning_rate : value of the learning rate for the optimizer used for the training of the Neural Network:

$$ Default = \mathbf{0.001} $$

momentum : value for the momentum of the Convolutional layers inside the Neural Network:

$$ Default = \mathbf{0.9} $$

optimizer : optimizer used for the training of the Neural Network; it can be either 'adam' or 'momentum'
decay_step : number of step before having a decay inside the exponential decay scheduler:

$$ Default = \mathbf{200000} $$

decay_rate : decay rate used by the exponential decay scheduler:

$$ Default = \mathbf{0.7} $$

At the end of the Training Loop, you'll find inside log_dir:

directory called as a number which contains the checkpoint of the Neural Network,
a log_train.txt file, in which you can see some info about the training process
a Loss & Accuracy picture where both the Loss (CrossEntropy) and the Accuracy of the Training and Validation Dataset are shown.

Evaluation Phase:

python evaluate_jax.py --log_dir=log_prova --model=pointnet --num_point=1024 --batch_size=4 --dump_dir=dump_prova --optimizer=adam --checkpoint=best_checkpoint_from_training

At the end of this phase, you'll find inside the directory dump_dir the following files:

log_evaluate.txt, in which the accuracy of each class is listed combined with the eval mean loss, the eval accuracy among all object and the eval accouracy among all classes
Flax_Accuracy.tar, a file containing the accuracies for further comparisons.

Comparing Phase:

python compare.py --dump_flax=dump_prova --dump_tf=../old_repo_result/dump_tf_250_1024

In the end you'll find inside the dump_dir (same directory as for the previous point), a plot with the Jax&Flax Accuracies on the x-axis and the Tensorflow Version Accuracies on the y-axis.

This is usefull to see the comparison between the Tensorflow approach with our Jax&Flax one.

Note that the times for training is really low for our approach with respect to the Tensorflow one.

Results

This is the result that we achieve:

Maximum Training Accuracy : 89.016 %
Respective CrossEntropy Loss : 0.218748
Maximum Validation Accuracy : 88.938 %
Respective CrossEntropy Loss : 0.388915

Here's a picture presenting the Loss and the Accuracy troughout epochs:

For what concerns the evaluation part, here we show our results:

Note that you can find more details about the training, evaluation and comparing results inside the doc directory.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgments

This implementation is inspired by the original PointNet paper and aims to provide a JAX/Flax-based alternative for the community.

Also all data are intended to be used as specified in their research and site.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JaxFlax-PointNet

Features

Installation

Usage

IMPORTANT

Results

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
doc		doc
old_repo_result/dump_tf_250_1024		old_repo_result/dump_tf_250_1024
src		src
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
cpu_environment.yml		cpu_environment.yml
gpu_environment.yml		gpu_environment.yml
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

JaxFlax-PointNet

Features

Installation

Usage

IMPORTANT

Results

License

Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages