TeraiNet

This repository contains scripts and notebooks to build a species classification model focussed on the Terai region in Nepal, namely TeraiNet. It can classify 10 different classes (including tigers) in camera trap images, using ML (MegaDetector and EfficientNetV2M), open data (LILA BC), open source tools (MEWC) and free compute resources (Google Colab and Kaggle).

Credentials: LILA BC, MegaDetector, own illustration.

Motivation and relevance

tigers are an endangered species, NGOs like the Nepal Tiger Trust protect them
there is no open and easy way for ecologists/researchers/NGOs to classify their camera trap images with regard to tigers
ML and open data/tools can help reduce the amount of manual labor when sifting through large amounts of camera trap images, looking for the needle in the haystack
there appears to be no such species classification model focussed on the Terai region in Nepal yet
goal: train a species classifier focussing on the most relevant species in the Bengal tiger's ecosystem in Nepal, namely the Terai region, and make it openly available through AddaxAI (formerly known as EcoAssist)

Data preparation

1. Select data sources

LILA BC
amur tiger re-identification challenge at CVWC 2019

2. Sample images

Get image URLs from LILA BC
Sample desired amount of images per class
Create train test split if applicable

Classes

Tiger
Leopard
Asian black bear, American black bear (not enough images of Asian black bear alone)
Other carnivores, including dhole, black-backed jackal, gray fox, leopard cat, mainland leopard cat, marbled cat, Asian golden cat (including substitutes, i.e., black-backed jackal and gray fox)
Deer
Wild boar
African buffalo, Cape buffalo (substitute for gaur)
White rhinoceros (substitute for Indian rhino)
Asian elephant, African elephant, African bush elephant (not enough images of Asian elephant alone)
Bird

Note: Due to a lack of images for certain species some fairly heavy compromises had to be taken. It remains to be seen how well a model trained with such compromises generalizes to the target region.

3. Download images

LILA BC
amur tiger re-identification challenge

Note: Since free Colab and Drive have limited storage capacities, one might have to download images for one species at a time (and free up space before proceeding to the next).

Note: I found the image downloading to be much faster in Colab and Drive compared to Kaggle.

4. Preprocess images

Run MegaDetector on all images
Snip images following mewc-snip
Copy snipped images to Kaggle Output

Note: Images must have been previously downloaded to Drive/local machine via Colab and then uploaded to Kaggle as a dataset (zipped folder).

Note: I found access to free GPUs much better and transparent in Kaggle compared to Colab.

Model training and evaluation

Use Keras Image Models
Follow mewc-flow and mewc-train
Log experiments using Weights & Biases

I selected a pre-trained EfficientNetV2M with 54M parameters because it constitutes a good compromise between predictive performance, training time and model size. The model has been trained for 50 epochs (early stopping after 31 epochs) with 2000 images per class (250 images per class in the validation set). The model has been evaluated on a test set with 250 images per class.

Test set performance overview

Accuracy: 0.899
Precision: 0.900
Recall: 0.899
F1: 0.899

Test set performance per class

	precision	recall	f1-score	support
tiger	1.000	0.996	0.998	250
leopard	0.919	0.868	0.893	250
black_bear	0.938	0.968	0.953	250
other_carnivores	0.927	0.912	0.919	250
deer	0.911	0.896	0.903	250
wild_boar	0.909	0.916	0.912	250
buffalo	0.891	0.884	0.888	250
rhino	0.843	0.836	0.839	250
elephant	0.862	0.848	0.855	250
bird	0.801	0.868	0.833	250
accuracy	0.899	0.899	0.899	2500
macro avg	0.900	0.899	0.899	2500
weighted avg	0.900	0.8992	0.899	2500

Test set performance for the tiger class is extremely high because the respective images are unrealistically good. The performance on the tiger images in test2 is more realistic (test2 accuracy: 0.868). Most other classes have metrics of ~0.9+. But performance on some classes, including rhino, elephant and bird is somewhat lower. I believe that the quality of the rhino and elephant images is not ideal, e. g. there seem to be mix-ups or even other species included (e. g. zebra). Birds tend to be difficult to classify correctly in general.

Test set confusion matrix

There appears to be some confusion between leopard and rhino, other carnivores and buffalo, deer and black bear, buffalo and rhino, rhino and bird, and elephant and bird.

Note: There are only ~300 tiger images on LILA BC. I didn't use them in training and validation but instead put all of them in test2 to examine how the model would potentially generalize to tiger camera trap images from another source than the tiger training images (like it would be the case with the Nepal Tiger Trust using the model on their own images through AddaxAI, for example).

Deployment

Publish model on HuggingFace
Integrate and use model in AddaxAI (formerly known as EcoAssist)

Join AI for Conservation Slack and WILDLABS if you're interested in using technology for conservation.

Feel free to reach out if you have feedback/ideas for new use cases or would like to contribute/collaborate!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

TeraiNet

Motivation and relevance

Data preparation

1. Select data sources

2. Sample images

3. Download images

4. Preprocess images

Model training and evaluation

Test set performance overview

Test set performance per class

Test set confusion matrix

Deployment

Files

README.md

Latest commit

History

README.md

File metadata and controls

TeraiNet

Motivation and relevance

Data preparation

1. Select data sources

2. Sample images

3. Download images

4. Preprocess images

Model training and evaluation

Test set performance overview

Test set performance per class

Test set confusion matrix

Deployment