Run main.py function as a Python script (e.g. "python main.py" from Anaconda Prompt). The main.py function has the arguments below:
-h, --help show this help message and exit <br>
--train_tif_dir TRAIN_TIF_DIR <br>
Directory containing TIF files of the training set <br>
--train_jpg_dir TRAIN_JPG_DIR <br>
Destination folder for JPEG files of the training set <br>
--test_tif_dir TEST_TIF_DIR <br>
The folder with the tif images for the predictions <br>
--test_jpg_dir TEST_JPG_DIR <br>
The destination folder for JPEG files to be modelled <br>
--base_dir [BASE_DIR] <br>
Directory of the training set <br>
--threshold [THRESHOLD] <br>
Classification threshold <br>
--model_name MODEL_NAME <br>
Arbitrary name of the Random forest model <br>
1.) The pipeline starts with converting the confocal images in tif format to jpg format.
Notes:
If there aren't new tif images for training then the model will use the available training set.
If there are new tif images for training place them into
train/tif/A: if there isn't colony in the field
train/tif/B: if it is an image with colony/colonies.
2.) Training Random forest on the training set (=converted jpg images)
3.) Evaluating the training
Notes:
There will be generated some performance plots. These can be found in the plot folder.
4.) Making predictions on new images using the trained and tuned Random forest model
Notes:
The new images to be tested must place into the test/tif folder before the run.
5.) Selecting the images with potential colony/colonies and place them in the "final" folder.
See requirements.txt