GitHub

SCHISM stands for Semantic Classification of High-resolution Imaging for Scanned Materials. This framework provides tools for semantic segmentation of CT scanner images of rocks, but it is also applicable to any kind of image as long as semantic segmentation is required. The framework supports both training and inference workflows. As for the little trivia, this project got named after this :)

⚙️ Installation

Clone this repository to your local machine: git clone [email protected]:FloFive/SCHISM.git
Navigate to the cloned directory: cd <some path> SCHISM
Install the library (python 3.9 mini is required) pip install -e .

❓ How to use

SCHISM offers two main functionalities: Training and Inference.

General Steps

Organize your data in the required structure (see Data Preparation).
Set up an INI configuration file (see INI File Setup).
Run the main script: python schism.py
Navigate through the command-line menu:
- Option 1: Train a new model.
- Option 2: Make predictions using a trained model.

Training Workflow

Prepare the dataset: Ensure the dataset is organized according to the required directory structure (presented below).
Create an INI file: Define training parameters such as learning rate, batch size, and model architecture in the INI file (presented below).
Run the training command: Launch the training process, then select the training option and specify:
- The dataset directory: contains one or more datasets. The ordering and sorting of the data are explained later in this readme.
- The output folder: the space where, amongst others, a folder containing the model weights will be created after training. The files saved in the folder are later described in this readme.
- The path to the INI file.

Inference Workflow

To make predictions:

Use trained weights: Ensure the trained model weights are saved from the training phase.
Prepare the dataset for prediction: Organize the data in a compatible format.
Run the inference command: Launch the prediction process, then select the training option and specify:
- The folder containing trained weights.
- The dataset for prediction.

📜 INI File Setup

Below is an example of an INI file:

[Model]
n_block=4
channels=16
num_classes=3
model_type=UnetSegmentor
k_size=3
activation=leakyrelu
channel=16
 
[Optimizer]
optimizer=RAdam
lr=0.001
eps=1e-6
weight_decay=0.001

[Scheduler]
scheduler=ReduceLROnPlateau
mode=min
factor=0.5
patience=5
threshold=1e-4
threshold_mode=rel
cooldown=2
min_lr=1e-6
eps=1e-8
verbose=True

[Loss]
loss=CrossEntropyLoss
ignore_background=True
weights=True

[Training]
batch_size=4
val_split=0.8
epochs=40
metrics=Jaccard, F1, Recall, Accuracy, Precision, ConfusionMatrix
early_stopping=True

[Data]
crop_size=225
img_res=512
num_samples=1500

For information on both the network configurations and the INI file setup, please refer to this page.

👾 Data Preparation

The data should be organized as follows:

data <--- Select this folder for data input during training or inference.
|_dataset 1/
|   |_images/ <--- Contains grayscale TIFF images, sequentially named for logical ordering (e.g., image0000.tif, image0001.tif, etc.).
|   |_masks/ <--- Contains corresponding TIFF masks, named to match their respective images (e.g., mask0000.tif for image0000.tif).
|_dataset 2/
|   |_images/
|   |_masks/
|_dataset n/
|   |_images/
|   |_masks/
|_data_stats.json <--- This file is optional.

Images: The directory containing the input images. Images must be in TIFF format and will be automatically converted to HWC (Height, Width, Channels) format.
Masks: The directory containing the corresponding segmentation masks. Masks will be converted to 8-bit format (uint8) with values set between 0 and 255.
data_stats.json: (Optional) A JSON file containing mean and standard deviation values for normalization. Currently, this file must be set manually and should follow this format:

{
    "dataset1": [
        [0.52, 0.52, 0.52],
        [0.31, 0.31, 0.31]
    ],
    "dataset2": [
        [0.46, 0.46, 0.46],
        [0.5, 0.5, 0.5]
    ],

   [...]

    "datasetn": [
        [0.11, 0.11, 0.11],
        [0.42, 0.42, 0.42]
    ]
}

💾 Training Output Files

Upon completing a training session, several files will be generated in the weight folder:

data_stats.json: The standard deviation and mean values used to normalize the images.
hyperparameters.ini: A copy of the INI file used for the training session.
learning_curves.png: Displays the loss and metrics values as a function of the epochs.
model_best_{metric(s)}.pth: Contains the best model weights based on each metric specified in the INI file.
model_best_loss.pth: Contains the best model weights based on the loss value.
test/train/val_indices.txt: Records the indices of images and masks used for training, validation, and testing. These indices are formatted as [dataset subfolder][image or mask number in the folder]. For example, if you have 5,000 image/mask pairs, but num_samples is set to 3,000 and val_split is 0.8, then 2,400 indices will be recorded in train_indices.txt, 600 in val_indices.txt, and the remaining 2,000 in test_indices.txt.

❤️‍🔥 Contributions

Contributions are welcome! Please fork the repository and submit a pull request.

📚 Citation / Bibtex

If you use our solution or find our work helpful, please consider citing it as follows:

@misc{schism2025,
  title       = {SCHISM: Semantic Classification of High-resolution Imaging for Scanned Materials},
  author      = {Florent Brondolo and Samuel Beaussant and Soufiane Elbouazaoui and Saïd Ezzedine},
  year        = {2025},
  howpublished= {\url{https://github.com/FloFive/SCHISM}},
  note        = {GitHub repository}
}

Name		Name	Last commit message	Last commit date
Latest commit History 190 Commits
code		code
docs		docs
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

⚙️ Installation

❓ How to use

General Steps

Training Workflow

Inference Workflow

📜 INI File Setup

👾 Data Preparation

💾 Training Output Files

❤️‍🔥 Contributions

📚 Citation / Bibtex

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 7

Uh oh!

Languages

License

FloFive/SCHISM

Folders and files

Latest commit

History

Repository files navigation

⚙️ Installation

❓ How to use

General Steps

Training Workflow

Inference Workflow

📜 INI File Setup

👾 Data Preparation

💾 Training Output Files

❤️‍🔥 Contributions

📚 Citation / Bibtex

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 7

Uh oh!

Languages

Packages