Commit 8738735

Updated readme with undersampling ratio
1 parent e9e4984 commit 8738735

1 file changed: +4 -2 lines changed

README.md

Lines changed: 4 additions & 2 deletions
@@ -41,8 +41,9 @@ This project develops machine learning models for predicting Tunnel Boring Machi
 
 - **Outlier Detection**: Isolation Forest-based outlier removal
 - **Class Imbalance Handling**:
-  - RandomUnderSampler for majority class reduction
+  - RandomUnderSampler for majority class reduction (ratio-based or absolute)
   - SMOTE for minority class oversampling
+  - Configurable undersampling ratio (e.g., 1.0 = 1:1 majority:minority ratio)
 - **Feature Scaling**: StandardScaler normalization
 - **Train/Test Split**: Stratified split preserving class distributions
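
To make the ratio semantics in the added bullet concrete, here is a minimal sketch of ratio-based random undersampling for a binary problem. It is not the project's code: `undersample_indices` is a hypothetical helper written with the standard library only, and it assumes the ratio means "majority samples kept per minority sample", as the README wording suggests.

```python
import random
from collections import Counter


def undersample_indices(labels, ratio, seed=0):
    """Keep every minority sample and at most ratio * n_minority
    randomly chosen majority samples (binary labels only)."""
    counts = Counter(labels)
    if len(counts) != 2:
        raise ValueError("this sketch handles exactly two classes")
    (majority, n_major), (minority, n_minor) = counts.most_common(2)

    # Never ask for more majority samples than actually exist.
    target = min(n_major, round(ratio * n_minor))

    rng = random.Random(seed)  # seeded for reproducible sampling
    major_idx = [i for i, y in enumerate(labels) if y == majority]
    minor_idx = [i for i, y in enumerate(labels) if y == minority]
    return sorted(rng.sample(major_idx, target) + minor_idx)


# 90 majority vs. 10 minority samples, reduced to a 2:1 ratio.
labels = [0] * 90 + [1] * 10
kept = undersample_indices(labels, ratio=2.0)
resampled = Counter(labels[i] for i in kept)
```

If imbalanced-learn's `RandomUnderSampler` is used instead, note that its float `sampling_strategy` is defined the other way round (minority count divided by majority count after resampling), so a README ratio of 2.0 would correspond to 0.5 there.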

@@ -146,7 +147,8 @@ data:
   test_path: "data/model_ready/dataset_test.csv"
 
 preprocessing:
-  undersample_level: 2000 # Majority class samples
+  undersample_level: null # Absolute number (overrides ratio if set), or null to use ratio
+  undersample_ratio: 1.0 # Majority:minority ratio (1.0 = 1:1, 2.0 = 2:1, etc.)
   oversample_level: 0 # Minority class SMOTE (0=disabled)
   outlier_removal: true
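
The precedence described in the new config comments (an absolute `undersample_level` overrides `undersample_ratio`; `null` falls back to the ratio) could be resolved roughly as follows. This is a sketch under stated assumptions, not the repository's implementation: `resolve_majority_target` is a hypothetical name, and capping the result at the available majority count is an added assumption.

```python
def resolve_majority_target(n_majority, n_minority,
                            undersample_level=None, undersample_ratio=1.0):
    """Return how many majority-class samples to keep."""
    if undersample_level is not None:
        # An absolute count wins over the ratio when it is set.
        target = int(undersample_level)
    else:
        # Otherwise keep `undersample_ratio` majority samples per minority sample.
        target = round(undersample_ratio * n_minority)
    # Assumption: never request more samples than the majority class has.
    return max(0, min(target, n_majority))


# Values mirroring the YAML above (null maps to None when loaded).
preprocessing = {"undersample_level": None, "undersample_ratio": 1.0}
target = resolve_majority_target(5000, 800, **preprocessing)
```

With imbalanced-learn, a count like this could be passed to `RandomUnderSampler` as a dict, for example `sampling_strategy={majority_label: target}`, where `majority_label` stands for whatever label the majority class carries in the dataset.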
