Skip to content

Conversation

@abstractqqq
Copy link
Owner

@abstractqqq abstractqqq commented Nov 28, 2025

If min(a) = max(a), then min_max scaling of column a will map it to all NaN, which may cause subsequent issues in models.

Keeping a column of constant is better than running into errors in modelling.

To avoid that, we can drop such expressions once we learn the min, max.

The same for standard transform when std = 0.

Updated transformations to skip columns with zero means or standard deviations.
@abstractqqq abstractqqq marked this pull request as ready for review November 28, 2025 15:39
@abstractqqq abstractqqq merged commit 8249f31 into main Nov 30, 2025
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants