Open
Description
Running rep-reading on google/gemma-2-2b-it fails with this error:
ValueError: Input X contains NaN.
PCA does not accept missing values encoded as NaN natively. For supervised learning, you might want to consider sklearn.ensemble.HistGradientBoostingClassifier and Regressor which accept missing values encoded as NaNs natively. Alternatively, it is possible to preprocess the data, for instance by using an imputer transformer in a pipeline or drop samples with missing values. See https://scikit-learn.org/stable/modules/impute.html You can find a list of all estimators that handle NaN values at the following page: https://scikit-learn.org/stable/modules/impute.html#estimators-that-handle-nan-values
The cause is upstream of this repo (see huggingface/transformers#32390 and pytorch/pytorch#131060).
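In case it helps anyone triage, here is a minimal sketch for checking which rows of the hidden-state matrix contain NaN before they reach PCA. It assumes the hidden states have already been collected into a 2-D array of shape (n_samples, hidden_dim). `split_nan_rows` is a hypothetical helper written for this comment, not part of this repo, and the toy array stands in for real model activations.

```python
import numpy as np


def split_nan_rows(hidden_states):
    """Hypothetical helper: separate rows with NaN from clean rows.

    Returns (clean_rows, bad_indices), where bad_indices lists the
    row positions that contain at least one NaN.
    """
    X = np.asarray(hidden_states, dtype=np.float32)
    bad_mask = np.isnan(X).any(axis=1)  # True for rows with any NaN
    return X[~bad_mask], np.flatnonzero(bad_mask)


# Toy stand-in for hidden states (assumption: real data is 2-D like this).
X = np.array([[0.1, 0.2], [np.nan, 0.3], [0.4, 0.5]])
clean, bad = split_nan_rows(X)
print(f"{len(bad)} of {len(X)} rows contain NaN: {bad.tolist()}")
```

Dropping the affected rows only lets PCA run; it does not fix whatever produces the NaNs in the forward pass, so it is mainly useful for confirming how many samples are affected.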