Add scikit-learn set_output() API support to cuml.accel #6606

Open
@csadorf

Description

Currently, cuml.accel does not support scikit-learn's set_output() API, which was introduced in scikit-learn 1.2.0 to control the output container returned by an estimator's transform() and fit_transform() methods. With it, users can choose whether outputs are returned as NumPy arrays (the default), pandas DataFrames, or other supported formats.

Expected Behavior

All cuml.accel estimators that implement transform() or predict() methods should support the set_output() API. This includes:

  • PCA
  • TruncatedSVD
  • KNeighborsClassifier
  • KNeighborsRegressor
  • NearestNeighbors
  • And other relevant estimators

Dependencies

  • scikit-learn >= 1.2.0 (for set_output API)
  • pandas (for DataFrame output support)

Related Issues

Acceptance Criteria

  • All relevant estimators support set_output()
  • Tests pass for both numpy and pandas output formats
  • Documentation is updated to reflect the new functionality
  • No regression in existing functionality

Metadata

Assignees

No one assigned

Labels

bug (Something isn't working), cuml-accel (Issues related to cuml.accel)
