-
Notifications
You must be signed in to change notification settings - Fork 568
Improve detection of scikit-learn parity regressions #6553
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve detection of scikit-learn parity regressions #6553
Conversation
Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually. Contributors can view more details about this message here. |
/ok to test 5759457 |
/ok to test 8d56c54 |
/ok to test 1772262 |
…sai#6553) This change modifies the KMeans test to set n_init to 2, enhancing the stability of the k-means|| initialization process. Additional context can be found in issue rapidsai#5530.
#6555) This change modifies the KMeans test to set n_init to 2, enhancing the stability of the k-means|| initialization process. Additional context can be found in issue #5530. Closes #5530. Authors: - Simon Adorf (https://github.com/csadorf) Approvers: - Divye Gala (https://github.com/divyegala) - Jim Crist-Harif (https://github.com/jcrist) URL: #6555
1772262
to
fd2d0b2
Compare
/ok to test fd2d0b2 |
fd2d0b2
to
a80327d
Compare
/ok to test a80327d |
/ok to test d5ed917 |
/ok to test e00dc0b |
e00dc0b
to
86819cf
Compare
/ok to test 86819cf |
- Move most of the related tooling from ci/ to python/cuml/cuml/accel/tests/scikit-learn/ - Add xfail-list.yaml to track known differences from scikit-learn - Add --format=xfail_list to automatically generate xfail lists from failures The infrastructure enables continuous monitoring of scikit-learn parity by: - Running scikit-learn tests with cuML acceleration - Tracking pass rates and comparing against thresholds - Automatically detecting new failures - Managing known differences via xfail lists Import pytest lazily within the pytest_collection_modifyitems function to avoid requiring it for normal cuML usage, ensuring pytest is only needed during test execution.
86819cf
to
58f24eb
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Two tiny nits, but otherwise this looks great! Thanks for working on this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approving for packaging-codeowners
/ ci-codeowners
.
Full disclosure, I only skimmed summarize-results.py
, since it looks pretty complex but also isn't user-facing.
/merge |
This PR enhances the infrastructure for monitoring scikit-learn parity in cuML's accelerated estimators, enabling better tracking and management of differences between scikit-learn and cuML implementations.
Key Changes
ci/
topython/cuml/cuml/accel/tests/scikit-learn/
for better organizationxfail-list.yaml
to track known differences from scikit-learnInfrastructure Capabilities
The new infrastructure enables continuous monitoring of scikit-learn parity by:
Related Changes