Release v24.10.0 · NVIDIA/spark-rapids-ml

Release notes as follows:

- Migrated cuML-based ivf-flat and ivf-pq to cuVS and added support for cosine distance.
- Added support for sparse data in UMAP.
- Added support for NNDescent-based k-NN graph building for UMAP.
- Updated AWS EMR examples to EMR version 7.3.
- Updated RAPIDS dependencies to 24.10.
- Dropped support for Python 3.9 (transitive from RAPIDS).
- Multiple bug and documentation fixes for data generation, CrossValidator, UMAP, DBSCAN, KMeans, and approximate k-NN implementations.
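
For readers unfamiliar with the newly supported metric, the following is a minimal pure-Python sketch of cosine distance. It assumes the common convention that cosine distance equals 1 minus cosine similarity; the function name `cosine_distance` is hypothetical, and this is not the cuVS implementation.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length vectors.

    Assumption: "cosine distance" follows the usual 1 - similarity
    convention. This is an illustration, not library code.
    """
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        # Cosine similarity is undefined when either vector is all zeros.
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_u * norm_v)
```

Under this convention, orthogonal vectors are at distance 1 and vectors pointing in the same direction are at distance 0, regardless of their magnitudes.
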
Known issues:
- LogisticRegression hangs when fitting sparse data with all-zero features on a GPU.
- Various CUDA errors occur when spark.rapids.ml.uvm.enabled or spark.python.worker.reuse is set to true with multiple GPUs per executor. The workaround is to set either of those configs to false in clusters with multiple GPUs per executor.
- Error in multi-class RandomForest fit when one GPU does not see all class label values.
- CUDA error when there are fewer probes than k in the ivflat-pq ANN algorithm.
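
The workaround for the multi-GPU CUDA errors could be applied as sketched below. The two config keys come from the notes above; the helper names `multi_gpu_workaround_conf` and `to_submit_args` and the `disable` parameter are hypothetical, introduced only for illustration.

```python
# Config keys named in the known issue above.
_WORKAROUND_KEYS = {
    "uvm": "spark.rapids.ml.uvm.enabled",
    "worker_reuse": "spark.python.worker.reuse",
}


def multi_gpu_workaround_conf(disable="uvm"):
    """Return a Spark conf dict that sets one of the two configs to false.

    `disable` selects which config to turn off (hypothetical parameter).
    """
    if disable not in _WORKAROUND_KEYS:
        raise ValueError(f"disable must be one of {sorted(_WORKAROUND_KEYS)}")
    return {_WORKAROUND_KEYS[disable]: "false"}


def to_submit_args(conf):
    """Format a conf dict as a list of `--conf key=value` arguments."""
    args = []
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    return args


if __name__ == "__main__":
    print(" ".join(to_submit_args(multi_gpu_workaround_conf("uvm"))))
```

The resulting pairs can be passed to `spark-submit` as `--conf` arguments or set one by one with `SparkSession.builder.config(key, value)` before the session is created.
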
