Skip to content

Commit e2b45a8

Browse files
committed
add pointer to stage level scheduling
Signed-off-by: Erik Ordentlich <[email protected]>
1 parent 6793d1e commit e2b45a8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

docs/site/FAQ.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,5 +24,5 @@ Be aware that some Spark Rapids ML algorithms (such as NearestNeighbors) may con
2424
### What are some possible causes of low-level CUDA and/or native code errors?
2525

2626
- NaNs or nulls in the input data. These are currently passed directly into the cuML layer and may trigger such errors.
27-
- NCCL communication library does not allow communication between processes on the same GPU. Check your Spark GPU configs to ensure 1 task per GPU during fit() calls.
27+
- NCCL communication library does not allow communication between processes on the same GPU. [Stage level scheduling](https://nvidia.github.io/spark-rapids-ml/performance.html#stage-level-scheduling) can avoid this but it is not supported in all cases. Check requirements and adjust your Spark GPU configs to ensure 1 task per GPU during fit() calls if needed.
2828
- Previously unknown bugs. Please file an issue.

0 commit comments

Comments
 (0)