@Dawoodsdatalife

#777 Note: cuDNN 9 is also common with CUDA 12.x
#1322 Added faster compute capabilities: 9.0

@Dawoodsdatalife
Author

Optimized TensorFlow CUDA Support for CUDA 12.8

My Comments:

This pull request updates TensorFlow's CUDA support to ensure compatibility and optimal performance with CUDA 12.8.

Key changes include:

  • Updated documentation and code examples to reflect CUDA 12.8.
  • Added compute capability 9.0 to supported architectures.
  • Clarified cuDNN 8.x and 9.x compatibility.

I also ensured that advanced CUDA features such as Tensor Core usage, memory management, asynchronous operations, and CUDA Graphs are aligned with current CUDA best practices.
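For reviewers who want to see what "adding compute capability 9.0" could look like on the build side, here is a minimal sketch. It assumes the build reads a comma-separated list of plain `major.minor` values, as TensorFlow's `TF_CUDA_COMPUTE_CAPABILITIES` environment variable can. The helper name `add_compute_capability` is hypothetical and is not part of this PR.

```python
def add_compute_capability(current, new_cc="9.0"):
    """Return a comma-separated capability list that contains new_cc exactly once.

    Assumption: every entry is a plain "major.minor" string such as "8.6".
    Entries in other notations (for example "sm_80") are not handled here.
    """
    # Split the existing list and drop empty fragments and stray whitespace.
    items = [c.strip() for c in current.split(",") if c.strip()]
    if new_cc not in items:
        items.append(new_cc)
    # Sort numerically so that "10.0" would come after "9.0".
    items.sort(key=lambda c: tuple(int(p) for p in c.split(".")))
    return ",".join(items)
```

One possible use is to pass the current value of the environment variable through this helper before running the build configuration, so that 9.0 is appended without duplicating an existing entry.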

These updates provide the following benefits:

  • Improved performance on the latest NVIDIA GPUs.
  • Enhanced compatibility with the latest CUDA toolkit.
  • Clearer guidance for developers on leveraging advanced CUDA features.

I tested these changes on Ubuntu 22.04 with an NVIDIA RTX 4090 (compute capability 8.9), CUDA 12.8, and cuDNN 8.9. Please let me know if you have any questions or need further information.
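To make the test environment easier to confirm on another machine, a rough sketch of a version check follows. The functions `parse_major_minor`, `summarize`, and `query_tensorflow` are hypothetical names for illustration. The sketch relies on `tf.sysconfig.get_build_info()` and `tf.config.experimental.get_device_details()`, and it assumes the build info exposes `cuda_version` and `cudnn_version` keys, which may be absent on CPU-only builds.

```python
def parse_major_minor(version):
    """Turn a version string such as "12.8" or "9" into a (major, minor) tuple."""
    parts = str(version).split(".")
    major = int(parts[0])
    minor = int(parts[1]) if len(parts) > 1 else 0
    return (major, minor)


def summarize(build_info, compute_capabilities):
    """Report whether the build and GPUs match the targets named in this PR."""
    # Missing keys fall back to "0", which fails both version checks.
    cuda = parse_major_minor(build_info.get("cuda_version", "0"))
    cudnn = parse_major_minor(build_info.get("cudnn_version", "0"))
    return {
        "cuda_12x": cuda[0] == 12,
        "cudnn_8_or_9": cudnn[0] in (8, 9),
        "has_cc_90": any(tuple(cc) == (9, 0) for cc in compute_capabilities),
    }


def query_tensorflow():
    """Collect build info and GPU compute capabilities from a local TensorFlow install."""
    import tensorflow as tf  # imported lazily so summarize() works without TensorFlow

    build_info = tf.sysconfig.get_build_info()
    capabilities = []
    for gpu in tf.config.list_physical_devices("GPU"):
        details = tf.config.experimental.get_device_details(gpu)
        if "compute_capability" in details:
            capabilities.append(details["compute_capability"])
    return build_info, capabilities
```

On a machine with TensorFlow installed, `summarize(*query_tensorflow())` returns the three flags. An RTX 4090 reports compute capability 8.9, so `has_cc_90` would be false there, and the 9.0 code path would need a GPU that reports 9.0 to be exercised.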

Ready for review. Thank you!

@CLAassistant

CLAassistant commented Mar 24, 2025

CLA assistant check
All committers have signed the CLA.

Collaborator

@Yancey0623 Yancey0623 left a comment


LGTM!!


3 participants