To accelerate your high-performance deep learning models, you can integrate Intel Gaudi AI accelerators into {productname-short}. This integration enables your data scientists to use Gaudi libraries and software associated with Intel Gaudi AI accelerators through custom-configured workbench instances.
Intel Gaudi AI accelerators offer optimized performance for deep learning workloads, with the latest Gaudi 3 devices providing significant improvements in training speed and energy efficiency. These accelerators are suitable for enterprises running machine learning and AI applications on {productname-short}.
Before you can enable Intel Gaudi AI accelerators in {productname-short}, you must complete the following steps:
-
Install the latest version of the Intel Gaudi AI Accelerator Operator from OperatorHub.
-
Create and configure a custom workbench image for Intel Gaudi AI accelerators. A prebuilt workbench image for Gaudi accelerators is not included in {productname-short}.
-
Manually define and configure an accelerator profile for each Intel Gaudi AI device in your environment.
{org-name} supports Intel Gaudi devices up to Intel Gaudi 3. The Intel Gaudi 3 accelerators, in particular, offer the following benefits:
-
Improved training throughput: Reduce the time required to train large models by using advanced tensor processing cores and increased memory bandwidth.
-
Energy efficiency: Lower power consumption while maintaining high performance, reducing operational costs for large-scale deployments.
-
Scalable architecture: Scale across multiple nodes for distributed training configurations.
Your OpenShift platform must support EC2 DL1 instances to use Intel Gaudi AI accelerators in an Amazon EC2 DL1 instance. You can use Intel Gaudi AI accelerators in workbench instances or model serving after you enable the accelerators, create a custom workbench image, and configure the accelerator profile.
To identify the Intel Gaudi AI accelerators present in your deployment, use the lspci
utility. For more information, see lspci(8) - Linux man page.
Important
|
The presence of Intel Gaudi AI accelerators in your deployment, as indicated by the |