Library Overview

Abhiram S edited this page Sep 23, 2025 · 6 revisions

High-level overview of AOCL-DLP architecture, components, and design goals.

Components

  • GEMM kernels and drivers
  • Post-operations framework (metadata-driven)
  • Element-wise utilities
  • Threading and parallelization controls

Data Types

Call Layers

  1. Prepare data (layouts, leading dimensions)
  2. Optional reordering for repeated use
  3. Configure dlp_metadata_t for fused post-ops
  4. Call GEMM or eltwise
  5. Optional de/reordering for outputs
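The flow above can be illustrated with a plain-C reference sketch. It does not use the AOCL-DLP API: `sketch_post_ops` and `sketch_gemm_f32` are hypothetical names invented for this illustration, and the struct is only a stand-in for the idea behind `dlp_metadata_t`, not its real layout. The sketch assumes row-major float matrices and shows how leading dimensions address padded rows and how a bias add and ReLU can be fused into the GEMM write-back rather than run as separate passes.

```c
#include <stddef.h>

/* Hypothetical post-op description for this sketch only.
 * It is NOT the real dlp_metadata_t from AOCL-DLP. */
typedef struct {
    const float *bias; /* one value per output column, or NULL for no bias */
    int apply_relu;    /* nonzero: clamp negative results to zero */
} sketch_post_ops;

/* Reference GEMM: C[m x n] = A[m x k] * B[k x n], row-major.
 * lda, ldb, ldc are leading dimensions (elements between row starts),
 * so each matrix may live inside a larger, padded buffer.
 * Post-ops are applied to each accumulator before it is stored. */
void sketch_gemm_f32(size_t m, size_t n, size_t k,
                     const float *a, size_t lda,
                     const float *b, size_t ldb,
                     float *c, size_t ldc,
                     const sketch_post_ops *ops)
{
    for (size_t i = 0; i < m; ++i) {
        for (size_t j = 0; j < n; ++j) {
            float acc = 0.0f;
            for (size_t p = 0; p < k; ++p) {
                acc += a[i * lda + p] * b[p * ldb + j];
            }
            if (ops != NULL) {
                if (ops->bias != NULL) {
                    acc += ops->bias[j];      /* fused bias add */
                }
                if (ops->apply_relu && acc < 0.0f) {
                    acc = 0.0f;               /* fused ReLU */
                }
            }
            c[i * ldc + j] = acc;
        }
    }
}
```

A caller would fill `sketch_post_ops` once, then pass it with the matrix pointers and leading dimensions; passing `NULL` for `ops` gives a plain matrix product. An optimized library would replace the triple loop with blocked, vectorized kernels, but the order of steps (prepare data, describe post-ops, call GEMM) stays the same.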

Hardware Features

AOCL-DLP targets the AVX2/FMA3, AVX512, AVX512_VNNI, and AVX512_BF16 instruction set extensions on supported AMD CPUs.
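To see which of these extensions a given machine reports, an application can query the CPU at run time. The following is a minimal sketch that uses the GCC/Clang builtins `__builtin_cpu_init` and `__builtin_cpu_supports`. It is not part of AOCL-DLP, and the `SKETCH_*` flags and `sketch_detect_features` are hypothetical names for this illustration. AVX512_BF16 is left out because whether the builtin accepts that feature name depends on the compiler version.

```c
/* Hypothetical feature flags for this sketch only. */
enum {
    SKETCH_AVX2        = 1u << 0,
    SKETCH_FMA3        = 1u << 1,
    SKETCH_AVX512F     = 1u << 2,
    SKETCH_AVX512_VNNI = 1u << 3
};

/* Returns a bitmask of the SKETCH_* flags the running CPU reports.
 * On non-x86 targets or other compilers the mask is simply 0. */
unsigned sketch_detect_features(void)
{
    unsigned mask = 0;
#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
    __builtin_cpu_init(); /* populate the compiler's CPU feature cache */
    if (__builtin_cpu_supports("avx2"))       mask |= SKETCH_AVX2;
    if (__builtin_cpu_supports("fma"))        mask |= SKETCH_FMA3;
    if (__builtin_cpu_supports("avx512f"))    mask |= SKETCH_AVX512F;
    if (__builtin_cpu_supports("avx512vnni")) mask |= SKETCH_AVX512_VNNI;
#endif
    return mask;
}
```

Checking `sketch_detect_features() & SKETCH_AVX512_VNNI`, for example, tells a caller whether the CPU reports that extension. How the library itself selects kernels for each extension is not described on this page.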