-
Notifications
You must be signed in to change notification settings - Fork 5
Library Overview
Abhiram S edited this page Sep 23, 2025
·
6 revisions
High-level overview of AOCL-DLP architecture, components, and design goals.
- GEMM kernels and drivers
- Post-operations framework (metadata-driven)
- Element-wise utilities
- Threading and parallelization controls
- float32, bfloat16, int8/uint8, int32; mixed-precision flows
- Types: Types API Reference
- Prepare data (layouts, leading dimensions)
- Optional reordering for repeated use
- Configure
dlp_metadata_tfor fused post-ops - Call GEMM or eltwise
- Optional de/reordering for outputs
Targets AVX2/FMA3, AVX512, AVX512_VNNI, AVX512_BF16 on supported AMD CPUs.
Getting Started
User Guides
- Library Overview
- GEMM Guide
- Batch GEMM Guide
- Post-Operations
- Eltwise Operations
- Quantization
- API Lifecycle
Performance & Config
Testing & Benchmarking
Developer Guides
Reference