rt-sdk-ara2 ⚡

Runtime SDK for AI/ML acceleration with Ara240 NPU on i.MX SoCs 🚀🧠💻

📋 Table of Contents

Overview
Key Components
Required Hardware
Required Software
Installation on i.MX
Getting Started
Helper Scripts
Python API and Optimum-Ara
GStreamer Plugins
UIO DMA Driver
Package Components
Release Notes
Licensing

🎯 Overview

The rt-sdk-ara2 Debian package provides a complete runtime environment for AI/ML acceleration using the Ara240 NPU on NXP i.MX SoCs. This package includes:

🔧 Runtime libraries for Ara240 NPU integration
🐍 Python bindings (DVAPI) for custom inference applications
🤖 Optimum-Ara framework for LLMs and VLMs
🎥 GStreamer plugins for real-time object detection applications
🛠️ Helper scripts for monitoring, benchmarking, and model management
⚙️ Systemd service for automatic hardware initialization
🚗 UIO DMA driver for PCIe communication

Software Architecture

flowchart LR
   a["Application"]
   
   subgraph Python
      b["Optimum-Ara Framework"]
      c["dvapi.py - Python Bindings"]
   end
   
   subgraph GST
      y["GstdvPlugins"]
      z["ara_vision_infer"]
   end
   
   d["libaraclient.so - Client Library"]
   e["Proxy Daemon"]
   f["uiodma.ko - UIO DMA Driver"]
   h["ARA240 NPU Hardware"]
   
   a --> Python
   a --> GST
   Python --> d
   GST --> d
   d --> e
   e --> f
   f --> h

   classDef appLayer fill:#FFD9A3,stroke:#C98400,color:#333
   classDef directLayer fill:#A8E0F0,stroke:#0B8AB3,color:#333
   classDef clientLayer fill:#B8E6A3,stroke:#54A200,color:#333
   classDef proxyLayer fill:#FFD9A3,stroke:#C98400,color:#333
   classDef driverLayer fill:#A8E0F0,stroke:#0B8AB3,color:#333
   classDef hardwareLayer fill:#B8E6A3,stroke:#C98400,color:#333
   classDef pythonLayer fill:#A8E0F0,stroke:#0B8AB3,color:#333
   classDef pythonBlock fill:#A8E0F0,stroke:#54A200,color:#333

   class a appLayer
   class b,c,y,z pythonLayer
   class Python,GST pythonBlock
   class d clientLayer
   class e proxyLayer
   class f driverLayer
   class h hardwareLayer

🔑 Key Components

This package integrates the following external components:

Component	Repository	License
UIO DMA Driver	https://github.com/nxp-imx-support/uiodma-driver	GPL-2.0-only
Optimum-Ara	[Coming Soon]	Apache-2.0
GStreamer Plugins	[Coming Soon]	LGPL-2.1-or-later

🧰 Required Hardware

i.MX 8M Plus FRDM / i.MX 95 FRDM
Ara240 NPU module
microSD card (≥ 64GB recommended for LLM/VLM support)
USB-C debug cable
Internet connection
Power supply

💻 Required Software

Embedded Linux for i.MX (== LF6.18.2_1.0.0)
rt-sdk-ara2_2.0.4.deb

🛠️ Installation on i.MX

Step 1: Set System Date and Time

Set the current date and time to avoid certificate issues during package downloads:

date -s "DD-MMM-YYYY HH:MM:SS"

Example:

date -s "1-APR-2026 00:00:00"
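If you prefer to generate the command on your host PC instead of typing the timestamp by hand, a minimal sketch could look like the following. The helper name `date_set_command` is hypothetical and not part of the SDK; it only formats a string in the DD-MMM-YYYY HH:MM:SS layout shown above.

```python
from datetime import datetime

# Month abbreviations are spelled out so the result does not depend on the locale.
MONTHS = ("JAN", "FEB", "MAR", "APR", "MAY", "JUN",
          "JUL", "AUG", "SEP", "OCT", "NOV", "DEC")


def date_set_command(now: datetime) -> str:
    """Return a `date -s` command line for the given timestamp."""
    stamp = f"{now.day}-{MONTHS[now.month - 1]}-{now.year} {now:%H:%M:%S}"
    return f'date -s "{stamp}"'


if __name__ == "__main__":
    # Print the command for the host's current time; paste it into the board console.
    print(date_set_command(datetime.now()))
```

The printed line can be pasted into the board console as-is.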

Step 2: Transfer the Debian Package

Transfer rt-sdk-ara2_2.0.4.deb to your i.MX board using scp:

scp rt-sdk-ara2_2.0.4.deb root@<ip_addr>:

Step 3: Install the Package

Install the package. The installer automatically resizes the disk partition to make room for LLM support:

dpkg -i rt-sdk-ara2_2.0.4.deb

The installation process will:

⚙️ Configure systemd service (rt-sdk-ara2.service) for automatic startup
💾 Expand system partition to maximize storage capacity
🔧 Set up udev rules for Ara240 NPU detection
📦 Install all necessary libraries, scripts, and tools

Step 4: Reboot the Board

This is a critical step! Reboot the board to load the driver and start the service:

reboot

Wait approximately 60-70 seconds for the boot process to complete. You should see these log messages indicating successful initialization:

[   59.645245] bash[1051]: 2026-03-11 02:21:19 - Proxy launched succesfully
[   60.678728] bash[1059]: 2026-03-11 02:21:20 - Hardware bringup is done (1 device(s) configured) and proxy is launched successfully in the background.
[   60.679857] bash[868]: Logs saved in: /usr/share/rt-sdk-ara240/saved_logs/rt-sdk-ara2_logs.txt

Note: The boot process includes loading the uiodma.ko driver, performing Ara240 hardware bringup, and launching the proxy daemon.
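To check for the success message from a script rather than by eye, a small sketch like the one below can help. It assumes that the saved log file named in the boot output contains the same "Hardware bringup is done" line; the function name `configured_devices` is hypothetical.

```python
import re
from typing import Optional

# Matches the bringup line shown in the boot log and captures the device count.
BRINGUP_RE = re.compile(r"Hardware bringup is done \((\d+) device\(s\) configured\)")


def configured_devices(log_text: str) -> Optional[int]:
    """Return the number of configured devices, or None if bringup was not logged."""
    match = BRINGUP_RE.search(log_text)
    return int(match.group(1)) if match else None


if __name__ == "__main__":
    # ASSUMPTION: the saved log file holds the same messages as the console.
    with open("/usr/share/rt-sdk-ara240/saved_logs/rt-sdk-ara2_logs.txt") as f:
        print(configured_devices(f.read()))
```

A result of `None` suggests that bringup did not complete, or that the message is logged elsewhere on your image.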

Step 5: Verify Firmware Version

Check the installed firmware version (should be 131072):

chip_info.sh

Look for the firmware_version(raw) field in the output:

| firmware_version(raw)       | 131072     |
| firmware_version            | 2.0.0      |
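The raw value can also be extracted and decoded programmatically. The sketch below parses the table row shown above and converts the raw number to a dotted version. The bit layout (major in bits 16 and up, minor in bits 8-15, patch in bits 0-7) is an assumption inferred only from the single pair 131072 and 2.0.0 in this document, so treat it as illustrative rather than as the official encoding. Both function names are hypothetical.

```python
import re
from typing import Optional

# Matches the "| firmware_version(raw) | 131072 |" row of the chip_info.sh table.
RAW_ROW_RE = re.compile(r"\|\s*firmware_version\(raw\)\s*\|\s*(\d+)\s*\|")


def raw_firmware_version(chip_info_output: str) -> Optional[int]:
    """Extract the raw firmware version from chip_info.sh output, if present."""
    match = RAW_ROW_RE.search(chip_info_output)
    return int(match.group(1)) if match else None


def decode_firmware(raw: int) -> str:
    """Decode a raw value into major.minor.patch (ASSUMED bit layout, see above)."""
    major, minor, patch = raw >> 16, (raw >> 8) & 0xFF, raw & 0xFF
    return f"{major}.{minor}.{patch}"
```

With the sample output above, `raw_firmware_version` yields 131072 and `decode_firmware(131072)` yields "2.0.0", which matches the display value reported by the script.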

Step 6: Update Firmware (if needed)

Firmware flashing is a one-time activity and persists across reboots. If the firmware version is not 131072, update it:

program_flash.sh

Reboot the board after firmware update to ensure the new version is properly loaded and initialized.

Known Issue: FRDM i.MX8M Plus freeze when updating ARA240 Firmware

On the FRDM i.MX 8M Plus with BSP 6.18.2_1.0.0, running program_flash.sh during a firmware update can freeze the board, causing the flashing process to fail.

Workaround:

Interrupt the boot at the U-Boot prompt.
Append pcie_aspm=off to mmcargs, then continue booting:

u-boot=> editenv mmcargs
edit: setenv bootargs ${jh_clk} ${mcore_clk} console=${console} root=${mmcroot} pcie_aspm=off
u-boot=> boot
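After the board boots, you can confirm that the kernel actually received the argument by inspecting /proc/cmdline. This is a minimal sketch; the helper name `has_kernel_arg` is hypothetical.

```python
def has_kernel_arg(cmdline: str, arg: str = "pcie_aspm=off") -> bool:
    """Return True if `arg` appears as a whole token on the kernel command line."""
    return arg in cmdline.split()


if __name__ == "__main__":
    # /proc/cmdline holds the arguments the running kernel was booted with.
    with open("/proc/cmdline") as f:
        print(has_kernel_arg(f.read()))
```

If this prints False, the edited mmcargs were probably not used for this boot.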

For more details on this or any other known issue, see the [Release Notes Document].

Step 7: Verify Service Status (optional)

To confirm that the installation succeeded and all components initialized properly, check the service status with the following commands.

To check that rt-sdk-ara2.service is running:

systemctl status rt-sdk-ara2.service --no-pager -l

To view detailed service logs:

journalctl -u rt-sdk-ara2.service

To verify the proxy is running:

ps -eaf | grep proxy_ara240
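The same check can be scripted, for example as a precondition in your own application. The sketch below relies on `systemctl is-active --quiet`, which exits with status 0 when the unit is active. The function name and the injectable `run` parameter are hypothetical. Note that if the unit is a oneshot type that exits after bringup, systemd may report it as inactive even though initialization succeeded.

```python
import subprocess
from typing import Callable, List


def _run(cmd: List[str]) -> int:
    """Run a command quietly and return its exit status."""
    return subprocess.run(cmd, stdout=subprocess.DEVNULL,
                          stderr=subprocess.DEVNULL).returncode


def service_is_active(unit: str = "rt-sdk-ara2.service",
                      run: Callable[[List[str]], int] = _run) -> bool:
    """Return True if systemd reports the unit as active."""
    return run(["systemctl", "is-active", "--quiet", unit]) == 0


if __name__ == "__main__":
    print(service_is_active())
```

The `run` parameter exists only so that the function can be exercised without systemd, for instance in unit tests on a host PC.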

🚀 Getting Started

Once installation is complete, you can start using the Ara240 NPU with the included helper scripts and APIs.

[!WARNING] At this point we recommend switching to an SSH connection for the best experience.

Download Sample Models

Download pre-compiled models for testing. Run the download script to get different flavours of YOLOv8 models.

fetch_models --repo-id nxp/YOLOv8

Quick Performance Test

Run a quick performance benchmark on downloaded models:

run_model_perf.sh

Follow the interactive prompts to select a model category and specific model. The script will display performance metrics including HW IPS (Hardware Inferences Per Second), which indicates the speed of inference execution on the Ara240 NPU.

🔧 Helper Scripts

The SDK includes several helper scripts to monitor, manage, and benchmark the Ara240 NPU. These scripts make it easy to check device status, update firmware, download models, and run performance tests.

ara2_metrics.sh - Monitor NPU Performance

Monitor real-time NPU metrics including utilization, temperature, DRAM usage, and device state:

ara2_metrics.sh

Interactive menu options:

1 - Print NPU utilization continuously
2 - Print device information
3 - Print DRAM information
4 - Print NPU temperature (one-time)
5 - Print NPU temperature continuously
6 - Print utilization, temperature, and DRAM continuously
7 - Print NPU state
0 - Exit

Use case: This tool is invaluable during benchmarking or model execution. You can correlate model performance with thermal behavior and utilization to identify bottlenecks. For example, if performance drops, you can verify whether the NPU is throttling due to temperature or experiencing memory pressure.

chip_info.sh - Device Summary

Get a comprehensive overview of the Ara240 NPU configuration and status:

chip_info.sh

Displays:

Chip ID and revision
Bus ID and interface type (PCIe)
System, NPU, and DDR frequencies
Device temperature and voltage
Power state
Firmware version (raw and display format)
DDR and flash information
Life cycle and chip part type

This script runs automatically twice during Ara240 boot-up and provides a quick health check of the device.

program_flash.sh - Firmware Update

Update the Ara240 NPU firmware to the required version:

program_flash.sh

Always reboot the board after running this script to ensure the new firmware is fully applied and the device state is clean.

fetch_models - Download Pre-compiled Models

Download pre-compiled CNN, LLM, and VLM models from the Hugging Face Hub for use with the Ara240 platform.

fetch_models --list

This command shows all supported models available to fetch from Hugging Face.

Supported models:

LLMs/VLMs

Qwen2.5-7B-Instruct (For Ara240)
Qwen2.5-Coder-1.5B (For Ara240)
Qwen2.5-VL-7B-Instruct (For Ara240)

YOLOv8

detection:
- yolov8n
- yolov8s
- yolov8m
- yolov8l
- yolov8x
pose:
- yolov8n-pose
- yolov8s-pose
- yolov8m-pose
- yolov8l-pose
- yolov8x-pose
segmentation:
- yolov8n-seg
- yolov8s-seg
- yolov8m-seg
- yolov8l-seg
- yolov8x-seg
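The YOLOv8 names above follow a regular pattern: one of five size letters, plus a task suffix. If you need to iterate over them in your own scripts, a sketch such as the following reproduces the list. The naming rule is inferred from the list itself, and `yolov8_models` is a hypothetical helper, not part of fetch_models.

```python
# Size letters and task suffixes as they appear in the list above.
SIZES = ("n", "s", "m", "l", "x")
TASK_SUFFIX = {"detection": "", "pose": "-pose", "segmentation": "-seg"}


def yolov8_models() -> dict:
    """Return a mapping from task category to the model names listed above."""
    return {task: [f"yolov8{size}{suffix}" for size in SIZES]
            for task, suffix in TASK_SUFFIX.items()}


if __name__ == "__main__":
    for task, names in yolov8_models().items():
        print(task, ", ".join(names))
```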

run_model_perf.sh - Performance Benchmarking

Run performance benchmarks on downloaded models:

[!WARNING] We strongly recommend using an SSH connection rather than console/UART/debug cable communication when running the benchmark. Console-based connections can introduce significant performance degradation due to the high volume of printed output, leading to unrealistic or misleading benchmark results.

run_model_perf.sh

Example output:

==========================
Available Model Categories
==========================
  1) detection
  2) pose
  3) segmentation
  q) Quit
Enter the number corresponding to the category you want to explore: 1
==========================
Available Models in detection
==========================
1. yolov8n

Key metric: Look for HW IPS (Hardware Inferences Per Second) in the results. This represents the maximum throughput the Ara240 NPU can achieve for the selected model under current conditions.

Note: IPS values may vary depending on thermal conditions, system load, interface type (PCIe), and whether the benchmark runs continuously or in bursts.

Customizing benchmark parameters:

By default, the script runs with:

Iterations: 1000
Batch size: 10

To modify these parameters, edit the script variables in run_model_perf.sh:

NUM_ITERATIONS="0.iterations=1000"  # Change to desired number of iterations
BATCH_SIZE="0.batch_size=10"        # Change to desired batch size
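If you change these values often, the edit can be scripted. The sketch below rewrites the two assignments in the script text and leaves everything else, including the trailing comments, untouched. It assumes the variables appear exactly in the form shown above; `set_benchmark_params` is a hypothetical helper.

```python
import re


def set_benchmark_params(script_text: str, iterations: int, batch_size: int) -> str:
    """Return script_text with new iteration and batch-size values substituted."""
    script_text = re.sub(
        r'^(\s*NUM_ITERATIONS="0\.iterations=)\d+(")',
        rf"\g<1>{iterations}\g<2>", script_text, flags=re.M)
    script_text = re.sub(
        r'^(\s*BATCH_SIZE="0\.batch_size=)\d+(")',
        rf"\g<1>{batch_size}\g<2>", script_text, flags=re.M)
    return script_text
```

A typical use is to read run_model_perf.sh, pass its contents through this function, and write the result to a copy, so that the original script stays available.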

Results location:

Performance logs are saved to:

/usr/share/rt-sdk-ara240/saved_logs/<category>_<model>_perf_log.txt

Device statistics are dumped to:

/usr/share/rt-sdk-ara240/saved_logs/device_stats/
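When post-processing results, the log file name can be built from the category and model you selected. This is a small sketch based on the path pattern above; `perf_log_path` is a hypothetical helper, and the internal format of the log is not described here, so no parsing is attempted.

```python
from pathlib import PurePosixPath

# Directory taken from the "Results location" paths above.
LOG_DIR = PurePosixPath("/usr/share/rt-sdk-ara240/saved_logs")


def perf_log_path(category: str, model: str) -> PurePosixPath:
    """Return the expected performance log path for a category/model pair."""
    return LOG_DIR / f"{category}_{model}_perf_log.txt"


if __name__ == "__main__":
    print(perf_log_path("detection", "yolov8n"))
```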

Service Management

The rt-sdk-ara2.service is enabled by default and handles:

Loading the uiodma.ko kernel driver
Performing hardware bringup
Launching the proxy daemon

Check service status:

systemctl status rt-sdk-ara2.service --no-pager -l

Restart the service:

systemctl restart rt-sdk-ara2.service

Disable automatic startup (not recommended):

systemctl disable rt-sdk-ara2.service

View detailed logs:

journalctl -u rt-sdk-ara2.service

📚 Python API and Optimum-Ara

DVAPI Python Bindings

The SDK includes Python bindings for the DVAPI (DeepVision API):

Location: /usr/share/rt-sdk-ara240_2.0.4/include/dvapi.py

Key classes and methods:

DVSession - Session management for Ara240 NPU
DVModel - Model loading and management
DVEndpoint - Endpoint (device) management
dv_infer_wait_for_completion() - Blocking inference execution
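Because dvapi.py is installed outside the default Python search path, your application has to add its directory before importing it. The sketch below shows one way to do that and then reports which of the names listed above are visible at module level. It deliberately calls no DVAPI functions, since their signatures are not documented in this README; `load_dvapi` is a hypothetical helper.

```python
import importlib
import sys

# Install location of dvapi.py as given above.
DVAPI_DIR = "/usr/share/rt-sdk-ara240_2.0.4/include"


def load_dvapi(search_dir: str = DVAPI_DIR):
    """Import and return the dvapi module, or None if it cannot be loaded."""
    if search_dir not in sys.path:
        sys.path.insert(0, search_dir)
    try:
        return importlib.import_module("dvapi")
    except (ImportError, OSError):
        # OSError covers the case where a native library fails to load.
        return None


if __name__ == "__main__":
    dvapi = load_dvapi()
    if dvapi is None:
        print("dvapi could not be imported")
    else:
        for name in ("DVSession", "DVModel", "DVEndpoint",
                     "dv_infer_wait_for_completion"):
            print(name, hasattr(dvapi, name))
```

Refer to dvapi.py itself and the C header dvapi.h for the actual call signatures.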

Optimum-Ara Framework

Optimum-Ara is a framework for running Large Language Models (LLMs) and Vision-Language Models (VLMs) on Ara240.

Location: /usr/share/rt-sdk-ara240_2.0.4/optimum-ara/

Repository: [Coming Soon]

License: Apache-2.0

Supported models:

LLMs: Qwen2.5-7B-Instruct, Qwen2.5-Coder-1.5B
VLMs: Qwen2.5-VL-7B-Instruct

🎥 GStreamer Plugins

The package includes custom GStreamer plugins optimized for zero-copy video inference pipelines with the Ara240 NPU.

Location: /usr/lib/gstreamer-1.0/

Repository: [Coming Soon]

License: LGPL-2.1-or-later

Available Plugins

libgstdvPre.so - Pre-processing plugin
- Handles image preprocessing (resizing, normalization, color conversion)
- Zero-copy buffer operations for optimal performance
libgstdvInf.so - Inference plugin
- Executes inference on Ara240 NPU
- Manages model loading and execution
libgstdvPost.so - Post-processing plugin
- Processes inference results
- Prepares output for visualization or further processing
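Before building a pipeline, it can be useful to confirm that all three plugin libraries are present. The sketch below only checks for the files named above in the plugin directory. It does not construct a pipeline, because the element names and properties are not listed in this README; `missing_plugins` is a hypothetical helper.

```python
from pathlib import Path

# Plugin file names as listed above.
PLUGINS = ("libgstdvPre.so", "libgstdvInf.so", "libgstdvPost.so")


def missing_plugins(plugin_dir: str = "/usr/lib/gstreamer-1.0") -> list:
    """Return the plugin files that are not found in plugin_dir."""
    directory = Path(plugin_dir)
    return [name for name in PLUGINS if not (directory / name).is_file()]


if __name__ == "__main__":
    missing = missing_plugins()
    print("all plugins present" if not missing else f"missing: {missing}")
```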

🚗 UIO DMA Driver

The uiodma.ko kernel driver enables high-speed PCIe communication between the i.MX host processor and the Ara240 NPU module.

Location: /usr/share/rt-sdk-ara240_2.0.4/driver/

Repository: https://github.com/nxp-imx-support/uiodma-driver

License: GPL-2.0-only

Driver Information

The driver is automatically loaded during system boot by the rt-sdk-ara2.service. You can verify it's loaded with:

lsmod | grep uiodma

To manually load the driver:

insmod /usr/share/rt-sdk-ara240_2.0.4/driver/uiodma.ko

To manually unload the driver:

rmmod uiodma

Note: The driver is essential for Ara240 NPU operation. Do not unload it while running inference workloads.
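An application can perform the same check as `lsmod | grep uiodma` by reading /proc/modules, which is the file lsmod itself formats. This is a minimal sketch; `module_loaded` is a hypothetical helper, and unlike grep it matches the module name exactly rather than as a substring.

```python
def module_loaded(proc_modules_text: str, name: str = "uiodma") -> bool:
    """Return True if a module called `name` is listed in /proc/modules text."""
    return any(line.split()[0] == name
               for line in proc_modules_text.splitlines() if line.strip())


if __name__ == "__main__":
    with open("/proc/modules") as f:
        print(module_loaded(f.read()))
```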

📦 Package Components

The Debian package installs components to the following locations:

Binaries and Scripts

/usr/share/rt-sdk-ara240_2.0.4/scripts/ - All utility scripts
/usr/share/rt-sdk-ara240_2.0.4/scripts/ara2_metrics_bin/ - Hardware metrics binary

Libraries

/usr/lib/libaraclient_aarch64.so - Ara client library
/usr/lib/libara_vision_inference.so* - Vision inference library
/usr/lib/gstreamer-1.0/ - GStreamer plugins for video inference pipelines

Headers and Python Modules

/usr/include/sdk_ara/ - C/C++ headers (dvapi.h, dv_status_codes.h, etc.)
/usr/share/rt-sdk-ara240_2.0.4/include/ - Python bindings (dvapi.py)

Runtime Artifacts

/usr/share/rt-sdk-ara240_2.0.4/optimum-ara/ - Optimum-Ara framework, examples, and documentation
/usr/share/rt-sdk-ara240_2.0.4/hw_utils/ - Hardware utilities and firmware
/usr/share/rt-sdk-ara240_2.0.4/proxy/ - Proxy daemon

System Configuration

/etc/systemd/system/rt-sdk-ara2.service - Systemd service
/etc/udev/rules.d/99-ara2.rules - Udev rules for NPU detection
/etc/rt-sdk-ara240/ - Configuration files (proxy_config.yaml, cnn_config.yaml)

Drivers

/usr/share/rt-sdk-ara240_2.0.4/driver/ - UIO DMA driver

📝 Release Notes

Version Information

Component	Version
Kinara SDK	r2.0.4
ara-client	r1.3.2.0
proxy	1.4.0.0
Debian package	v2.0.4
Firmware (raw)	131072
Firmware (display)	2.0.0

Related Projects

The following projects depend on this SDK:

Project	Repository	Version/Tag
eiq-aaf-connector	https://github.com/nxp-imx-support/eiq-aaf-connector	v2.0.0
ara2-vision-examples	https://github.com/nxp-imx-support/ara2-vision-examples	lf-6.18.2-1.0.0_Q1-2026
LLM-Edge-Studio	https://github.com/nxp-imx-support/llm-edge-studio	v2.0.0
VLM-Edge-Studio	https://github.com/nxp-imx-support/vlm-edge-studio	v1.0.0

Known Issues

The following are known issues for this release:

FRDM i.MX8M Plus freeze when updating ARA240 Firmware
FRDM i.MX 8MP and FRDM i.MX 95 Freeze after Firmware Upgrade
Model Load Stalls at ~95% in VLM-Edge-Studio and LLM-Edge-Studio
ara2_metrics script failure on FRDM i.MX 8MP and FRDM i.MX 95
Silent failures when downloading models from Hugging Face under poor network conditions

For more details on each known issue, please refer to the Release Notes Document.

⚖️ Licensing

This repository is licensed under the LA_OPT_NXP_Software_License.

Licensing and Copyright in Debian Package

Software Bill of Materials (SBOM): /usr/share/doc/rt-sdk-ara2/SBOM-rt-sdk-ara2-2.0.4.spdx.json
License files: /usr/share/doc/rt-sdk-ara2/LICENSE.txt

For support and additional information, please refer to the official NXP documentation.
