LLM Inference Engine

This repository contains the source code and the config files for LLM Inference Engine which is based on llama.cpp and built to be run on AMD Strix Halo Hardware with Vulkan backend.

Components

Base Container Image: I am planning to run the inference engine inside of a container, so the base container is built with all the necessary dependencies such as AMD drivers and Vulkan SDK.
LLM Inference Engine: The main application that utilizes llama.cpp to perform inference. (in progress)
Monitoring Service: Some sort of metrics collection tool to send data to LangFuse or Grafana. (in progress)

Build Image

Manual Build

docker build -f base.dockerfile -t llm-inference-base .

Automated Build and Push to GitHub Container Registry

The base Docker image is automatically built and pushed to GitHub Container Registry (GHCR) when you create a tag on the main branch.

Creating a Release Tag

# Create and push a tag (with v prefix)
git tag v1.0.0
git push origin v1.0.0

# OR create a tag without v prefix
git tag 1.0.0
git push origin 1.0.0

This will trigger the GitHub Actions workflow that:

Builds the base Docker image
Pushes it to ghcr.io/avikantsrivastava/llm-inference-engine/base with the tag name (e.g., v1.0.0 or 1.0.0)
Also tags it as latest if on the default branch

Manual Workflow Trigger

You can also manually trigger the workflow from the GitHub Actions tab:

Go to Actions → "Build and Push Base Docker Image"
Click "Run workflow"
Optionally specify a custom tag name (defaults to latest)

Pull the Image

# Pull the latest image
docker pull ghcr.io/avikantsrivastava/llm-inference-engine/base:latest

# Pull a specific version (with v prefix)
docker pull ghcr.io/avikantsrivastava/llm-inference-engine/base:v1.0.0

# Pull a specific version (without v prefix)
docker pull ghcr.io/avikantsrivastava/llm-inference-engine/base:1.0.0

Note: The image is published as a public package and can be pulled without authentication.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
README.md		README.md
base.dockerfile		base.dockerfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Inference Engine

Components

Build Image

Manual Build

Automated Build and Push to GitHub Container Registry

Creating a Release Tag

Manual Workflow Trigger

Pull the Image

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM Inference Engine

Components

Build Image

Manual Build

Automated Build and Push to GitHub Container Registry

Creating a Release Tag

Manual Workflow Trigger

Pull the Image

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages