This project is under active development. There are no releases yet, and the interface and feature set may change frequently.
This project combines StarPU and LibTorch to efficiently schedule deep learning inference tasks across the CPUs and GPUs of a compute node. The main goal is to maximize throughput while keeping latency under control, by leveraging asynchronous, heterogeneous execution.
- Perform inference of TorchScript models (e.g., ResNet, BERT) using LibTorch.
- Dynamically schedule inference tasks between CPU and GPU using StarPU (a sketch follows this list).
- Optimize throughput while satisfying latency constraints.
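At runtime, each inference request can be wrapped in a StarPU task whose codelet carries both a CPU and a CUDA implementation, so StarPU's scheduler decides per task where it runs. Below is a minimal sketch of that pattern, not this project's actual code: the `model.pt` path, the input shape, and the single global module per device are illustrative assumptions (a real server would manage model replicas and thread safety).

```cpp
#include <starpu.h>
#include <torch/script.h>

// Illustrative simplification: one module per device, loaded once at startup.
static torch::jit::script::Module cpu_model;
static torch::jit::script::Module gpu_model;

// CPU implementation of the codelet: run the TorchScript forward pass on a CPU worker.
static void infer_cpu(void *buffers[], void *cl_arg) {
    (void)buffers; (void)cl_arg;
    torch::NoGradGuard no_grad;
    auto input = torch::randn({1, 3, 224, 224});          // placeholder input
    cpu_model.forward({input});
}

// CUDA implementation: same model, input moved to the GPU StarPU assigned.
static void infer_cuda(void *buffers[], void *cl_arg) {
    (void)buffers; (void)cl_arg;
    torch::NoGradGuard no_grad;
    auto input = torch::randn({1, 3, 224, 224}).to(torch::kCUDA);
    gpu_model.forward({input});
}

// A codelet exposing both implementations is what lets StarPU pick
// CPU or GPU per task at runtime.
static struct starpu_codelet inference_cl;

int main() {
    if (starpu_init(nullptr) != 0) return 1;

    inference_cl.cpu_funcs[0]  = infer_cpu;
    inference_cl.cuda_funcs[0] = infer_cuda;
    inference_cl.nbuffers      = 0;  // real code would pass tensors via StarPU data handles

    cpu_model = torch::jit::load("model.pt");             // assumed model path
    gpu_model = torch::jit::load("model.pt");
    gpu_model.to(torch::kCUDA);

    for (int i = 0; i < 16; ++i) {                        // pretend 16 requests arrived
        struct starpu_task *task = starpu_task_create();
        task->cl = &inference_cl;
        starpu_task_submit(task);                         // asynchronous: returns immediately
    }
    starpu_task_wait_for_all();
    starpu_shutdown();
    return 0;
}
```

Submission is asynchronous, which is what allows many requests to be in flight at once (throughput) while the scheduler's placement decisions bound how long any single request waits (latency).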
See the installation guide for setup instructions, including dependency lists and native build steps. See the Docker guide for image build commands and execution.
Follow the Quickstart guide to:
- Build the gRPC inference server.
- Export the `bert-base-uncased` TorchScript model.
- Launch the server with the provided configuration (a minimal load-and-forward sketch follows this list).
- Drive it with the Python gRPC client or write your own.
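Once the TorchScript file exists, what the server does per request reduces to a LibTorch load-and-forward. Here is a standalone sketch of that step, with assumptions flagged: the `bert-base-uncased.pt` file name, the sequence length, and the all-ones token ids are illustrative, and the tuple-shaped output assumes the model was exported via the usual HuggingFace TorchScript tracing path.

```cpp
#include <torch/script.h>
#include <iostream>

int main() {
    // Load the TorchScript module produced by the export step
    // (file name is an assumption, not the project's configured path).
    torch::jit::script::Module model = torch::jit::load("bert-base-uncased.pt");
    model.eval();

    // BERT expects integer token ids plus an attention mask of the same shape.
    // These ids are dummies; a real request carries tokenizer output.
    const int64_t seq_len = 16;
    auto input_ids      = torch::ones({1, seq_len}, torch::kInt64);
    auto attention_mask = torch::ones({1, seq_len}, torch::kInt64);

    torch::NoGradGuard no_grad;
    auto output = model.forward({input_ids, attention_mask});

    // Traced HuggingFace models typically return a tuple whose first element
    // is the last hidden state, shaped [batch, seq_len, hidden].
    auto hidden = output.toTuple()->elements()[0].toTensor();
    std::cout << hidden.sizes() << std::endl;
    return 0;
}
```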
The documentation index lives in the docs folder.