Skip to content

mingxin-zheng/tritonserver-che-master

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Chest X-Ray Classification Triton Server

This repository demonstrates how to serve a chest X-ray classification model from HuggingFace using NVIDIA Triton Inference Server.

Prerequisites

Quick Start

1. Build the Docker Container

docker build -t chest-xray-triton .

2. Run the Docker Container

docker run -d --gpus "device=0" -p 8000:8000 8001:8001 -p 8002:8002 chest-xray-triton tritonserver --model-repository=/models

3. Test the Model

# You may need to create a new virtual environment
pip install -r requirements.txt
python client.py

4. Test in other languages

You can also create a client in other languages (JAVA, C++, etc). Please refer to the this guide for more information.

5. Scaling the server to enable multiple instances on a single GPU

Please refer to the NVIDIA Triton Inference Server documentation

You may change the count in the instance group section of the config.pbtxt file to enable multiple instances on a single GPU.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors