Skip to content

TorchServe v0.1.0

Compare
Choose a tag to compare
@mycpuorg mycpuorg released this 21 Apr 17:58
· 3386 commits to master since this release
f6c2d2c

TorchServe (Experimental) v0.1.0 Release Notes

This is the first release of TorchServe (Experimental), a new open-source model serving framework under the PyTorch project (RFC #27610).

Highlights

  • Clean APIs - Support for an Inference API for predictions and a Management API for managing the model server.

  • Secure Deployment - Includes HTTPS support for secure deployment.

  • Robust model management capabilities - Allows full configuration of models, versions, and individual worker threads via command line interface, config file, or run-time API.

  • Model archival - Provides tooling to perform a ‘model archive’, a process of packaging a model, parameters, and supporting files into a single, persistent artifact. Using a simple command-line interface, you can package and export in a single ‘.mar’ file that contains everything you need for serving a PyTorch model. This `.mar’ file can be shared and reused. Learn more here.

  • Built-in model handlers - Support for model handlers covering the most common use-cases (image classification, object detection, text classification, image segmentation). TorchServe also supports custom handlers

  • Logging and Metrics - Support for robust logging and real-time metrics to monitor inference service and endpoints, performance, resource utilization, and errors. You can also generate custom logs and define custom metrics.

  • Model Management - Support for management of multiple models or multiple versions of the same model at the same time. You can use model versions to roll back to earlier versions or route traffic to different versions for A/B testing.

  • Prebuilt Images - Ready to go Dockerfiles and Docker images for deploying TorchServe on CPU and NVIDIA GPU based environments. The latest Dockerfiles and images can be found here.

Platform Support

      - Ubuntu 16.04, Ubuntu 18.04, MacOS 10.14+

Known Issues

  • The default object detection handler only works on cuda:0 device on GPU machines #104
  • For torchtext based models, the sentencepiece dependency fails for MacOS with python 3.8 #232

Getting Started with TorchServe

  • Additionally, you can get started at pytorch.org/serve with installation instructions, tutorials and docs.
  • Lastly, if you have questions, please drop it into the PyTorch discussion forums using the ‘deployment’ tag or file an issue on GitHub with a way to reproduce.