Serving models on the single-model serving platform

Monitoring model performance

On the single-model serving platform, you can view performance metrics for a specific deployed model.

Optimizing model-serving runtimes

You can optionally enhance the preinstalled model-serving runtimes available in {productname-short} to gain capabilities such as optimized inferencing, reduced latency, and fine-tuned resource allocation.