serving-large-models.adoc

File metadata and controls

65 lines (48 loc) · 3.59 KB

Serving large models

Monitoring model performance

On the single-model serving platform, you can view performance metrics for a specific deployed model.

Optimizing model-serving runtimes

You can optionally customize the preinstalled model-serving runtimes available in {productname-short} to gain additional capabilities, such as optimized inferencing, reduced latency, and fine-tuned resource allocation.