From f8e777dfb9deaf1821db0766475638d2b8c9b8f1 Mon Sep 17 00:00:00 2001 From: JuHyung Son Date: Mon, 27 Nov 2023 00:14:25 +0900 Subject: [PATCH] clarify prometheus annotation (#316) Signed-off-by: JuHyung-Son --- docs/modelserving/observability/prometheus_metrics.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/docs/modelserving/observability/prometheus_metrics.md b/docs/modelserving/observability/prometheus_metrics.md index 4f2180962..d7f5c3892 100644 --- a/docs/modelserving/observability/prometheus_metrics.md +++ b/docs/modelserving/observability/prometheus_metrics.md @@ -35,6 +35,9 @@ The default values for `serving.kserve.io/enable-prometheus-scraping` can be set There is not currently a unified set of metrics exported by the model servers. Each model server may implement its own set of metrics to export. +!!! note + This annotation defines the prometheus port and path, but it does not trigger the prometheus to scrape. Users must configure prometheus to scrape data from inference service's pod according to the prometheus settings. + ## Metrics for lgbserver, paddleserver, pmmlserver, sklearnserver, xgbserver, custom transformer/predictor Prometheus latency histograms are emitted for each of the steps (pre/postprocessing, explain, predict).