Skip to content

Commit

Permalink
Updated docs for autoscaling on gpu. (#328)
Browse files Browse the repository at this point in the history
Signed-off-by: Andrews Arokiam <[email protected]>
  • Loading branch information
andyi2it authored Mar 24, 2024
1 parent 144d603 commit 2ba4317
Show file tree
Hide file tree
Showing 3 changed files with 8 additions and 2 deletions.
2 changes: 2 additions & 0 deletions docs/modelserving/autoscaling/autoscale-gpu-new.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,8 @@ metadata:
name: "flowers-sample-gpu"
spec:
predictor:
scaleTarget: 1
scaleMetric: concurrency
model:
modelFormat:
name: tensorflow
Expand Down
4 changes: 2 additions & 2 deletions docs/modelserving/autoscaling/autoscale-new.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -2,10 +2,10 @@ apiVersion: "serving.kserve.io/v1beta1"
kind: "InferenceService"
metadata:
name: "flowers-sample"
annotations:
autoscaling.knative.dev/target: "1"
spec:
predictor:
scaleTarget: 1
scaleMetric: concurrency
model:
modelFormat:
name: tensorflow
Expand Down
4 changes: 4 additions & 0 deletions docs/modelserving/autoscaling/autoscaling.md
Original file line number Diff line number Diff line change
Expand Up @@ -248,6 +248,8 @@ Apply the tensorflow gpu example CR
name: "flowers-sample-gpu"
spec:
predictor:
scaleTarget: 1
scaleMetric: concurrency
model:
modelFormat:
name: tensorflow
Expand All @@ -265,6 +267,8 @@ Apply the tensorflow gpu example CR
kind: "InferenceService"
metadata:
name: "flowers-sample-gpu"
annotations:
autoscaling.knative.dev/target: "1"
spec:
predictor:
tensorflow:
Expand Down

0 comments on commit 2ba4317

Please sign in to comment.