Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

changed two AOAI alerts to visible with the proper thresholds #395

Merged
merged 3 commits into from
Oct 31, 2024
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
26 changes: 13 additions & 13 deletions services/CognitiveServices/accounts/alerts.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -308,19 +308,19 @@
- name: AzureOpenAIContextTokensCacheMatchRate
description: Percentage of the prompt tokens hit the cache, avaiable for PTU-managed.
type: Metric
verified: false
visible: false
verified: true
visible: true
tags: manual-ck
properties:
metricName: AzureOpenAIContextTokensCacheMatchRate
metricNamespace: Microsoft.CognitiveServices/accounts
severity: 0
severity: 2
windowSize: PT5M
evaluationFrequency: PT1M
timeAggregation: Total
operator: GreaterThan
criterionType: StaticThresholdCriterion
threshold: 0.0
threshold: 75
guid: 81f8369c-65bf-4194-bfd2-ffdfa2470577
- name: AzureOpenAIProvisionedManagedUtilization
description: Utilization % for a provisoned-managed deployment, calculated as (PTUs consumed / PTUs deployed) x 100.
Expand All @@ -342,19 +342,19 @@
- name: AzureOpenAIProvisionedManagedUtilizationV2
description: Utilization % for a provisoned-managed deployment, calculated as (PTUs consumed / PTUs deployed) x 100.
type: Metric
verified: false
visible: false
verified: true
visible: true
tags: manual-ck
properties:
metricName: AzureOpenAIProvisionedManagedUtilizationV2
metricNamespace: Microsoft.CognitiveServices/accounts
severity: 0
severity: 2
windowSize: PT5M
evaluationFrequency: PT1M
timeAggregation: Total
operator: GreaterThan
criterionType: StaticThresholdCriterion
threshold: 0.0
threshold: 80
guid: 693a3b37-1e2a-42d1-aaed-b1f374276d1c
- name: AzureOpenAIRequests
description: Number of calls made to the Azure OpenAI API over a period of time.
Expand All @@ -374,21 +374,21 @@
threshold: 0.0
guid: a1528d17-f288-46b1-b084-8b8fe3af90fa
- name: AzureOpenAITimeToResponse
description: Recommended latency (responsiveness) measure for streaming requests.
description: Recommended latency (responsiveness) measure for streaming requests. Time in milliseconds.
type: Metric
verified: false
visible: false
verified: true
visible: true
tags: manual-ck
properties:
metricName: AzureOpenAITimeToResponse
metricNamespace: Microsoft.CognitiveServices/accounts
severity: 0
severity: 2
windowSize: PT5M
evaluationFrequency: PT1M
timeAggregation: Total
operator: GreaterThan
criterionType: StaticThresholdCriterion
threshold: 0.0
threshold: 200
guid: 995cc12a-1887-4669-92c5-70a6ca8bfe70
- name: BaselineEstimatorOverallReward
description: Baseline Estimator Overall Reward.
Expand Down