Add support for TGI metrics for vLLM #1513

mwaykole · 2024-06-07T16:21:59Z

No description provided.

Signed-off-by: Milind Waykole <[email protected]>

sonarcloud · 2024-06-07T16:24:38Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
No data about Duplication

See analysis details on SonarCloud

github-actions · 2024-06-07T16:25:44Z

Robot Results

✅ Passed	❌ Failed	⏭️ Skipped	Total	Pass %
477	0	0	477	100

mwaykole · 2024-06-08T13:14:52Z

lugi0

The test currently does not validate the values of the TGI metrics. I would also suggest separating the testing of TGI and vLLM metrics into two different TCs for better visibility into their behaviour.

lugi0 · 2024-06-11T09:24:30Z

.../Tests/400__ods_dashboard/420__model_serving/LLMs/vllm/426__model_serving_vllm_metrics.robot

@@ -89,7 +103,7 @@ Verify Vllm Metrics Are Present
    Set Suite Variable    ${token}
    Metrics Should Exist In UserWorkloadMonitoring    ${thanos_url}    ${token}    ${SEARCH_METRICS}

-Verify Vllm Metrics Values Match Between UWM And Endpoint
+Verify Vllm And tgi Metrics Values Match Between UWM And Endpoint


This does not work: Line 98/112 selects the metrics to test by keeping only those whose names start with vllm:, so TGI metrics are not being tested at all here.
You will need to either change the logic of the Get Vllm Metrics And Values keyword in Helpers.py or develop an equivalent keyword for TGI metrics.
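One possible shape for the first option is a prefix-parameterised parser, so the same helper serves both runtimes. This is only a sketch: the function name get_metrics_and_values, its signature, and the sample payload are hypothetical and are not taken from Helpers.py, and the tgi_ prefix is an assumption about how the TGI metrics are named on the endpoint.

```python
import re
from typing import Dict, Iterable

# One Prometheus text-format sample: metric name, optional {labels}, value.
_SAMPLE_RE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(\{.*\})?\s+(\S+)')


def get_metrics_and_values(text: str, prefixes: Iterable[str]) -> Dict[str, str]:
    """Return {series: value} for samples whose metric name starts with a prefix.

    Hypothetical helper: the prefixes are passed in by the caller instead of
    being hard-coded to "vllm:".
    """
    wanted = tuple(prefixes)
    found: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = _SAMPLE_RE.match(line)
        if match and match.group(1).startswith(wanted):
            series = match.group(1) + (match.group(2) or "")
            found[series] = match.group(3)
    return found


# Illustrative payload only; real endpoints expose many more series.
SAMPLE = """\
# HELP vllm:num_requests_running Number of requests currently running.
vllm:num_requests_running{model_name="gpt2"} 1.0
tgi_request_count 5
process_cpu_seconds_total 12.5
"""

vllm_metrics = get_metrics_and_values(SAMPLE, ["vllm:"])
tgi_metrics = get_metrics_and_values(SAMPLE, ["tgi_"])
```

Keeping the prefix as an argument would let a vLLM test case and a TGI test case call the same keyword with different inputs, rather than duplicating the parsing logic.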

lugi0 · 2024-06-11T09:25:42Z

.../Tests/400__ods_dashboard/420__model_serving/LLMs/vllm/426__model_serving_vllm_metrics.robot

-Verify User Can Deploy A Model With Vllm Via CLI
+Verify User Can Deploy A Model With Vllm And tgi Via CLI
    [Documentation]    Deploy a model (gpt2) using the vllm runtime and confirm that it's running
    [Tags]    Tier1    Sanity    Resources-GPU    RHOAIENG-6264   VLLM
    ${rc}    ${out}=    Run And Return Rc And Output    oc apply -f ${DL_POD_FILEPATH}


This test does not use TGI to deploy the model; it uses vLLM. Please revert the name of this TC.

lugi0 · 2024-06-11T09:26:49Z

.../Tests/400__ods_dashboard/420__model_serving/LLMs/vllm/426__model_serving_vllm_metrics.robot

+Verify Vllm And tgi Metrics Are Present
+    [Documentation]    Confirm vLLM and tgi metrics are exposed in OpenShift metrics


IMHO you should create a separate test to validate that TGI metrics are present - as far as I understand you've already encountered the issue where only vLLM ones would get exposed, so it'd be better to separate the two and have clear reporting on which ones (if any) are failing to get exposed.
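To illustrate the kind of reporting meant here, a presence check could group the expected metrics by runtime and return what is missing for each group. This is a rough sketch, not the suite's implementation: the function name missing_by_runtime and the metric lists are illustrative assumptions.

```python
from typing import Dict, Iterable, List


def missing_by_runtime(
    expected: Dict[str, List[str]], exposed: Iterable[str]
) -> Dict[str, List[str]]:
    """Return, per runtime, the expected metric names that are not exposed.

    Runtimes with nothing missing are left out, so an empty result means
    every expected metric was found.
    """
    exposed_set = set(exposed)
    report: Dict[str, List[str]] = {}
    for runtime, names in expected.items():
        missing = [name for name in names if name not in exposed_set]
        if missing:
            report[runtime] = missing
    return report


# Illustrative metric names only; the real lists live in the test suite.
EXPECTED = {
    "vllm": ["vllm:num_requests_running", "vllm:num_requests_waiting"],
    "tgi": ["tgi_request_count", "tgi_queue_size"],
}
EXPOSED = ["vllm:num_requests_running", "vllm:num_requests_waiting"]

report = missing_by_runtime(EXPECTED, EXPOSED)
```

With the example inputs above only the TGI group is reported, which is the case described in the comment: vLLM metrics are exposed while the TGI ones are not.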

Add support for tgi metricesfor vllm

5f0c8e2

mwaykole requested a review from lugi0 June 7, 2024 16:22

mwaykole self-assigned this Jun 7, 2024

Add support for tgi metricesfor vllm

744f9b8

Signed-off-by: Milind Waykole <[email protected]>

mwaykole added the verified This PR has been tested with Jenkins label Jun 8, 2024

mwaykole requested a review from tarukumar June 10, 2024 11:56

lugi0 requested changes Jun 11, 2024


mwaykole added do not merge Do not merge this yet please and removed verified This PR has been tested with Jenkins labels Jun 11, 2024

mwaykole closed this Jun 18, 2024

mwaykole reopened this Jun 18, 2024

mwaykole Jun 18, 2024

		Verify Vllm And tgi Metrics Are Present
		[Documentation] Confirm vLLM and tgi metrics are exposed in OpenShift metrics
