ci: increase timeout for k8s intg tests #9929

rb-determined-ai · 2024-09-12T21:37:41Z

The latest versions of k8s have a new Job condition called
JobFailureTarget which is a signal to the jobs controller to kill off
the pods of the job.

Our jobUpdatedCallback() function waits for the JobFailure condition,
which now comes after the pod is fully terminated, which takes a lot
longer.

Probably we need to make sure our k8s logic is still valid, and deal
with the additional time it takes a job to reach JobFailed, if it
affects anything other than this test.

Until then, let's unblock CI for the whole team by just increasing the
test time for TestExternalPodDelete and TestNodeWorkflows.

netlify · 2024-09-12T21:37:56Z

✅ Deploy Preview for determined-ui ready!

Name	Link
🔨 Latest commit	`282d891`
🔍 Latest deploy log	https://app.netlify.com/sites/determined-ui/deploys/66e4ab16f48f9d0008ba989b
😎 Deploy Preview	https://deploy-preview-9929--determined-ui.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

codecov · 2024-09-12T21:39:03Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 54.52%. Comparing base (867eb31) to head (282d891).
Report is 2 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #9929      +/-   ##
==========================================
- Coverage   59.18%   54.52%   -4.67%     
==========================================
  Files         751     1252     +501     
  Lines      104462   156550   +52088     
  Branches     3598     3599       +1     
==========================================
+ Hits        61824    85354   +23530     
- Misses      42506    71064   +28558     
  Partials      132      132

Flag	Coverage Δ
backend	`45.12% <ø> (+1.32%)`	⬆️
harness	`72.75% <ø> (ø)`
web	`54.33% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

see 501 files with indirect coverage changes

amandavialva01

LGTM!
We should probably ticket this though,so we can devise a more permanent solution to the k8s API changes.

carolinaecalderon

LGTM

The latest versions of k8s have a new Job condition called JobFailureTarget which is a signal to the jobs controller to kill off the pods of the job. Our jobUpdatedCallback() function waits for the JobFailure condition, which now comes after the pod is fully terminated, which takes a lot longer. Probably we need to make sure our k8s logic is still valid, and deal with the additional time it takes a job to reach JobFailed, if it affects anything other than this test. Until then, let's unblock CI for the whole team by just increasing the test time for TestExternalPodDelete and TestNodeWorkflows.

rb-determined-ai requested a review from a team as a code owner September 12, 2024 21:37

rb-determined-ai requested a review from carolinaecalderon September 12, 2024 21:37

cla-bot bot added the cla-signed label Sep 12, 2024

amandavialva01 approved these changes Sep 12, 2024

View reviewed changes

rb-determined-ai force-pushed the rb/new-k8s branch from 910c942 to e746c06 Compare September 12, 2024 21:56

rb-determined-ai changed the title ~~ci: increase timeout for TestExternalPodDelete~~ ci: increase timeout for k8s intg tests Sep 12, 2024

rb-determined-ai force-pushed the rb/new-k8s branch from e746c06 to f3fd024 Compare September 12, 2024 22:08

kkunapuli approved these changes Sep 12, 2024

View reviewed changes

carolinaecalderon approved these changes Sep 13, 2024

View reviewed changes

rb-determined-ai force-pushed the rb/new-k8s branch from f3fd024 to 282d891 Compare September 13, 2024 21:13

rb-determined-ai merged commit 13b7b3f into main Sep 13, 2024
82 of 94 checks passed

rb-determined-ai deleted the rb/new-k8s branch September 13, 2024 22:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: increase timeout for k8s intg tests #9929

ci: increase timeout for k8s intg tests #9929

rb-determined-ai commented Sep 12, 2024 •

edited

Loading

netlify bot commented Sep 12, 2024 •

edited

Loading

codecov bot commented Sep 12, 2024 •

edited

Loading

amandavialva01 left a comment

carolinaecalderon left a comment

ci: increase timeout for k8s intg tests #9929

ci: increase timeout for k8s intg tests #9929

Conversation

rb-determined-ai commented Sep 12, 2024 • edited Loading

netlify bot commented Sep 12, 2024 • edited Loading

✅ Deploy Preview for determined-ui ready!

codecov bot commented Sep 12, 2024 • edited Loading

Codecov Report

amandavialva01 left a comment

Choose a reason for hiding this comment

carolinaecalderon left a comment

Choose a reason for hiding this comment

rb-determined-ai commented Sep 12, 2024 •

edited

Loading

netlify bot commented Sep 12, 2024 •

edited

Loading

codecov bot commented Sep 12, 2024 •

edited

Loading