Add ability to provide a soft timeout-minutes #953
Comments
Hi @awoimbee, thanks for opening this issue. I think this problem in particular is difficult because there are many places where timeouts can occur, e.g. at the level of the kubelet, at the level of the HTTP request from the provider, from the Pulumi engine to the provider, and for the Pulumi program itself. I'm reading your feature request as a timeout for the Pulumi program itself. I have a few thoughts; please let me know if they're helpful for you.
IMO adding a timeout for the engine is rather dangerous for hard timeouts, and not necessarily useful for soft timeouts. Suppose a program creates a Kubernetes cluster on AWS; this operation can take 10-15 minutes. For hard cancellations, if the engine cancels this operation while the provider is midway through performing it, then the statefile will have a false negative: the infrastructure will be created, but the statefile will report that it failed, and the next run will try to create it again. For soft cancellations like you suggested, the engine will have to wait for the provider to finish ongoing operations anyway, so it will have to wait for the full 15 minutes regardless of the timeout. Therefore, I'm not sure how helpful a
Putting my operator hat on, I think if I were in your shoes, I'd try to detect image pull failures sooner by decreasing the timeout at the kubelet level (adjusting
Hi, thanks for the detailed response!
Yes indeed, keep it simple!
These are for slow requests, so they won't work in my case.
Yes, and it's not necessary as we already have the
Yes, my problem is definitely k8s related, and a change to the Pulumi k8s provider might also be able to solve it.
Hello!
Issue details
I use pulumi-kubernetes, and pulumi can get stuck for a while waiting for a deployment that can't work (e.g. a missing container image). A normal workflow run takes ~3 mins max; a broken one takes >20 mins.
I want to put a limit on the time spent on pulumi. I use timeout-minutes, but it leads to a broken pulumi state (because of hard termination).
-> Please add a way to set a soft limit, where pulumi will cancel all operations and terminate gracefully.
Affected area/feature
The action itself, it could just send a SIGINT.
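To illustrate the request, here is a minimal sketch of what a soft limit could look like. It is not how the action works today. It assumes a POSIX runner and assumes the wrapped command treats SIGINT as a request to cancel gracefully. The function name `run_with_soft_timeout` and its parameters are hypothetical, chosen only for this example.

```python
import signal
import subprocess


def run_with_soft_timeout(cmd, soft_timeout_s, grace_s):
    """Run cmd with a soft time limit (hypothetical helper, POSIX only).

    Returns (exit_code, outcome), where outcome is one of
    "completed", "interrupted", or "killed".
    """
    proc = subprocess.Popen(cmd)

    # Phase 1: let the command run normally until the soft limit.
    try:
        return proc.wait(timeout=soft_timeout_s), "completed"
    except subprocess.TimeoutExpired:
        pass

    # Phase 2: soft limit reached. Ask the process to stop with SIGINT
    # (assumption: the command cancels its operations on SIGINT).
    proc.send_signal(signal.SIGINT)
    try:
        return proc.wait(timeout=grace_s), "interrupted"
    except subprocess.TimeoutExpired:
        pass

    # Phase 3: the grace period also expired. Fall back to a hard kill,
    # which has the same drawback as the existing hard termination.
    proc.kill()
    return proc.wait(), "killed"
```

For example, `run_with_soft_timeout(["pulumi", "up", "--yes"], 5 * 60, 2 * 60)` would send SIGINT after 5 minutes and only kill the process if it is still running 2 minutes later. The grace period matters because a graceful cancel may still have to wait for in-flight operations to finish.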