-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Retry on known flakey errors #211
Comments
The most frequent category of error I've seen in my brief time monitoring this stuff so far is
|
|
Network issues when fetching sources is another source of flakey failure. See https://github.com/tarides/infrastructure/issues/338#issuecomment-2229229672 This error happens during execution of opam. E.g.,
|
Notes from a discussion with @mtelvers today:
So our next step here is open an issue upstream to discuss and evaluate between those two options. |
Contributes to ocurrent/docker-base-images#211
Contributes to ocurrent/docker-base-images#211
Contributes to ocurrent/docker-base-images#211
Contributes to ocurrent/docker-base-images#211
Contributes to ocurrent/docker-base-images#211
Contributes to ocurrent/docker-base-images#211
Contributes to ocurrent/docker-base-images#211
The most frequent case of this we have been coping with has been solved, going by this week's builds, which, afaik, all completed without any need for restarts or intervention, save for the known issues on ocaml <4.08 for some distros. I'm going to let this fall back in the backlog then until we are troubled by new problems. |
Authentication errors due to networking issues or transient server-side problems are another class of failure that would benefit from retries (see https://github.com/tarides/infrastructure/issues/397).
|
|
Looks like a network error led to a failing git clone:
https://images.ci.ocaml.org/job/2024-10-23/174748-ocluster-build-bd1fff |
Base image builder regularly errors on this transient issue:
It would be useful to immediately retry on known flakey errors.
Prerequisite
Known flakey errors
Flakey errors on
docker-build
:dial tcp: lookup registry-1.docker.io: Temporary failure in name resolution
@ Retry on known flakey errors #211 (comment)failed to solve with frontend dockerfile.v0: failed to solve with frontend gateway.v0: frontend grpc server closed unexpectedly
@ Retry on known flakey errors #211 (comment)Source download failed for (.*): Curl failed
@ Retry on known flakey errors #211 (comment)Flakey errors on
docker-push
:error parsing HTTP 400 response body: invalid character '<' looking for beginning of value: "<html><body><h1>400 Bad request</h1>\nYour browser sent an invalid request.\n</body></html>\n\n"
@ Retry on known flakey errors #211 (comment)Flakey errors on docker authentication:
The text was updated successfully, but these errors were encountered: