resubmission of successful jobs only works while Dagman is running #8089
Comments
Maybe it is "as simple" as this: if some jobs fail, there is a Rescue DAG which we can trigger into action, but it will only contain the failed nodes. And possibly the DAG stays in HOLD. Once the ... It would be nice to complement this with pointers to the code. Anyhow, I am not sure that there are really important use cases where successful jobs need to be resubmitted! The user report in https://cms-talk.web.cern.ch/t/crab-resubmit-force-not-working/32482/6 was something like "I can't believe that those jobs were successful and I want to try again". Nope!
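To make the Rescue DAG point concrete, here is a minimal toy sketch, assuming the only thing that matters is each node's final status. It is not DAGMan's or CRABServer's actual code; the function name `nodes_to_rerun`, the status strings, and the node names are all hypothetical. It only illustrates why a rescue run picks up failed nodes and leaves successful ones alone.

```python
from typing import Dict, List


def nodes_to_rerun(node_status: Dict[str, str]) -> List[str]:
    """Toy model: a rescue run skips nodes that already finished
    successfully and re-executes everything else.

    The status strings ("done", "failed") are illustrative assumptions,
    not DAGMan's internal representation.
    """
    return sorted(name for name, status in node_status.items() if status != "done")


# Hypothetical task with four job nodes, two of which failed.
status = {"Job1": "done", "Job2": "failed", "Job3": "done", "Job4": "failed"}
print(nodes_to_rerun(status))  # ['Job2', 'Job4']
```

In this toy model a successful node never appears in the rerun list, which matches the observation that resubmitting successful jobs needs some other mechanism while DAGMan is still running.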
In the meanwhile, here's a pointer to the DAGMan documentation and to the relevant code: CRABServer/src/python/TaskWorker/Actions/DagmanResubmitter.py, lines 97 to 113 in 39aef1a,
and to AdjustSites.py, which edits the DAGMan logs: CRABServer/scripts/AdjustSites.py, lines 458 to 486 in 39aef1a.
There's already a follow-up in dmwm/CRABClient#5273, where this was resolved by adding a warning.
Properly verify.
Why can failed jobs also be resubmitted later?
If possible, fix. Otherwise, document.
Connects to dmwm/CRABClient#5273