Bulk drone submission / operations #222
Replies: 2 comments 5 replies
-
I agree that supporting bulk submission or array jobs is a very good idea. As you mentioned, the I could imagine to queue "commands" and execute them in one go as soon as
This would indeed improve the scalability and reduce the load on the batch system as well.
-
To document the internal workflow and where we can hook in: Right now TARDIS uses COBalD's At the lowest level, we can hook into the At a higher level we can modify the
-
The HTCondor batch system [1] supports bulk operations, and bulk submission in particular. Using these from TARDIS could greatly improve scalability.
Bulk operations, e.g. removing two jobs at once, remove some of the overhead of invoking the API and of concurrency protection. Bulk submission, e.g. submitting two identical jobs at once, allows the batch system to handle jobs more efficiently.
From a programming standpoint, it is easy enough to retrofit bulk operations onto individual operations. For example, instead of directly running `condor_rm 123.0` and `condor_rm 321.0`, we can buffer the Job IDs and run a single `condor_rm 123.0 321.0` after a moment. However, we should discuss whether any higher-level routines need changes to properly deal with the concurrency and delays introduced by such changes.
Using bulk submission introduces some additional complexity: we need to separate common from individualised job features. Preliminary discussion has identified the `TardisDroneUuid` as a critical, individual job feature. We should discuss whether it is appropriate to use HTCondor features like the `queue from` command, or whether there are other ways of making drones less individualised.
I assume these would not be either-or changes. There is likely some low-hanging fruit, like aggregating deletions, that can be added separately and sooner than other changes. Similarly, there are probably different levels of streamlining job submission.
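To make the deletion aggregation concrete, here is a minimal sketch of buffering Job IDs and issuing one `condor_rm` for the whole batch. It is not existing TARDIS code: the `BulkRemover` class, the `runner` parameter and the `delay` value are all hypothetical names chosen for illustration, and it assumes callers are `asyncio` coroutines that may simply wait until their batch has run.

```python
import asyncio
from typing import Awaitable, Callable, List, Optional, Sequence


async def run_condor_rm(job_ids: Sequence[str]) -> None:
    """Invoke a single ``condor_rm`` for all given Job IDs."""
    process = await asyncio.create_subprocess_exec("condor_rm", *job_ids)
    await process.wait()


class BulkRemover:
    """Hypothetical helper: buffer Job IDs briefly, then remove them together."""

    def __init__(
        self,
        runner: Callable[[Sequence[str]], Awaitable[None]] = run_condor_rm,
        delay: float = 1.0,
    ) -> None:
        self._runner = runner
        self._delay = delay  # how long to collect Job IDs before flushing
        self._pending: List[str] = []
        self._flush: Optional["asyncio.Future[None]"] = None

    async def remove(self, job_id: str) -> None:
        """Queue ``job_id`` and return once its batch has been executed."""
        self._pending.append(job_id)
        if self._flush is None:
            # First request of a new batch: schedule the delayed bulk call.
            self._flush = asyncio.ensure_future(self._flush_later())
        # shield: cancelling one caller must not cancel the shared batch.
        await asyncio.shield(self._flush)

    async def _flush_later(self) -> None:
        await asyncio.sleep(self._delay)
        # Detach the batch before awaiting, so later requests start a new one.
        job_ids, self._pending, self._flush = self._pending, [], None
        await self._runner(job_ids)
```

With this shape, two drones calling `await remover.remove("123.0")` and `await remover.remove("321.0")` within the delay window end up in one `condor_rm 123.0 321.0`. Each caller still sees its own removal complete (or fail), which is the kind of added delay and shared failure that higher-level routines would need to tolerate.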
Feel encouraged to discuss individual changes as well as "not perfect" changes if there's likely good benefit for reasonable effort.
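For the submission side, the following sketch shows how common and individual job features could be separated using HTCondor's `queue ... from` with an inline item list. It is only an illustration under assumptions: the function name `bulk_submit_description` is made up, passing the UUID to the job through `environment` is one possible choice rather than what the site adapter necessarily does, and `common` is assumed not to set `environment` itself.

```python
from typing import Iterable, Mapping


def bulk_submit_description(
    common: Mapping[str, str], drone_uuids: Iterable[str]
) -> str:
    """Build one submit description that queues one job per drone UUID."""
    # Features shared by all drones of this batch.
    lines = [f"{key} = {value}" for key, value in common.items()]
    # The only per-job feature; exposed via the environment (assumption).
    lines.append('environment = "TardisDroneUuid=$(TardisDroneUuid)"')
    # One item per line; each item becomes one job of the same cluster.
    lines.append("queue TardisDroneUuid from (")
    lines.extend(f"  {uuid}" for uuid in drone_uuids)
    lines.append(")")
    return "\n".join(lines) + "\n"
```

Calling it with, say, `{"executable": "start_drone.sh"}` (a placeholder script name) and two UUIDs yields a single description that can be written to a file and handed to one `condor_submit` call instead of two.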
[1] This may also apply to other systems for which we have site adapters. Suggestions welcome.