You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Running scripts like the stabilization and cropping were causing a slowdown in the I/O for other users @talonchandler@ieivanov
After some debugging, I believe I've nailed down the issue. This issue can be replicated in two ways with datasets that have lots of positions and timepoints and/or channels.
Most slurm scripts parallize over T and C and submit individual jobs for each positions. Keep in mind that since we parallelize over these two dimensions each multiprocessing pool will spawn n number of subprocesses. So, if all positions are allocated, then each jobs will spawn its respective number of child processes causing lots of I/O calls.
Make a slurm script with more CPUs, memory and simultaneous number of subprocesses. If we do this we run into the same issue as we will have multiple jobs running in parallel and I/O calls which reduce our overall throughput.
Proposed solutions:
Make sure that the total amount of jobs per datasets is below a TBD threshold. The combination of T and C simultaneous processes and number of positions should be less than this threshold. I need to find a sweet spot for this.
Modify the slurmscripts so that we run the positions in batches. This can be done by creating dependencies between the jobs as follows:
Problem
Running scripts like the stabilization and cropping were causing a slowdown in the I/O for other users @talonchandler @ieivanov
After some debugging, I believe I've nailed down the issue. This issue can be replicated in two ways with datasets that have lots of positions and timepoints and/or channels.
n
number of subprocesses. So, if all positions are allocated, then each jobs will spawn its respective number of child processes causing lots of I/O calls.Proposed solutions:
The text was updated successfully, but these errors were encountered: