Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update GPU process cleanup logic in SLURM epilog script #1316

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

ilya-da
Copy link

@ilya-da ilya-da commented Sep 14, 2024

Simple resolution for issue #1315 i've opened earlier
Remove redundant 'tail' command in GPU process cleanup checks to ensure more accurate detection and termination of residual GPU processes. This change optimizes the script by directly filtering out comments and unnecessary lines from nvidia-smi output and not depend on how many comment lines nvidia-smi output may have

Remove redundant 'tail' command in GPU process cleanup checks to ensure more accurate detection and termination of residual GPU processes. This change optimizes the script by directly filtering out comments and unnecessary lines from nvidia-smi output and not depend on how many comment lines nvidia-smi output may have
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant