Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make CUDA image project templates #1539

Open
olevski opened this issue Jul 17, 2023 · 3 comments
Open

Make CUDA image project templates #1539

olevski opened this issue Jul 17, 2023 · 3 comments

Comments

@olevski
Copy link
Member

olevski commented Jul 17, 2023

Currently we have no templates with CUDA images. But we build CUDA images and publish them.

This makes using GPUs on renku complicated. The main reason is that when you want to use a GPU you have to:

  • go to the renkulab docker realses
  • pick the right image
  • upgrade your python project to use a new image

It also makes updating the image a problem because you have to repeat the whole process.

If we publish project templates with CUDA a lot of these issues go away.

@seanrmurphy
Copy link
Contributor

Just one thing to be aware of here - torch bring its own cuda and we have seen issues in the past in which there was a cuda version provided in a container image which conflicted with torch's cuda version. In this case Till was actually able to run his job on a container image which had no cuda support (as pip install torch installed all the cuda stuff he required) but when he tried an image which had pre-baked cuda, it did not work as there was a conflict between preinstalled cuda and that which torch installed.

@olevski
Copy link
Member Author

olevski commented Jul 17, 2023

@seanrmurphy this is good to know. If we can fully retire these cuda images I will be really happy. Firat asked for this so I will let him know. I thought it was impossible for torch to pull in all the requirements simply by doing pip install torch.

@seanrmurphy
Copy link
Contributor

Torch does this but tensorflow does not afaik. Imo, this is one of those cases where older stuff needs the cuda images but newer stuff does not.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Backlog
Development

No branches or pull requests

2 participants