-
Here is a PR: #13403
-
This looks better suited for an issue than a discussion. Can we close this in favor of #13422 (even though this one was started first)?
-
I ran into an issue when running a program that does not use collective operations. My NCCL version is older and lacks the function `ncclCommInitRankConfig`, which was introduced in NCCL version 2.14.3. When loading the CUDA driver, I get an error because the symbol cannot be loaded.

In the code, a missing library is not treated as an error; the failure only surfaces later, on channel creation. Wouldn't it be more correct to silence a symbol load failure here as well? If lazy-like loading is intended, not having the right NCCL version is the same as not having NCCL at all.
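To make the proposal concrete, here is a minimal sketch of the behavior I have in mind. It is written in Python with `ctypes` purely for illustration and is not the project's actual loader. The helper name `load_optional_symbol` is hypothetical, and the library file name `libnccl.so.2` is an assumption about how NCCL is installed on the system.

```python
import ctypes


def load_optional_symbol(library_name, symbol_name):
    """Return the requested symbol, or None if it cannot be loaded.

    A missing library and a library that lacks the symbol (for example,
    an NCCL build that is too old) are treated the same way: both yield
    None instead of raising, so the caller can defer the failure.
    """
    try:
        library = ctypes.CDLL(library_name)
    except OSError:
        # The shared library itself was not found or could not be opened.
        return None
    try:
        return getattr(library, symbol_name)
    except AttributeError:
        # The library loaded, but it does not export this symbol.
        return None


# Hypothetical usage: the library file name is an assumption.
init_fn = load_optional_symbol("libnccl.so.2", "ncclCommInitRankConfig")
nccl_available = init_fn is not None
print("NCCL backend enabled:", nccl_available)
```

With this approach, a program that never creates a channel is unaffected by an old NCCL, and the error is reported only at the point where NCCL is actually required.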