Does MInference support CUDA 11.8? #56
Comments
Hi @hensiesp32, thanks for your interest in MInference. It supports CUDA 11.8. We have released the wheel for CUDA 11.8 at this link. If you have any questions, feel free to leave a comment here.
Thanks for your reply. I want to run the needle-in-a-haystack experiment, and I only used one A100-80G. However, when the context length reaches 300k, an OOM error occurs. Then I open the
so I want to know how you test needle-in-a-haystack with a 1M context length. Or can we use multiple GPUs to run it?
Hi @hensiesp32,
Describe the issue
I am wondering whether MInference supports CUDA 11.8. Our devices don't support CUDA 12.3.