unable to open hip GPU device (gfx1030) on 4.0.x branch. #2238
Comments
On further debugging, it looks like the device context is not populated:
Thread 1 "hipTestDevice" received signal SIGSEGV, Segmentation fault.
This happens with the Tensile (gfx10) branch too:
5950x:~/github/Tensile/build$ /home/foo/github/Tensile/build/0_Build/client/tensile_client --config-file /home/foo/github/Tensile/build/1_BenchmarkProblems/Cijk_Ailk_Bljk_SB_00/00_BF/build/../source/ClientParameters.ini
Aborted (core dumped)
It looks like rocclr is not initializing the context (?)
hipErrorNoDevice means /opt/rocm/bin/rocminfo returns no devices.
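A quick way to check that condition is to look for GPU agents in the rocminfo output. The sketch below is only an illustration: the helper names `gpu_agents` and `run_rocminfo` are hypothetical, and it assumes that rocminfo lists each GPU agent on a line of the form `Name: gfx1030`, which may differ between ROCm releases.

```python
import re
import shutil
import subprocess


def gpu_agents(rocminfo_output: str) -> list:
    """Return gfx target names (e.g. 'gfx1030') found in rocminfo-style text.

    Assumption: GPU agents appear on lines shaped like 'Name:   gfx1030'.
    Lines such as 'Marketing Name: ...' or CPU agent names are ignored.
    """
    pattern = r"^[ \t]*Name:[ \t]+(gfx[0-9a-f]+)[ \t]*$"
    return re.findall(pattern, rocminfo_output, flags=re.MULTILINE)


def run_rocminfo(path: str = "/opt/rocm/bin/rocminfo"):
    """Run rocminfo and return its stdout, or None if the binary is missing."""
    if shutil.which(path) is None:
        return None
    result = subprocess.run([path], capture_output=True, text=True, check=False)
    return result.stdout


if __name__ == "__main__":
    output = run_rocminfo()
    if output is None:
        print("rocminfo not found")
    else:
        agents = gpu_agents(output)
        # An empty list would be consistent with hipErrorNoDevice.
        print("GPU agents:", agents if agents else "none")
```

If this lists a gfx agent but HIP still fails, the problem is more likely above the ROCr layer than in device enumeration.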
Thanks for the quick response. The device is found OK.
I am also able to run the rocr benchmark tests (posted here: ROCm/aomp#187). Something is broken with rocclr in the hip-on-vdi path.
@powderluv, sorry for the lack of response. Please try the latest ROCm 6.0.2 (HIP 6.0.32831) to see whether your issue still exists. If it is resolved, please close the ticket. Thanks.
See corresponding bug here: ROCm/aomp#187
Based on the guidance there I was able to verify rocr with rocm_bandwidth_test.
HIP fails its directed tests with: