-
Notifications
You must be signed in to change notification settings - Fork 143
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Crash with system-provided OpenMPI and HDF5_jll v1.14 #1079
Comments
ah, I've seen something similar! The problem appears to be that we're opening two different MPI libraries (the system one from MPI.jl, and the JLL one (from HDF5_jll). Easy workarounds:
In the longer term we need a better fix. @giordano @eschnett any suggestions on how we can deal with this? |
I thought |
Yeah, i don't quite get why it's pulling in OpenMPI_jll? |
Ah, I see. It augments based on the value of the MPI But the augmentation for |
My approch, of course, would be to use the Julia-provided |
Would it be possible to print a warning if a system-provided MPI installation, but no system-provided HDF5 is detected? |
I have been affected by this very problem. A package I am working on (let's call it PMFRG) depends on HDF5 and if I do
I get a crash without much explanation.
works. |
I believe using mpitrampoline as suggested above should help, see documentation added in JuliaParallel/MPI.jl#838 |
When I set up a simple project with the latest MPI and HDF5 packages and configure it to use the system-provided OpenMPI installation, the call to
MPI.Init()
crashes with “orte_init failed” errors. I am observing issue on both Ubuntu 18.04 (OpenMPI 3.1.2) and 20.04 (OpenMPI 4.0.3). Downgrading to HDF5_jll v1.12 fixes the issue.Steps to reproduce:
julia --project=.
]add MPI HDF5
using MPI; MPI.MPIPreferences.use_system_binary()
mpirun -n 4 julia --project -e "using MPI, HDF5; MPI.Init()"
(ormpiexecjl
), observe crash]add [email protected]
, rerun without crashOn Ubuntu 18.04, the error includes the line
mca_base_component_repository_open: unable to open mca_pmix_pmix3x: /home/user/.julia/artifacts/f9744710560ba3ddc00cd9df62ac7dfcd18c8649/lib/openmpi/mca_pmix_pmix3x.so: undefined symbol: opal_envar_t_class
, in case this is helpful.The text was updated successfully, but these errors were encountered: