-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ThreadsafetyTest: Problems with OpenMPI #292
Comments
This defect only has effect in builds with |
The second problem (the failing Depending on the outcome, we might have to ensure mutual exclusion for sync calls ourselves or rethink our synchronization model. Calling So in essence: Yes, this problem currently only bites us if |
I got feedback from the OpenMPI folks:
Sounds like the next release of OpenMPI will support our use-case. We might want to add this (and the broken alignment of shared windows, see #280) to a |
Ping: any opinion on the |
@devreal I'd be fine with that. Known issues are to be documented in |
Just as a note, I think that is a problem with the way the version got build. |
Interesting. What flags did you pass to configure? I usually build all OpenMPI versions myself. |
I passed no extra options to Sorry my bad, the error persists for OpenMPI 2.0.2. Should've read the output more carefully. For at least in OpenMPI 2.1.0 this error is gone, but then again it blocks in |
That is interesting as well, I was not aware that things changed there. I will follow up with the OpenMPI folks on that. Thanks for the report! |
Mhh, this test is failing for me as well and I've been trying to debug this for a while now. The results are wrong and it tends to hang, just as you observe it. However, I get wrong results with MPICH as well, so I assume that we do something nasty there. I just haven't figured out what exactly is going wrong... |
I am seeing issues on my local machine when running the
ThreadsafetyTest
s under OpenMPI. The used compiler is GCC 5.3.0, both in Debug and Release mode.OpenMPI 1.10.5
I am experiencing hangs in
ConcurrentPutGet
, where one or multiple threads are stuck inMPI_Win_flush
called bydart_put_blocking
:For some reason, the test seems to pass on CircleCI. It seems to be a progress issue and I seem to remember that @fuchsto had investigated something along those lines in OpenMPI. Did you file a bug report back then?
OpenMPI 2.0.2
The test suite does not run at all since OpenMPI 2.0.2 complains about missing support for
MPI_THREAD_MULTIPLE
in the OSC pt2pt component:This sounds like a bug in OpenMPI and I will send a report to the user mailing list to ask for clarification. The suggested workarounds are not very helpful (yes, I am running on a single node and no, I do not have Infiniband available on my laptop...) Without any investigation done so far, it's not clear to me though why the
openmpi2
test succeeds in CircleCI.MPICH 3.2
Works as expected 👍
The text was updated successfully, but these errors were encountered: