TeamLocalityTest: Subdomain tag too short #161
Comments
The original cause of this defect seems to be invalid parameters. When multiple units are mapped to the same domain (which is an invalid configuration we can't avoid in CI), the test case I added a check and error log message for stability (branch |
Just tested with latest development and it still crashes when using 8 units. I am attaching the Valgrind log of one of the processes, which still shows invalid writes in the domain handling. |
Ok, thank you! (... but ... whhhyyyyy ... ?) How did you configure / call Valgrind? Is it the bundled memcheck of OpenMPI? |
Right now, things seem to fall apart completely again: No matter the test I run, I get a SIGSEGV during initialization if I have debug output enabled in DART:
I have no idea why things go wrong there. The |
Not sure if that is related, but I also see these error messages on unit 0:
|
On it! |
It seems that there is a problem with the TeamLocalityTest. Valgrind reports:
Adding some debug output to `dart__base__locality__domain_group`, I see that `strlen(group_subdomain_tags[sd]) < (group_parent_domain_tag_len + 1)`:
Note that `(group_parent_domain_tag_len + 1)` is 15 in this case and `group_subdomain_tags[sd]` contains `'.0.0.0.0.1.0.0'`.
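
To make the reported numbers concrete, here is a minimal sketch of the length comparison. The helper `subdomain_tag_is_long_enough` is hypothetical and not part of DART; it only mirrors the condition quoted in this issue, assuming a subdomain tag must be at least one character longer than its parent tag.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not a DART function): returns 1 if subdomain_tag
 * is at least one character longer than a parent tag of length
 * parent_tag_len, and 0 otherwise. */
static int subdomain_tag_is_long_enough(const char *subdomain_tag,
                                        size_t parent_tag_len)
{
    assert(subdomain_tag != NULL);
    return strlen(subdomain_tag) >= parent_tag_len + 1;
}
```

The tag `.0.0.0.0.1.0.0` is 14 characters long, so with a parent tag length of 14 the required minimum is 15 and the check fails, which matches the values reported here.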