-
Notifications
You must be signed in to change notification settings - Fork 868
WeeklyTelcon_20240611
Tommy Janjusic edited this page Jun 11, 2024
·
5 revisions
- Dialup Info: (Do not post to public mailing list or public wiki)
- Tommy Janjusic (NVIDIA)
- Jeff Squyres (CISCO)
- Luke Robison (Amazon)
- Edgar Gabriel (AMD)
- Todd Kordenbrock
- Manu Shantharam (AMD)
- Howard Pritchard (LANL)
- Wenduo Wang (Amazon)
- George Bosilca (NVIDIA)
- Josheph Schuhart (SBU)
- Todd Kordenbrock
- Aurelien Bouteiller (UTK)
- v4.1.x Issues; v4.1.x Questions
- v4.1.x Open PRs
- Next release when cycles allow it, or if there is a emergency
-
PRTE issue if PMIx upgrades from v4.x to v5.x without also rebuilding PRTE
- PMIx changed some internal data structures between v4.x and v5.x (specifically, they are not part of the PMIx public API).
- The PMIx public API did not change between v4.x and v5.x, so the libpmix .so number did not change to reflect an incompatibility.
- PRTE uses the internal PMIx data structures, however, and is affected by the changes between PMIx v4.x and v5.x.
- Meaning: if you build+install PRTE against PMIx v4.x and then upgrade to PMIx v5.x without also re-building PRTE, PRTE will almost certainly segv.
- Since PRTE is the back-end of mpirun, this affects Open MPI as well.
- Upstream PRTE is likely to put in a run-time check to print an error and abort (before segv'ing) when detecting this case.
- See https://github.com/openpmix/prrte/pull/1982 , pull this PR into our fork
- Runtime test suite, from IBM check with Josh H.
- Check requirements, there was an IBM CI running on PRRTE (sept. 2023),
-
Now we have our own PRRTE. People should watch out for pull requests https://github.com/open-mpi/prrte/pulls
-
cuda reduction, cuda mem topic (Joseph)
- https://github.com/open-mpi/ompi/pull/12356 , merge
- Add CUDA/HIP implementations of reduction operators #12569 (large PR, needs reviewers)
- only have the CUDA backend, generate HIP *
-
acoll PR, needs to be merged for v6.0
-
https://github.com/open-mpi/ompi/pull/12484
- needs to be reviewed, disabled by default
-
https://github.com/open-mpi/ompi/pull/12484
-
Large count stuff before fork
-
**Fork sometime in v6.0 July **
-
Introduce new JSON parser utility #12531
- Need to discuss MPI 4.1
- Need to discuss MPI 4.0
- Discuss set v5.0.4 timeline
- Discuss v6.0 feature list and timeline
- Discuss 6.0: BigCount
- Explore cherry-pick bots
- Explore an MTT alternative