Meeting 2024

Howard Pritchard edited this page Apr 25, 2024 · 83 revisions

Open MPI Developer's 2024 Meeting

Meeting logistics:

Remote attendance information

The meeting rooms are integrated with MS Teams; there will be a separate link for each day for remote participants to attend. This is a link to a non-public repo with the info (posting links publicly just invites spam; sorry, folks).

If you do not have access to the non-public repo, please email Jeff Squyres.

Attendance

Please put your name down here if you plan to attend.

  1. Edgar Gabriel (AMD)
  2. Howard Pritchard (LANL)
  3. Thomas Naughton (ORNL)
  4. George Bosilca (NVidia)
  5. Joseph Schuchart (UTK)
  6. Kawthar Shafie Khorassani (AMD)
  7. Manu Shantharam (AMD)
  8. Luke Robison (AWS)
  9. Jun Tang (AWS)
  10. Wenduo Wang (AWS)
  11. Tommy Janjusic (Nvidia)

Agenda items

The meeting is tentatively scheduled to start on April 24 around 1pm and is expected to finish on April 26 around lunchtime.

Please add agenda items that we need to discuss here.

  • Support for MPI 4.0 compliance (https://github.com/open-mpi/ompi/projects/2)

  • Support for MPI 4.1 compliance (https://github.com/open-mpi/ompi/projects/4)

    • Memory kind info objects

    Edgar presents some slides summarizing this MPI 4.1 feature, including a discussion of mpi_assert_memory_alloc_kinds. How would we actually use this within Open MPI? The slides also list work items; some work is required in PRRTE. Discussed lazy initialization of CUDA, etc. We may not be able to do much optimization based on memory kinds anyway, outside of a pointer check. Some discussion of the complications of restrictors and how they may complicate actually using this kind info inside Open MPI. Also, Open MPI can be configured with support for multiple device types: do we need to support different device types concurrently? The accelerator framework is currently not set up to deal with this; it allows only one component to be active. Discussed multiple devices of a single type: right now the cuda and rocm components are not making use of the APIs that take device IDs, but they could, at least for CUDA. Perhaps items for a 6.0 release?
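As a hedged sketch of the mechanics under discussion (the mpiexec option, info keys, and restrictor syntax below are taken from the MPI 4.1 document; whether and how Open MPI wires them up is exactly the open work item):

```shell
# MPI 4.1 memory-kinds sketch -- names per the standard, not current Open MPI behavior.
# Request CUDA device-memory support at launch, using a restrictor:
mpiexec --memory-alloc-kinds cuda:device -n 2 ./app

# Inside the application, the kinds actually provided are readable through the
# reserved info key mpi_memory_alloc_kinds (e.g. via MPI_Comm_get_info), and an
# application can promise restricted usage via mpi_assert_memory_alloc_kinds.
```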

  • Support for MPI 4.2(?) ABI (https://github.com/mpi-forum/mpi-issues/issues/751)

  • Collective Operations

    • xhc/shared memory collectives
    • GPU collectives
    • Collective configuration file
    • Memory allocation caching

    Possibly drop HCOLL in 6.0. Could we combine/migrate some of the adapt algorithms into libnbc? Not really; they use different approaches to non-blocking collectives. The coll framework has many components; can we remove some of them (e.g., sm)?

    AWS is focusing on optimizing collectives for the EFA Libfabric provider, using the MTL. Focus on HAN optimization (alltoall/alltoallv) and on tuned/base (allreduce, allgather, reduce). Also working on a selection algorithm, and considering a decision-file-based approach. PRs are open for many of these.
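For context, a decision-file hook already exists in coll/tuned today; a minimal sketch using current MCA parameter names (file contents elided):

```shell
# Enable coll/tuned's dynamic decision rules and point them at a rules file
mpirun -n 8 \
    --mca coll_tuned_use_dynamic_rules 1 \
    --mca coll_tuned_dynamic_rules_filename ./ompi_coll_rules.conf \
    ./app
```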

    Quite a few PRs are open right now for various collective algorithms: XHC, smdirect, acoll, coll/am. Should we start merging some of these in?

    Lots of discussion about selection and priority of components.
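For reference, selection today is priority-driven per component; a hedged sketch of how priorities are inspected and overridden with current tooling:

```shell
# List coll components and their priority parameters
ompi_info --param coll all --level 9 | grep priority

# Force a different winner for one run, e.g. raise HAN's priority
mpirun -n 8 --mca coll_han_priority 100 ./app
```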

    Discuss whether to merge in https://github.com/open-mpi/ompi/pull/11418. George will ping the PR author to gauge the level of commitment; if the author or their org will support it, go ahead with the merge.

    Agree to merge in the acoll PR https://github.com/open-mpi/ompi/pull/12484 once it passes CI.

    Could use an easy way to determine and report which component/algorithm is being used for a collective op, perhaps targeted at the debugging case.
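Today the closest thing is the framework's verbose logging, which is debug-oriented rather than an easy report, e.g.:

```shell
# Print coll selection/decision logging to stderr (noisy; a debugging aid only)
mpirun -n 4 --mca coll_base_verbose 10 ./app
```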

    Do we still need smdirect PR - https://github.com/open-mpi/ompi/pull/10470?

    Agree to remove coll/sm component.

  • Accelerator support

    • shared memory plans for 5.1 and beyond
    • one-sided operations

    IPC support in accelerators for 5.1. On main, no components outside the accelerator framework make CUDA calls. We do need IPC support in the accelerator/cuda component.

    GMAC parameter support? PMIx may have something similar. The idea would be to change priorities for accelerator-related components without having to set multiple MCA parameters.
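Context for the single-knob idea: today, steering accelerator-related behavior means setting several independent MCA parameters. A hedged illustration (parameter names believed to match current components, but unverified here):

```shell
# Each accelerator-related component is tuned separately today
mpirun -n 2 \
    --mca btl_smcuda_use_cuda_ipc 1 \
    --mca coll_cuda_priority 78 \
    ./app
# The proposal: one umbrella parameter that adjusts these together
```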

    Joseph is working on PR 12356 - https://github.com/open-mpi/ompi/pull/12356 and PR 12318 - https://github.com/open-mpi/ompi/pull/12318, both related to accelerator support.

    Discuss implications of the dmabuf method of memory registration. It is currently used within some Libfabric providers and UCX. At this point it does not appear that we need to handle dmabuf registration within Open MPI itself; this might change if network providers require dmabuf-based memory registration.

    Did not discuss one-sided operations.

  • PRRTE future topic

  • Review previously-created wiki pages for 5.1.x and 6.0.x in the context of planning for Open MPI vNEXT

    • These were made a long time ago; it would probably be good to re-evaluate, see which items are realistic, which will actually happen, etc. Timing / version numbers may change / consolidate, too, if we re-integrate PRRTE for v6.0.x (e.g., is doing a v5.1.x worth it at all?).
    • Proposed v5.1.x feature list
    • Proposed v6.0.x feature list
  • What to do about SLURM?

    The problem starts with SLURM release 23.11: SchedMD made changes to the environment variables describing the job id and related quantities, which impact the discovery mechanism of PRRTE's RAS system.

  • For the OFI group

    • Adopt libfabric 2.0 API?
    • Adopt dma-buf API
    • mtl/ofi vs. btl/ofi performance differences
  • Misc

    • MPI_Info_set handling https://github.com/open-mpi/ompi/pull/11823
      • What is the bar for merging something into main? Just a successful CI pass? What if there are complaints from the rest of the community? What if the solution is known to be partial and incomplete?
    • Should we enable better downstream build pipeline security for those downloading from open-mpi.org?
      • For v5.0.x, we have md5, sha1, and sha256 checksums in the HTML on the download page.
      • Should we have these values in (more easily) machine-readable formats somewhere?
      • Should we be cryptographically signing releases somehow? (Tarballs do not support embedded signatures.)
      • What do others do (e.g., GNU projects)?
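For comparison, the usual machine-readable form is a checksum file consumable by the coreutils tools, plus a detached signature as many GNU projects publish. A hedged sketch (filenames here are placeholders):

```shell
# With a sha256sums.txt published alongside the tarballs, verification is one line:
grep openmpi-5.0.3.tar.bz2 sha256sums.txt | sha256sum -c -

# A detached GnuPG signature would be verified like:
gpg --verify openmpi-5.0.3.tar.bz2.sig openmpi-5.0.3.tar.bz2
```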
  • Action items

    • Joseph will ping the Score-P folks about interest in MPI_T events. (DONE) Marc-Andre says it is on the list for Score-P. TAU is using it, as is some unreleased version of MPI Advisor.
    • George will ping author of PR 11418 about level of commitment to determine whether to merge this PR into main.