-
Notifications
You must be signed in to change notification settings - Fork 868
WeeklyTelcon_20200303
Geoffrey Paulsen edited this page Mar 4, 2020
·
1 revision
- Dialup Info: (Do not post to public mailing list or public wiki)
- Geoffrey Paulsen (IBM)
- Howard Pritchard (LANL)
- Jeff Squyres (Cisco)
- Artem Polyakov (Mellanox)
- Edgar Gabriel (UH)
- Austen Lauria (IBM)
- Harumi Kuno (HPE)
- Joseph Schuchart
- Josh Hursey (IBM)
- Joshua Ladd (Mellanox)
- Michael Heinz (Intel)
- Noah Evans (Sandia)
- Ralph Castain (Intel)
- sbreyer
- Todd Kordenbrock (Sandia)
- William Zhang (AWS)
- Brendan Cunningham (Intel)
- Akshay Venkatesh (NVIDIA)
- Thomas Naughton (ORNL)
- Scott Breyer (Sandia?
- Nathan Hjelm (Google)
- Charles Shereda (LLNL)
- David Bernhold (ORNL)
- George Bosilca (UTK)
- Matthew Dosanjh (Sandia)
- Brandon Yates (Intel)
- Erik Zeiske
- Mark Allen (IBM)
- Matias Cabral (Intel)
- Xin Zhao (Mellanox)
- mohan (AWS)
-
MTT -
- If you change your MTT to startup PRRTE at begining of session, and just use prun.
- Can see times cut in half or more.
- This is good, but also need to test mpirun wrapper.
- Cisco is converting half of MPI installs to use prrte/prun
-
PRRTE Transition:
- Been in master a few weeks. Still finding and fixing issues. Progress.
- see: https://github.com/openpmix/prrte/issues/298 for additional mpirun launch items
-
OMPI master submodule pointers setup to track PMIx and PRRTE master.
- Hopefully long term, master can track release branches.
- But still ensure there's some regression tracking of master/master/master.
- But once things settle down, might not want everyone's masters tracking each other.
- But if we DONT have master/master/master then new features that span across repos will be challenging
Blockers All Open Blockers
Review v3.0.x Milestones v3.0.6
Review v3.1.x Milestones v3.1.6
- Michael is interested in a schedule.
- He will discuss and propose a date
Review v4.0.x Milestones v4.0.3
- v4.0.3 in the works.
- Config file needs to be committed to
- Merging in PRs
- Seeing some instability on github right now.
- Schedule:
- Feature Freeze: End of April
- Release: End of June
- Austen took an initial stab at issues and is starting a google sheets of v5.0 features.
- Issue - 7475 Blocker for Amazon.
- When marking items as blockers, please add rational.
- This is functionality is in v4.0, and if you don't specify a PML you'd get this modex.
- Apparently the check wasn't working in 3.0 or 4.0
- May need doc updates for command line changes, both prrte, and mca changes.
- Want to get some feedback some community.
- Huge list of features
- Biggest thing on master is prrte.
- Issues are being found and fixed.
- Cisco mtt failing due to -np
- Maybe mid-late summer. No discussion
- scale-testing, PRs have to opt-into it.
Review Master Master Pull Requests
- CI testing only tests build and did it run, but doesn't test HOW it ran.
- Environment setup can be a bit different.
- For example no-permissions in
/tmp
. Might pass on one machine, and fail on another without/tmp
permissions.