Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

coll/accelerator initial enhancements #12986

Merged

Conversation

edgargabriel
Copy link
Member

@edgargabriel edgargabriel commented Dec 18, 2024

  • use the device_id and the transfer direction for the accelerator.mem_copy operations: since this information is actually available, we might as well use it and give the accelerator components the ability to potential use this information for tuning purposes.

This PR is in preparation for additional work on the coll/accelerator component, I just want to avoid having a gigantic PR that is harder to digest.

Copy link
Member

@bosilca bosilca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good but I think we should have this mechanism provided from base and applied consistently across all collective modules. Maybe we can split this in two, merge the first commit (adding the dev_id and transfer type) and I volunteer to make the coll selection mechanism generic.

ompi/mca/coll/accelerator/coll_accelerator_component.c Outdated Show resolved Hide resolved
ompi/mca/coll/accelerator/coll_accelerator_component.c Outdated Show resolved Hide resolved
@edgargabriel
Copy link
Member Author

We had an offline discussion with @bosilca I will extract re-push this branch to contain only the first commit, and @bosilca will add code that achieves the same purpose as the second commit, but in coll/base.

make use of the correct device id associated with a device buffer
and provide the correct transfer type (DTOH or HTOD) for the accelerator
memcpy operations. This information can be potentially used for some
optimizations internally.

Signed-off-by: Edgar Gabriel <[email protected]>
@edgargabriel edgargabriel force-pushed the topic/coll-accelerator-extension branch from 519234a to 118c1b0 Compare December 19, 2024 16:18
@edgargabriel edgargabriel merged commit fced151 into open-mpi:main Dec 19, 2024
18 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants