Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

prov/efa: Create efa_ep interface #10705

Merged
merged 3 commits into from
Jan 22, 2025
Merged

Conversation

shijin-aws
Copy link
Contributor

This PR contains two changes

  1. Migrate dgram/efa_dgram_ep.c to efa_ep.c as a common ep interface for both dgram and rdm ep type. dgram repo now has nothing and is removed.
  2. Extend efa ep interface to make it cover all the applied features that efa-rdm ep interface supports today. It also refactors and moves several internal efa_rdm_ep functions to efa_base_ep.c to cover both efa_direct and efa_rdm ep.

Refactor/changes are backed by corresponding unit tests.

@shijin-aws shijin-aws requested a review from a team January 16, 2025 00:11
prov/efa/src/efa_cntr.c Outdated Show resolved Hide resolved
prov/efa/src/efa_base_ep.c Outdated Show resolved Hide resolved
prov/efa/src/efa_base_ep.c Show resolved Hide resolved
prov/efa/src/efa_ep.c Show resolved Hide resolved
jiaxiyan
jiaxiyan previously approved these changes Jan 16, 2025
Migrate dgram/efa_dgram_ep.c to  efa_ep.c
as a common ep interface for both dgram and
rdm ep type. dgram repo now has nothing and
is removed.

Signed-off-by: Shi Jin <[email protected]>
@shijin-aws
Copy link
Contributor Author

bot:aws:retest

1 similar comment
@shijin-aws
Copy link
Contributor Author

bot:aws:retest

jiaxiyan
jiaxiyan previously approved these changes Jan 21, 2025
Extend efa ep interface to make it cover all the
applied features that efa-rdm ep interface supports
today.

It also refactors and moves several internal efa_rdm_ep functions
to efa_base_ep.c to cover both efa_direct and efa_rdm ep.

Signed-off-by: Shi Jin <[email protected]>
Currently, efa_base_ep's default rnr_retry is 3 which only
does a few retry in the firmware level for RNR. This is
due to the efa_rdm_ep supports libfabric level RNR retry.
However, the efa-direct ep doesn't support libfabric
level RNR retry. Then we should make it do infinite
RNR retry (7), which is also the default behavior of
SRD QP.

Signed-off-by: Shi Jin <[email protected]>
Copy link
Contributor

@sunkuamzn sunkuamzn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me, thank you!

@sunkuamzn
Copy link
Contributor

bot:aws:retest

@shijin-aws shijin-aws merged commit 736d452 into ofiwg:main Jan 22, 2025
12 of 13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants