support benchmarking of CUDA Unified Memory #63

Open
drossetti opened this issue Oct 1, 2019 · 0 comments

perftest should support benchmarking of CUDA Unified Memory.

There are two basic variants of CUDA Unified Memory:

  1. managed memory, as allocated via cudaMallocManaged()
  2. system allocated memory, as allocated via malloc()
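
As a rough sketch of how a benchmark could pick between the two variants above, the snippet below wraps both behind one allocator. The names `um_kind`, `um_alloc` and `um_free` are hypothetical and not part of perftest, and `HAVE_CUDA` is assumed to be a build-time macro set when the CUDA runtime is available. Without it, only the `malloc()` path is compiled in.

```c
#include <stdlib.h>
#ifdef HAVE_CUDA
#include <cuda_runtime.h>
#endif

/* Hypothetical memory kinds for illustration; not perftest identifiers. */
enum um_kind {
    UM_SYSTEM,  /* system allocated memory, via malloc() */
    UM_MANAGED  /* managed memory, via cudaMallocManaged() */
};

/* Allocate `size` bytes of the requested kind; returns NULL on failure. */
static void *um_alloc(enum um_kind kind, size_t size)
{
    if (kind == UM_SYSTEM)
        return malloc(size);
#ifdef HAVE_CUDA
    void *ptr = NULL;
    /* cudaMemAttachGlobal makes the allocation accessible from any stream. */
    if (cudaMallocManaged(&ptr, size, cudaMemAttachGlobal) != cudaSuccess)
        return NULL;
    return ptr;
#else
    return NULL; /* built without CUDA: managed memory is unavailable */
#endif
}

/* Release a buffer with the deallocator matching its kind. */
static void um_free(enum um_kind kind, void *ptr)
{
    if (kind == UM_SYSTEM) {
        free(ptr);
        return;
    }
#ifdef HAVE_CUDA
    cudaFree(ptr);
#else
    (void)ptr; /* nothing to free: um_alloc never returns managed memory here */
#endif
}
```

Keeping the kind next to the pointer matters because the two variants need different deallocators, even though both yield an ordinary pointer that the CPU can dereference.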

On IBM POWER9-based machines where the GPU is attached to the CPU via NVLink, e.g. AC922 servers, the CUDA runtime supports GPUDirect RDMA on both variants.
For that to work, On-Demand Paging (ODP) must be enabled.

On other systems, like x86_64, support is still missing.
