Do not synchronize in caching_allocator::{de}allocate#7792
miscco merged 2 commits into NVIDIA:main.

Conversation
davebayer
left a comment
I don't believe this is correct. We should `.sync()` in `.deallocate()`: if you only enqueue the deallocation on the stream but are not yet done with the buffer, the next allocation could pick up this buffer and overwrite your data.
Yeah, you are right. It's not only the pointer; it could also be that the operation has not yet finished.
davebayer
left a comment
What we could do is add synchronization using `cuda::event`. In the `.deallocate(...)` method, we would record an event as a replacement for the `cudaFreeAsync` in the queue. Then, in the `.allocate(...)` method, we would enqueue a wait on that event before returning the pointer, replacing the `cudaMallocAsync` in the queue.
This way we can be sure there are no race conditions.
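A minimal sketch of that event-based scheme (names like `cached_block`, `deallocate`, and `allocate` are hypothetical, error handling is omitted, and this requires a CUDA device to run), assuming the allocator keeps one `cudaEvent_t` per cached block:

```cuda
#include <cuda_runtime.h>

// Hypothetical per-block bookkeeping kept by the caching allocator.
struct cached_block
{
  void* ptr;
  size_t bytes;
  cudaEvent_t ready; // recorded at deallocation time
};

// deallocate: instead of synchronizing, record an event on the stream.
// The event stands in for the cudaFreeAsync that would otherwise be enqueued.
void deallocate(cached_block& block, cudaStream_t stream)
{
  cudaEventRecord(block.ready, stream);
  // ... then push the block onto the allocator's free list ...
}

// allocate: before handing a cached pointer out, make the requesting
// stream wait for the pending work, replacing cudaMallocAsync's ordering.
void* allocate(cached_block& block, cudaStream_t stream)
{
  cudaStreamWaitEvent(stream, block.ready, 0);
  return block.ptr;
}
```

Since `cudaStreamWaitEvent` only orders work on the stream, neither path blocks the host, which is the point of removing the synchronization.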
```cpp
}
else
{
  const cudaError_t status = cudaMallocAsync(&result, num_bytes, __stream.get());
```
I am not sure if you can allocate memory with `cudaMallocAsync` and then free it with `cudaFree`.
You can: https://docs.nvidia.com/cuda/cuda-programming-guide/04-special-topics/stream-ordered-memory-allocation.html#freeing-memory
"Likewise, memory allocated with cudaMallocAsync can be freed with cudaFree()."
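As the linked documentation states, mixing the two APIs in that direction is valid; a minimal illustration (error handling omitted, requires a CUDA device):

```cuda
#include <cuda_runtime.h>

int main()
{
  cudaStream_t stream;
  cudaStreamCreate(&stream);

  void* ptr = nullptr;
  cudaMallocAsync(&ptr, 1024, stream); // stream-ordered allocation

  // Per the docs, memory from cudaMallocAsync may be freed with the
  // synchronous cudaFree() instead of cudaFreeAsync().
  cudaFree(ptr);

  cudaStreamDestroy(stream);
  return 0;
}
```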
🥳 CI Workflow Results: 🟩 Finished in 1h 22m. Pass: 100%/33 | Total: 9h 00m | Max: 1h 05m | Hits: 85%/19433
davebayer
left a comment
But we should rework the allocator in the future; we should not need this additional synchronization.