Commit 4fe1beb

Enable PDL for DeviceMergeSortBlockSortKernel
The kernel already calls _CCCL_PDL_GRID_DEPENDENCY_SYNC, but programmatic dependent launch (PDL) was not enabled at the call site that launches it. This was missed in NVIDIA#3114.
1 parent c0793cf commit 4fe1beb

1 file changed: +1 -1 lines changed

cub/cub/device/dispatch/dispatch_merge_sort.cuh (+1 -1)
@@ -512,7 +512,7 @@ struct DispatchMergeSort
 
     // Invoke DeviceMergeSortBlockSortKernel
     THRUST_NS_QUALIFIER::cuda_cub::launcher::triple_chevron(
-      static_cast<int>(num_tiles), merge_sort_helper_t::policy_t::BLOCK_THREADS, 0, stream)
+      static_cast<int>(num_tiles), merge_sort_helper_t::policy_t::BLOCK_THREADS, 0, stream, true)
       .doit(
         DeviceMergeSortBlockSortKernel<
           typename PolicyHub::MaxPolicy,
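
The one-argument change above suggests why this was easy to miss: the four-argument call compiled without complaint, so the fifth launcher argument presumably has a default that leaves PDL off. The following is a minimal host-only sketch of that pattern, not the actual Thrust/CUB launcher. `mock_launcher`, `stream_t`, `requests_pdl`, and `demo` are hypothetical names, and the default value of `false` is an assumption inferred from the commit message.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a CUDA stream handle; this is NOT cudaStream_t.
using stream_t = void*;

// Hypothetical mock of a launcher whose trailing constructor argument is
// defaulted. The default of `false` is an assumption, not taken from CCCL.
struct mock_launcher
{
  int grid;
  int block;
  std::size_t shared_mem;
  stream_t stream;
  bool dependent_launch;

  mock_launcher(int grid_,
                int block_,
                std::size_t shared_mem_ = 0,
                stream_t stream_        = nullptr,
                bool dependent_launch_  = false)
      : grid(grid_)
      , block(block_)
      , shared_mem(shared_mem_)
      , stream(stream_)
      , dependent_launch(dependent_launch_)
  {}

  // Reports whether a launch built from this configuration would request PDL.
  bool requests_pdl() const
  {
    return dependent_launch;
  }
};

// Builds both call shapes from the diff and checks the resulting flag.
inline void demo()
{
  stream_t stream = nullptr;

  // Old call shape: the flag is omitted and silently takes its default.
  mock_launcher before(128, 256, 0, stream);

  // New call shape: the flag is passed explicitly.
  mock_launcher after(128, 256, 0, stream, true);

  assert(!before.requests_pdl());
  assert(after.requests_pdl());
}
```

Under these assumptions, both call shapes compile and differ only in the stored flag, so a device-side dependency sync paired with a launch that never opted in would not produce a compile-time diagnostic.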
