This repository has been archived by the owner on Mar 21, 2024. It is now read-only.
CUB 1.4.1
Summary
CUB 1.4.1 is a minor release.
Enhancements
- Allow
cub::DeviceRadixSort
andcub::BlockRadixSort
on bool types.
Bug Fixes
- Fix minor CUDA 7.0 performance regressions in
cub::DeviceScan
andcub::DeviceReduceByKey
. - Remove requirement for callers to define the
CUB_CDP
macro when invoking CUB device-wide rountines using CUDA dynamic parallelism. - Fix headers not being included in the proper order (or missing includes) for some block-wide functions.