-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] CU_TENSOR_MAP_DATA_TYPE_16U6_ALIGN16B isnt defiend in CUDA toolkit 12.4 #2065
Labels
Comments
same issue,according to the reply, version v3.7.0-2-g389e4930 was used,even compiling any example of running example will return this same information. |
This is blocking the upgrade of CUTLASS in PyTorch to 3.8 as well, which is in turn blocking full PyTorch Blackwell support. |
Noted. We are working on a fix ASAP |
hopefully it is fixed in #2066 |
Hello together, many thanks i works now. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Describe the bug
CU_TENSOR_MAP_DATA_TYPE_16U6_ALIGN16B is undefined in CUDA toolkit 12.4. See Screenshot
Steps/Code to reproduce bug
mkdir cutlass/build
cd build
cmake .. -DCUTLASS_NVCC_ARCHS="89"
Expected behavior
see screenshot
Environment details (please complete the following information):
NVIDIA GeForce RTX 4070 Ti
Additional context
workaround ist update to cuda toolkit 12.8
The text was updated successfully, but these errors were encountered: