[BUG] Memory corruption/undefined behavior on GemmUniversal in 3.4.0 - 3.6.0 🐛 #2017
Comments
Looks like you're rolling your own build system and command line flags. We don't support that. Are you able to reproduce this using our build flags and CMake? Also, please specify your CUDA toolkit version.
Hello @thakkarV, the same issue occurs when building with CMake.
1. Copied the file gather_scatter_fusion1.cu to the cutlass/examples/36_gather_scatter_fusion directory.
Again, gather_scatter_fusion1.cu must be the first in the list.
3. Ran the commands:
4. Got the error:
nvcc --version output:
Does it work correctly if you use a release build instead of debug?
Yes, it works correctly when using the GCC host compiler (Release configuration).
Description of the bug:
Affected versions: 3.4.0 through 3.6.0, including the releases in between.
When using the example cutlass/examples/36_gather_scatter_fusion/gather_scatter_fusion.cu and linking it with some other code (attached to this report), I get the error:
Steps to reproduce:
2.1) Make a copy of the example as gather_scatter_fusion1.cu.
2.2) Remove the main function and rename the run function to run2 in gather_scatter_fusion1.cu.
2.3) Remove the code at the end of the function run2, back to the status = gemm_op(); statement, removing that statement as well.
IMPORTANT: gather_scatter_fusion1.cu MUST be the first file in the nvcc command to reproduce the error; otherwise everything works fine!
Output result:
But expected:
Additional notes:
cudaLaunchKernel: in the debugger, the arguments passed to it look reasonable.
cudaGetLastError() returns cudaErrorInvalidValue.
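For readers who want to see where such an error becomes visible, here is a minimal sketch of checking the launch status with cudaGetLastError() right after a kernel launch. It is not the code from this report: noop_kernel and launch_and_check are hypothetical names used only for illustration, and the kernel is a trivial placeholder rather than the CUTLASS GemmUniversal kernel.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical placeholder kernel; the failing launch in this report is the
// CUTLASS GemmUniversal kernel, not a kernel like this one.
__global__ void noop_kernel(int* out) {
  if (threadIdx.x == 0 && blockIdx.x == 0) {
    *out = 1;
  }
}

// Hypothetical helper: launches the kernel and returns the first error seen,
// either at launch time or while the kernel executes.
cudaError_t launch_and_check() {
  int* d_out = nullptr;
  cudaError_t err = cudaMalloc(&d_out, sizeof(int));
  if (err != cudaSuccess) {
    return err;
  }

  noop_kernel<<<1, 32>>>(d_out);

  // Launch-time failures (for example cudaErrorInvalidValue) are reported here.
  err = cudaGetLastError();
  if (err == cudaSuccess) {
    // Failures raised while the kernel runs are reported on synchronization.
    err = cudaDeviceSynchronize();
  }
  if (err != cudaSuccess) {
    std::fprintf(stderr, "CUDA error %s: %s\n",
                 cudaGetErrorName(err), cudaGetErrorString(err));
  }

  cudaFree(d_out);
  return err;
}
```

Calling launch_and_check() from a main function and comparing the result with cudaSuccess separates a launch-time rejection from a failure that only appears during execution.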
Environment:
GPU: A100
nvidia-smi: 470.161.03 CUDA Version: 11.4
gcc version: 13.3.0 (Ubuntu 13.3.0-6ubuntu2~24.04)
clang version: 18.1.3 (1ubuntu1)
Docker container:
nvcr.io/nvidia/cuda:12.6.3-devel-ubuntu24.04
Also reproduced on:
nvcr.io/nvidia/cuda:12.4.1-devel-ubuntu22.04
with corresponding default versions of tools/compilers.
gather_scatter_fusion1.cu.txt
gather_scatter_fusion.cu.txt
cc: @IonThruster