You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Passed block/warptile shapes, for example, are
<32,128,64>/<16,64,64>,
<32,128,64>/<32,32,64>,
<32,128,64>/<32,64,64>,
<32,256,64>/<32,64,64>,
<64,32,64>/<32,32,64>,
<64,32,64>/<64,32,64>, etc.
xiaonans
changed the title
[BUG] Modifying the block or warptile shapes and the output datatype in the unit test causes the tests to fail.
[BUG] Modifying the block/warptile shapes and the output datatype in the unit test causes the tests to fail.
Jan 16, 2025
Describe the bug
I modified the block/warptile shapes and the output datatype in https://github.com/NVIDIA/cutlass/blob/main/test/unit/gemm/device/gemm_s8t_s8n_s32t_tensor_op_s32_sm80.cu, and found some shapes cause the tests to fail.
I modified the ElementOutput to cutlass::half_t and tested various block/warptile shapes. While some shapes passed, others failed.
Passed block/warptile shapes, for example, are
<32,128,64>/<16,64,64>,
<32,128,64>/<32,32,64>,
<32,128,64>/<32,64,64>,
<32,256,64>/<32,64,64>,
<64,32,64>/<32,32,64>,
<64,32,64>/<64,32,64>, etc.
Failed block/warptile shapes, I found, are
<16,16,64>/<16,16,64>,
<16,128,64>/<16,128,64>,
<16,256,64>/<16,128,64>,
<32,16,64>/<32,16,64>,
<32,128,64>/<16,128,64>,
<32,128,64>/<32,128,64>,
<32,256,64>/<16,128,64>,
<32,256,64>/<32,128,64>,
<64,16,64>/<64,16,64>,
<64,128,64>/<16,128,64>,
<64,128,64>/<32,128,64>,
<64,128,64>/<64,128,64>,
<64,256,64>/<16,128,64>,
<64,256,64>/<32,128,64>,
<64,256,64>/<64,128,64>,
<128,16,64>/<128,16,64>,
<128,32,64>/<128,32,64>,
<128,64,64>/<128,64,64>,
<128,128,64>/<16,128,64>,
<128,128,64>/<32,128,64>,
<128,128,64>/<64,128,64>,
<128,256,64>/<128,128,64>,
<256,32,64>/<128,32,64>,
<256,64,64>/<128,64,64>,
<256,128,64>/<32,128,64>,
<256,128,64>/<64,128,64>,
<256,128,64>/<128,128,64>.
Steps/Code to reproduce bug
The text was updated successfully, but these errors were encountered: