Pull requests: NVIDIA/cutlass
Pull requests list
GitHub OSS CI Integration with Internal CI and Workflow
#2592
opened Aug 25, 2025 by
zekunf-nv
fix: missing argparse argument static_shape for cutedsl ampere sgemm example
#2568
opened Aug 14, 2025 by
a-r-r-o-w
Add missing CUDA_ARCH guard for __nanosleep in example
#2558
opened Aug 11, 2025 by
Flamefire
fix: too early producer commit in examples/77_blackwell_fmha/kernel/sm100_fmha_mla_tma_warpspecialized.hpp
#2502
opened Jul 24, 2025 by
soundOfDestiny
Fix: Dangerous Code Execution Function Could Allow External Attacks in python/CuTeDSL/base_dsl/typing.py
inactive-30d
#2465
opened Jul 14, 2025 by
kira-offgrid
Mixed Precision Grouped Gemm with zero points and GPT-Q semantics closes #2261
#2457
opened Jul 11, 2025 by
ankutalev
Add SM80/89 blockwise scaling kernel, support FP8 block/groupwise on Ada, INT8 on Ampere
#2328
opened May 24, 2025 by
solrex
Fix CMake build error and -Wdeprecated-literal-operator warnings
#2305
opened May 15, 2025 by
BwL1289
Support N={48, 80, 96, 112, ...} for SM100 EpilogueTileAuto
inactive-30d
inactive-90d
#2269
opened Apr 29, 2025 by
Algy
Limit the number of SMs (sm_count) to user-provided value during profiling.
inactive-30d
inactive-90d
#2257
opened Apr 22, 2025 by
manishucsd