Some question about nvcc compile options #1290

MARD1NO · 2024-01-04T07:32:31Z

MARD1NO
Jan 4, 2024

Recently I want to integrate some cutlass operators into PyTorch, and I found in CUDAExtension we can pass some compile options. In other project it usually set:

"--ptxas-options=-O2",
"--ptxas-options=-allow-expensive-optimizations=true",

In my GEMM test, I found turn off these 2 options will get better performance, but there is no documentations about ptx. I just want to know what these options means, and hope cutlass develop team will give me some suggestions! :D

Answered by hwu36

Jan 5, 2024

when you compile cutlass, just use cutlass's nvcc flags. that is what nvcc team uses to optimize cutlass.

View full answer

hwu36 · 2024-01-05T21:29:55Z

hwu36
Jan 5, 2024
Maintainer

when you compile cutlass, just use cutlass's nvcc flags. that is what nvcc team uses to optimize cutlass.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some question about nvcc compile options #1290

{{title}}

Replies: 1 comment

{{title}}

Select a reply

Some question about nvcc compile options #1290

MARD1NO Jan 4, 2024

Replies: 1 comment

hwu36 Jan 5, 2024 Maintainer

MARD1NO
Jan 4, 2024

hwu36
Jan 5, 2024
Maintainer