Conversation
|
!test |
Description
|
| Relevant files | |||
|---|---|---|---|
| Tests |
|
PR Reviewer Guide
Here are some key observations to aid the review process:
| 🧪 No relevant tests |
| 🔒 No security concerns identified |
| ⚡ Recommended focus areas for review |
Removed Option Guard
isOptionEnabled(EnableOption::TmaReduction) check, which means TMA reduction will now be enabled whenever mayUseTma(props) returns true, regardless of configuration. This removes important runtime control over TMA reduction behavior and could lead to unintended performance regressions or correctness issues in production environments. |
Test failures
-
(Medium, 4)
nvFuser contiguity.size() mismatch in test_multidevice::test_welford across multiple runnersTest Name GB200 GB200 (dist.) H100 H100 (dist.) Source tests.python.multidevice.test_multidevice.test_welford ❌ ❌ ❌ ❌ -
(Medium, 2)
NVFuser internal assert on cpAsync Bulk in tests.python.direct.test_repro::test_shared_memory_usageTest Name GB200 Source tests.python.direct.test_repro.test_shared_memory_usage[nvfuser_direct_test=eager] ❌ tests.python.direct.test_repro.test_shared_memory_usage[nvfuser_direct_test=lru_cache] ❌ -
(Medium, 2)
NVFuser internal assertion failure in NVFuserTest.InnerReductionUnrollVectorization across multiple runnersTest Name GB200 H100 Source NVFuserTest.InnerReductionUnrollVectorization ❌ ❌ Link
No description provided.