v0.6.1
Highlights:
- Automatic differentiation
- Fix CUDA data layout and stack alignment (#918) (by Yuanming Hu)
- CUDA backend
- Fix CUDA data layout and stack alignment (#918) (by Yuanming Hu)
- Examples
- Add
example/bitmasked.py
(#905) (by 彭于斌)
- Add
- GUI
- Better event filtering system (#801) (by 彭于斌)
- Language and syntax
- Scalar math functions can now be applied element-wisely to vectors/matrices (#891) (by 彭于斌)
Full changelog:
- [ir][refactor] Move all frontend stmts to
frontend_ir.h
(#916) (by Ye Kuang) - [CUDA] [AutoDiff] Fix CUDA data layout and stack alignment (#918) (by Yuanming Hu)
- [Example] Add example/bitmasked.py (#905) (by 彭于斌)
- [opengl] [bug] Move GLSLRuntime into a separate buffer, fix ti.random() bug on NVIDIA (#912) (by 彭于斌)
- [ir][refactor] First step to move Frontend IR to its own file (#914) (by Ye Kuang)
- [async] Avoid unnecessary list generations and activations (#913) (by Yuanming Hu)
- [Lang] Scalar math functions can now be applied element-wisely to vectors/matrices (#891) (by 彭于斌)
- [ir] Deprecate
FrontendAtomicStmt
(#907) (by Ye Kuang) - [ir] [refactor] Remove the global
current_block
(#908) (by Ye Kuang) - [ir][refactor] Pass a context object to
Expression::flatten()
(#901) (by Ye Kuang) - [test] Add an argument -r to rerun failed test (#904) (by 彭于斌)
- [GUI] Better event filtering system (#801) (by 彭于斌)