-
Building efficient algorithms for training Machine Learning models [1]
-
Developing a cross-platform 128-bit floating-point dtype for NumPy
- ML Systems and Performance Engineering
- Developing better algorithms for efficient Model Training
- GPU programming and optimizations
- Distributed Systems & large-scale compute
Get in touch:
- X/swayaminsync
- Portfolio
- Personal Email [email protected]




