rdspring1

Organizations

@RUSH-LAB

Pinned

  1. NVIDIA/Fuser Public

    A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

    C++ 314 55

  2. RUSH-LAB/LSH_Memory Public

    One-Shot Learning using Nearest-Neighbor Search (NNS) and Locality-Sensitive Hashing (LSH)

    Python 73 16

  3. PyTorch_GBW_LM Public

    PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

    Python 123 20

  4. Count-Sketch-Optimizers Public

    A compressed adaptive optimizer for training large-scale deep learning models using PyTorch

    Python 27 13

  5. LSH-Mutual-Information Public

    Use LSH Sampling for Mutual Information Estimation

    Python 5

  6. lightning-thunder Public

    Forked from Lightning-AI/lightning-thunder

    Source-to-source compiler for PyTorch. It makes PyTorch programs faster on single accelerators and in distributed settings.

    Python

536 contributions in the last year

Activity overview

rdspring1's contributions from March 24, 2024 to March 24, 2025 were 45% code review, 27% commits, 25% pull requests, and 3% issues.

Contribution activity

March 2025

Created 6 commits in 1 repository

Created a pull request in NVIDIA/Fuser that received 9 comments

Enforce shared memory alignment for TMA LoadStoreOps

This PR enforces the byte alignment requirements for TMA LoadStoreOps, which prevents illegal memory accesses (IMA) and incorrect results. If TMA LoadStoreOp is not detecte…

+51 −6 lines changed 9 comments
Opened 4 other pull requests in 1 repository
NVIDIA/Fuser 3 open 1 merged
Reviewed 21 pull requests in 1 repository
Opened 1 issue in 1 repository
NVIDIA/Fuser 1 open