You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I don’t understand why MMA in-flight is used in SS_WarpSpecialized. As shown in the diagram below, I’ve illustrated my understanding with a pipeline diagram. If there is MMA in-flight, then after the computation of one stage is completed, the producer cannot immediately read the next set of values. Instead, it has to wait until the next MMA finishes before reading the current stage’s producer values. Does this waiting make sense?
The text was updated successfully, but these errors were encountered:
I don’t understand why MMA in-flight is used in SS_WarpSpecialized. As shown in the diagram below, I’ve illustrated my understanding with a pipeline diagram. If there is MMA in-flight, then after the computation of one stage is completed, the producer cannot immediately read the next set of values. Instead, it has to wait until the next MMA finishes before reading the current stage’s producer values. Does this waiting make sense?
The text was updated successfully, but these errors were encountered: