feat: add mixed-precision and agc to gradaccum optimizer #542
Job | Run time |
---|---|
38s | |
1m 25s | |
2m 12s | |
13m 59s | |
1m 26s | |
1m 40s | |
12m 9s | |
1m 28s | |
1m 33s | |
12m 4s | |
1m 10s | |
12m 50s | |
1m 7s | |
13m 41s | |
13m 2s | |
7m 53s | |
7m 20s | |
7m 56s | |
7m 22s | |
7m 46s | |
7m 8s | |
15m 24s | |
13m 47s | |
15m 20s | |
15m 26s | |
27m 4s | |
28m 54s | |
4h 11m 44s |