feat: add mixed-precision and agc to gradaccum optimizer #548
Job | Run time |
---|---|
34s | |
1m 31s | |
1m 6s | |
1m 24s | |
1m 50s | |
1m 7s | |
1m 11s | |
1m 29s | |
1m 7s | |
1m 10s | |
1m 3s | |
1m 6s | |
1m 8s | |
1m 6s | |
1m 3s | |
1m 8s | |
59s | |
1m 1s | |
1m 4s | |
1m 16s | |
1m 0s | |
58s | |
1m 9s | |
1m 6s | |
1s | |
1m 12s | |
1m 13s | |
30m 2s |