Add support for tensor parallelism and add OLMo2-26B model config / train script #843
| Job | Run time |
|---|---|
| | 1m 28s |
| | 19s |
| | 2m 56s |
| | 1m 21s |
| | 2m 14s |
| | 1m 4s |
| | 22s |
| | 32s |
| | 33s |
| | 50s |
| | 19s |
| | 0s |
| Total | 11m 58s |