Add support for tensor parallelism and add OLMo2-26B model config / train script #841
Job | Run time |
---|---|
1m 34s | |
2m 1s | |
23s | |
55s | |
1m 26s | |
1m 2s | |
20s | |
22s | |
33s | |
45s | |
18s | |
0s | |
9m 39s |
Job | Run time |
---|---|
1m 34s | |
2m 1s | |
23s | |
55s | |
1m 26s | |
1m 2s | |
20s | |
22s | |
33s | |
45s | |
18s | |
0s | |
9m 39s |