Add support for tensor parallelism and add OLMo2-26B model config / train script #838
Job | Run time |
---|---|
| | 1m 26s |
| | 20s |
| | 1m 53s |
| | 1m 21s |
| | 56s |
| | 56s |
| | 20s |
| | 23s |
| | 26s |
| | 41s |
| | 20s |
| | 0s |
Total | 9m 2s |