Add support for tensor parallelism and add OLMo2-26B model config / train script #846
| Job | Run time |
|---|---|
| | 19s |
| | 1m 42s |
| | 1m 24s |
| | 3m 37s |
| | 4m 57s |
| | 56s |
| | 24s |
| | 24s |
| | 20s |
| | 43s |
| | 22s |
| | 0s |
| Total | 15m 8s |