Add support for tensor parallelism and add OLMo2-26B model config / train script #842
| Job | Run time |
|---|---|
| | 1m 47s |
| | 30s |
| | 2m 58s |
| | 1m 21s |
| | 1m 3s |
| | 2m 8s |
| | 21s |
| | 26s |
| | 22s |
| | 53s |
| | 19s |
| | 0s |
| Total | 12m 8s |