Add support for tensor parallelism and add OLMo2-26B model config / train script #849
| Job | Run time |
|---|---|
| | 3m 0s |
| | 5m 44s |
| | 18s |
| | 1m 23s |
| | 2m 19s |
| | 1m 3s |
| | 20s |
| | 25s |
| | 20s |
| | 46s |
| | 18s |
| | 0s |
| Total | 15m 56s |