Add support for tensor parallelism and add OLMo2-26B model config / train script #847
| Job | Run time |
|---|---|
| | 24s |
| | 3m 44s |
| | 1m 19s |
| | 1m 1s |
| | 4m 9s |
| | 20s |
| | 3m 6s |
| | 31s |
| | 21s |
| | 44s |
| | 24s |
| | 0s |
| Total | 16m 3s |