Dynamic memory allocation. Drop Baichuan/InternLM support in favor of llama.cpp. #248
Job | Run time |
---|---|
4m 36s | |
3m 9s | |
3m 2s | |
2m 55s | |
2m 53s | |
2m 11s | |
3m 9s | |
2m 57s | |
1m 52s | |
26m 44s |
Job | Run time |
---|---|
4m 36s | |
3m 9s | |
3m 2s | |
2m 55s | |
2m 53s | |
2m 11s | |
3m 9s | |
2m 57s | |
1m 52s | |
26m 44s |