0.1.33
What's Changed
- updated llama.cpp by @github-actions in #115
- updated llama.cpp by @github-actions in #117
- Expose the complete API for dealing with KV cache and states by @zh217 in #116
- add with_main_gpu to LlamaModelParams by @danbev in #118
- updated llama cpp and removed cast to mut by @MarcusDunn in #119
New Contributors
Full Changelog: 0.1.32...0.1.33