Releases · utilityai/llama-cpp-rs
0.1.35
What's Changed
- small cleanup to pin code by @MarcusDunn in #123 (potentially breaking)
- updated llama.cpp by @github-actions in #124
- updated llama.cpp by @github-actions in #125
- updated llama.cpp by @github-actions in #126
- Bump docker/setup-buildx-action from 3.0.0 to 3.1.0 by @dependabot in #129
- updated llama.cpp by @github-actions in #128
- updated llama.cpp by @github-actions in #131
Full Changelog: 0.1.34...0.1.35
0.1.34
What's Changed
- Add CPU Feature Support by @Hirtol in #121
- override model values by @MarcusDunn in #120
- prep 0.1.34 by @MarcusDunn in #122
Full Changelog: 0.1.33...0.1.34
0.1.33
What's Changed
- updated llama.cpp by @github-actions in #115
- updated llama.cpp by @github-actions in #117
- Expose the complete API for dealing with KV cache and states by @zh217 in #116
- add with_main_gpu to LlamaModelParams by @danbev in #118 (see the sketch after this list)
- updated llama cpp and removed cast to mut by @MarcusDunn in #119
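A minimal sketch of the with_main_gpu setter from #118, assuming a builder-style method on LlamaModelParams that takes the GPU index as an integer; the module path and exact signature are assumptions, so check the crate docs:

```rust
// Minimal sketch, assuming LlamaModelParams lives at
// llama_cpp_2::model::params and with_main_gpu takes an integer GPU index.
use llama_cpp_2::model::params::LlamaModelParams;

fn main() {
    // main_gpu selects the device that holds the model when layers are
    // offloaded; only meaningful on multi-GPU builds.
    let params = LlamaModelParams::default().with_main_gpu(0);
    let _ = params; // pass to model loading in real code
}
```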
Full Changelog: 0.1.32...0.1.33
0.1.32
What's Changed
- updated llama.cpp by @github-actions in #105
- Bump cc from 1.0.83 to 1.0.88 by @dependabot in #106
- added more sampling options by @MarcusDunn in #110
- updated llama.cpp by @github-actions in #111
- Expose functions llama_load_session_file and llama_save_session_file by @zh217 in #112 (see the sketch after this list)
- Improved docs for new sampling options by @MarcusDunn
- Fix clippy errors by @MarcusDunn
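The session functions mirror llama.cpp's llama_save_session_file and llama_load_session_file. A hedged sketch of the round trip, assuming #112 exposed them as safe methods on LlamaContext with these (unverified) names and signatures:

```rust
// Hedged sketch: persist and restore a session (KV cache + evaluated tokens).
// Method names and signatures are assumptions modelled on the C API.
use llama_cpp_2::context::LlamaContext;
use llama_cpp_2::token::LlamaToken;

fn save(ctx: &LlamaContext, tokens: &[LlamaToken]) {
    // Writes the KV cache and token history so a later run can skip
    // re-evaluating the prompt.
    ctx.save_session_file("session.bin", tokens)
        .expect("failed to save session");
}

fn restore(ctx: &mut LlamaContext, max_tokens: usize) -> Vec<LlamaToken> {
    // Reloads the saved state and returns the tokens that were active.
    ctx.load_session_file("session.bin", max_tokens)
        .expect("failed to load session")
}
```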
Full Changelog: 0.1.31...0.1.32
0.1.31
What's Changed
- added docs for cublas support by @MarcusDunn in #103
- added with_use_mlock and llama_supports_mlock by @MarcusDunn in #104 (see the sketch after this list)
- moved simple to its own binary for easier use + faster compile times by @MarcusDunn in #101
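A short sketch of the additions in #104, assuming llama_supports_mlock is a free function at the crate root and with_use_mlock is a builder setter on LlamaModelParams; the names come from the notes above, the paths are guesses:

```rust
// Sketch: pin model pages in RAM only when the platform supports mlock.
// Crate paths are assumptions; the function/method names are from #104.
use llama_cpp_2::model::params::LlamaModelParams;

fn model_params() -> LlamaModelParams {
    let params = LlamaModelParams::default();
    if llama_cpp_2::llama_supports_mlock() {
        // use_mlock asks the OS to keep the model resident, avoiding
        // page-outs at the cost of locked memory.
        params.with_use_mlock(true)
    } else {
        params
    }
}
```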
Full Changelog: 0.1.30...0.1.31
0.1.30
Fixed ggml-metal.h not being included in the release.
Full Changelog: 0.1.29...0.1.30
0.1.29
h/t @SilasMarvin for bringing metal support across the finish line!
What's Changed
- updated llama.cpp by @github-actions in #95
- updated llama.cpp by @github-actions in #97
- updated llama.cpp by @github-actions in #98
- updated llama.cpp by @github-actions in #100
- Working build.rs for apple metal by @SilasMarvin in #96
- attempt to add metal on mac by @MarcusDunn in #65
- Prep 0 1 29 by @MarcusDunn in #102
Full Changelog: 0.1.28...0.1.29
0.1.28
Known Breaking
init_numa has been modified to accept an enum instead of a boolean (see the sketch below).
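A before/after sketch of the change, assuming the crate mirrors llama.cpp's ggml_numa_strategy as a NumaStrategy enum; the variant and path names are assumptions:

```rust
// Sketch of the breaking init_numa change. NumaStrategy and its variants
// are assumed to mirror llama.cpp's ggml_numa_strategy enum.
use llama_cpp_2::llama_backend::{LlamaBackend, NumaStrategy};

fn init() -> LlamaBackend {
    // Before 0.1.28: LlamaBackend::init_numa(true)
    // From 0.1.28: choose an explicit strategy instead of a boolean.
    LlamaBackend::init_numa(NumaStrategy::Distribute)
        .expect("backend was already initialized")
}
```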
What's Changed
- Bump anyhow from 1.0.79 to 1.0.80 by @dependabot in #89
- Bump clap from 4.5.0 to 4.5.1 by @dependabot in #90
- prep 0.1.28 by @MarcusDunn in #94
- Process user defined tokens by @SilasMarvin in #93
- updated llama.cpp (includes breaking backend init changes) by @github-actions in #92
New Contributors
- @SilasMarvin made their first contribution in #93
Full Changelog: 0.1.27...0.1.28
0.1.27
What's Changed
- updated llama.cpp by @github-actions in #83
- updated llama.cpp by @github-actions in #84
- updated llama.cpp by @github-actions in #85
Full Changelog: 0.1.26...0.1.27
0.1.26
What's Changed
- updated llama.cpp by @github-actions in #63
- exposed n_threads by @MarcusDunn in #66 (see the sketch after this list)
- updated llama.cpp by @github-actions in #68
- Bump bindgen from 0.69.2 to 0.69.4 by @dependabot in #69
- update-toml fix by @sepehr455 in #73
- updated llama.cpp by @github-actions in #77
- Bump clap from 4.4.18 to 4.5.0 by @dependabot in #80
- Bump thiserror from 1.0.56 to 1.0.57 by @dependabot in #81
- updated llama.cpp by @github-actions in #79
- Add Windows MSVC support by @Systemcluster in #78
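A minimal sketch of the knob exposed in #66, assuming a builder-style with_n_threads setter on LlamaContextParams; the setter name and the thread-count type are unverified:

```rust
// Sketch: set the number of threads used for token generation.
// Path and setter name are assumptions based on the note in #66.
use llama_cpp_2::context::params::LlamaContextParams;

fn context_params() -> LlamaContextParams {
    // Generation threads; llama.cpp also has a separate batch thread count.
    LlamaContextParams::default().with_n_threads(8)
}
```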
New Contributors
- @Systemcluster made their first contribution in #78
Full Changelog: 0.1.25...0.1.26