-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Context and model enhancements #510
Merged
MarcusDunn
merged 8 commits into
utilityai:main
from
brittlewis12:context-and-model-enhancements
Sep 28, 2024
Merged
Context and model enhancements #510
MarcusDunn
merged 8 commits into
utilityai:main
from
brittlewis12:context-and-model-enhancements
Sep 28, 2024
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
* enables checking against all stop tokens defined by tokenizer for a given model -- EOS, EOT, etc
MarcusDunn
reviewed
Sep 25, 2024
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good. I'd like a signature change to reflect how negative numbers work. This is a great addition.
* enable interesting cache manipulation use cases, from removing recent messages, truncating non- special token stop sequences, & more. * express logic relying on negative values as Options of `u16`, to ensure positive values that fit into `i32` with safe conversion - this means sequence and llama_pos values above ~65k will not be addressable directly, and will need to use the `None` semantics
* return `Result`s to handle failed u32 -> i32 conversion * unify kv cache seq rm methods
brittlewis12
force-pushed
the
context-and-model-enhancements
branch
from
September 27, 2024 02:09
3fc30eb
to
7d1b2d5
Compare
MarcusDunn
approved these changes
Sep 27, 2024
Looks good. If Linux tests and Mac build pass I'll merge. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
As mentioned in #505 (comment), various enhancements to Context & Model capablities:
llama_token_is_eog
equivalentllama_kv_cache_seq_rm
Context method for kv cache manipulationflash_attn
,offload_kqv
context parameters