-
Notifications
You must be signed in to change notification settings - Fork 54
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Marcus Dunn
committed
Sep 6, 2024
1 parent
ea798fa
commit fc2551f
Showing
1 changed file
with
1 addition
and
1 deletion.
There are no files selected for viewing
Submodule llama.cpp
updated
31 files
+1 −1 | .devops/full-cuda.Dockerfile | |
+46 −7 | .devops/nix/devshells.nix | |
+8 −10 | .devops/nix/nixpkgs-instances.nix | |
+36 −0 | .devops/nix/package-gguf-py.nix | |
+153 −229 | .devops/nix/package.nix | |
+66 −0 | .devops/nix/python-scripts.nix | |
+31 −9 | .devops/nix/scope.nix | |
+1 −1 | .github/workflows/build.yml | |
+4 −4 | CMakePresets.json | |
+3 −21 | README.md | |
+4 −0 | common/common.cpp | |
+1 −1 | docs/docker.md | |
+43 −22 | examples/llama-bench/README.md | |
+88 −56 | examples/llama-bench/llama-bench.cpp | |
+2 −2 | examples/main/main.cpp | |
+313 −446 | examples/server/server.cpp | |
+19 −19 | examples/server/tests/features/steps/steps.py | |
+3 −0 | examples/server/tests/features/wrong_usages.feature | |
+33 −0 | examples/server/utils.hpp | |
+6 −6 | flake.lock | |
+4 −1 | flake.nix | |
+612 −0 | ggml/src/ggml-aarch64.c | |
+9 −1 | ggml/src/ggml-backend.c | |
+12 −2 | ggml/src/ggml-cuda.cu | |
+19 −16 | ggml/src/ggml-cuda/cpy.cu | |
+2 −2 | ggml/src/ggml-sycl/dmmv.cpp | |
+3 −0 | ggml/src/ggml.c | |
+1 −0 | gguf-py/pyproject.toml | |
+1 −1 | grammars/README.md | |
+1 −1 | pyproject.toml | |
+47 −40 | src/llama.cpp |