Update README.md

Update the README to match the new llama.cpp quantization method: the 4-bit step now calls the compiled `./quantize` binary instead of `quantize.py`.
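
For anyone scripting this step, here is a minimal sketch of how the new invocation could be assembled. It is not part of the change itself. The helper name `build_quantize_command` is hypothetical, and the reading of the trailing `2` as the q4_0 type selector is an assumption based on the output filename in the diff, so check it against the usage text of your `quantize` build.

```python
import os

# Assumption: the last positional argument selects the quantization type,
# and 2 corresponds to q4_0 (inferred from the output filename in the diff).
ASSUMED_Q4_0_TYPE = 2


def build_quantize_command(model_dir, quantize_bin="./quantize",
                           qtype=ASSUMED_Q4_0_TYPE):
    """Return the argument list for quantizing the f16 model in model_dir.

    Hypothetical helper: it only builds the command and does not run it.
    """
    src = os.path.join(model_dir, "ggml-model-f16.bin")   # written by the convert step
    dst = os.path.join(model_dir, "ggml-model-q4_0.bin")  # 4-bit output file
    return [quantize_bin, src, dst, str(qtype)]


if __name__ == "__main__":
    # Prints the same command line that the README now documents.
    print(" ".join(build_quantize_command("./models/7B/")))
```

The resulting list can be passed to `subprocess.run(..., check=True)` once the `quantize` binary has been built.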
bofenghuang · Apr 4, 2023 · 6574401
1 parent 10a65e1
Showing 1 changed file with 1 addition and 1 deletion.
diff --git a/README.md b/README.md
@@ -127,7 +127,7 @@ tree models
 python convert-pth-to-ggml.py ./models/7B/ 1
 
 # further quantize the model to 4-bit
-python quantize.py 7B
+./quantize ./models/7B/ggml-model-f16.bin ./models/7B/ggml-model-q4_0.bin 2
 ```
 
 ### 5. Run the inference