Slow speed Vicuna - 7B Help plz #45

C0deXG · 2023-05-13T18:53:30Z

When I ask a question, it is very slow; it takes forever to write one sentence. How can I make it faster? I'm using Vicuna 7B to keep it lightweight, and I'm on macOS with an M2 chip, but that doesn't even help :( Can I host gpt-llama.cpp on Render? If so, when I run sh ./scripts/test-installation.sh, what should I put for the port and the location of the file, since I'd be using Render to run the model to make it faster?

C0deXG · 2023-05-13T20:36:33Z

Follow-up: if I use Render, for example, and run sh ./scripts/test-installation.sh on my PC or somewhere else, it asks me for the port I'm running on. Since Render is URL-based, how do I get this to work in a web-based setup, or host the backend/model, and where should I host it?

keldenl · 2023-05-13T21:03:41Z

Try using mlock; that has historically helped me when I've had memory issues.
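
mlock asks the OS to keep the model pinned in RAM instead of swapping it out, so it is only likely to help when the model file fits in physical memory with room to spare. A rough sketch of that check in Python; the helper names and the 25% headroom figure are assumptions for illustration, not part of gpt-llama.cpp:

```python
import os

def fits_in_ram(model_bytes: int, total_ram_bytes: int, headroom: float = 0.25) -> bool:
    """Return True if the model leaves at least `headroom` of RAM free.

    The 25% default is an arbitrary rule of thumb, not a measured value.
    """
    if model_bytes < 0 or total_ram_bytes <= 0:
        raise ValueError("sizes must be positive")
    return model_bytes <= total_ram_bytes * (1.0 - headroom)

def physical_ram_bytes() -> int:
    """Total physical memory as reported by the OS (Unix-like systems)."""
    return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES")
```

For example, calling fits_in_ram(os.path.getsize(path_to_model), physical_ram_bytes()) with your own model path gives a quick yes/no before turning mlock on.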

msj121 · 2023-06-11T15:52:48Z

Also, lowering the thread count sometimes helps, because too many threads can oversaturate the CPU, or some of the work may land on a slower worker thread.
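
One way to pick a starting thread count, sketched in Python. On Apple Silicon, sysctl reports the number of performance cores under hw.perflevel0.physicalcpu (a base M2 has 4 performance and 4 efficiency cores). The function names and the "half of the logical CPUs" fallback are assumptions for illustration; treat the result as a first guess to benchmark, not a guaranteed optimum:

```python
import os
import subprocess
from typing import Optional

def suggest_threads(logical_cpus: int, performance_cores: Optional[int] = None) -> int:
    """Suggest a thread count: performance cores if known, else half the logical CPUs."""
    if logical_cpus < 1:
        raise ValueError("logical_cpus must be at least 1")
    if performance_cores is not None and performance_cores > 0:
        # Never suggest more threads than the machine has CPUs.
        return min(performance_cores, logical_cpus)
    # Fallback heuristic (an assumption): use half of the logical CPUs.
    return max(1, logical_cpus // 2)

def apple_performance_cores() -> Optional[int]:
    """Read the performance-core count via sysctl; None if unavailable."""
    try:
        result = subprocess.run(
            ["sysctl", "-n", "hw.perflevel0.physicalcpu"],
            capture_output=True, text=True, check=True,
        )
        return int(result.stdout.strip())
    except (OSError, subprocess.CalledProcessError, ValueError):
        return None

if __name__ == "__main__":
    print(suggest_threads(os.cpu_count() or 1, apple_performance_cores()))
```

Try the suggested value first, then go one or two threads lower or higher and compare how fast tokens are generated.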
