-
Notifications
You must be signed in to change notification settings - Fork 7.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Private GPT has infinite loop of responce #2112
Comments
Can you give us more details about your environment? Probably, it will related to GPU and vRAM. |
Full logs: |
It should work equally using postman and requests. Can you increate request timeout?
And two mode things to take into account:
|
The main problem is that when I send a request, even through Postman, the response is generated multiple times and degrades each time. 2024-11-04 15:36:54 13:36:54.133 [INFO ] httpx - HTTP Request: POST http://localhost:11434/api/chat "HTTP/1.1 200 OK" After that its generate responce again. I need somehow accept only first responce. |
Question
I have an issue with Private GPT:
When I send a prompt or chat completion with a large context (file size > 5 KB or multiple context files), the chat takes a long time to generate a response but never sends it. It just keeps generating a response, and the delay gets worse. Eventually, it sends a timeout error.
I don’t know how to fix this. I need to get its initial response, but in the end, I don’t receive anything
The text was updated successfully, but these errors were encountered: