Replies: 2 comments 1 reply
-
I've thought of using a C++ implementation of those models.
-
Following from #44: soon there'll be a release of 4-bit quantization directly from Hugging Face itself, so maybe we don't really need to do anything? lol
-
I have a suggestion to make this project even better: just use koboldcpp, because it can run Pygmalion 6B with only 4 GB of RAM and can be GPU-accelerated. A reply takes about 12 seconds. With this, AIwaifu can be smarter because it uses the 6B model.
Link to koboldcpp: https://github.com/LostRuins/koboldcpp
I also made a koboldcpp version of AIwaifu: https://github.com/andri-jpg/AIwaifu-png
It's just that I interact with the localhost server using Selenium; I haven't been able to integrate with it directly from Python.
For a demo, you can see here: https://youtu.be/TzU27v9Hf6Q
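The Selenium workaround may not be necessary: koboldcpp exposes a KoboldAI-compatible HTTP API on localhost, so Python can call it directly. A minimal sketch, assuming koboldcpp is running on its default port 5001 and serving the `/api/v1/generate` endpoint (the port and payload fields here are assumptions; check your koboldcpp version's API docs):

```python
import json
import urllib.request

# Assumed default koboldcpp endpoint; adjust host/port to your setup.
KOBOLD_URL = "http://localhost:5001/api/v1/generate"

def build_payload(prompt: str, max_length: int = 80) -> dict:
    """Build the JSON body for the Kobold generate endpoint."""
    return {
        "prompt": prompt,
        "max_length": max_length,  # number of tokens to generate
        "temperature": 0.7,        # sampling temperature
    }

def generate(prompt: str) -> str:
    """POST the prompt to the local koboldcpp server and return the reply text."""
    req = urllib.request.Request(
        KOBOLD_URL,
        data=json.dumps(build_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        # The Kobold API returns {"results": [{"text": "..."}]}
        return json.load(resp)["results"][0]["text"]
```

This removes the browser dependency entirely, so AIwaifu could talk to the model with a plain HTTP request instead of driving the web UI.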