Add auto-gptq integration #175
base: main
Conversation
Domestic (Chinese) mirror indexes may not have synced auto-gptq yet, so you need to point pip at the official index when installing dependencies.
Thanks for your PR. I took a look at auto-gptq's installation: by default it reinstalls torch and the CUDA extension, which doesn't feel friendly for most users. Could you design a minimal pip-install dependency set for MOSS that can be installed conveniently on top of an existing environment?
@PanQiWei With auto-gptq installed, does quantization no longer require setting up the CUDA environment yourself and building the wheel and PyTorch extension from the GPTQ source? Does auto-gptq require a matching PyTorch/CUDA version, or a particular transformers version?
@Hzfinfdu Regarding ...
Added the use of ...
Has the code not been merged into the main repo yet because there are problems?
I haven't done full end-to-end testing yet. auto-gptq has also released a new version, so compatibility needs to be tested as well; I'll try to get to it over the weekend.
Use auto-gptq to simplify the code and the quantization workflow. With this change, users can run inference with the quantized model whether or not triton is installed, and can even run it on CPU.
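As a rough sketch of what the description above implies, the snippet below loads a GPTQ-quantized checkpoint via auto-gptq's `AutoGPTQForCausalLM.from_quantized` API. The model path is a placeholder (not the actual MOSS checkpoint name), and the kwargs helper is illustrative, not part of this PR.

```python
# Hedged sketch: loading a GPTQ-quantized model with auto-gptq.
# Assumes the classic auto_gptq API; model path below is a placeholder.

def quantized_load_kwargs(device: str = "cpu", use_triton: bool = False) -> dict:
    """Options reflecting the PR description: inference works with or
    without triton installed, and can even run on CPU (device="cpu")."""
    return {"device": device, "use_triton": use_triton}

def load_quantized(model_path: str, **overrides):
    """Load a quantized checkpoint. Not executed here: it requires
    auto-gptq to be installed and the quantized weights on disk."""
    from auto_gptq import AutoGPTQForCausalLM  # deferred import on purpose
    kwargs = quantized_load_kwargs()
    kwargs.update(overrides)
    return AutoGPTQForCausalLM.from_quantized(
        model_path, trust_remote_code=True, **kwargs
    )
```

To use a GPU with the triton kernels instead, one would pass something like `load_quantized("path/to/checkpoint", device="cuda:0", use_triton=True)`; the defaults above show the CPU fallback path the description mentions.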