System Info
V100 * 8
Running Xinference with Docker?
Version info
13.3
The command used to start Xinference
xinference-local --host 0.0.0.0 --port 8080
Reproduction
GLM4 is currently deployed on this machine across two V100 GPUs, but GPU memory is running short. How can two more GPUs be added seamlessly, without interrupting the running service?
Expected behavior
Be able to scale the running GLM4 deployment from two to four GPUs without downtime.
Use n-gpu to specify the number of GPUs.
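As a rough sketch only, a launch across four GPUs from the CLI could look like the lines below. The endpoint follows from the start command above (port 8080); the model name glm4-chat, the pytorch format, and the 9B size are assumptions about this deployment, and exact flags can differ between Xinference versions.

    # launch GLM4 across 4 GPUs instead of 2; --n-gpu sets how many cards the model uses
    xinference launch --endpoint http://localhost:8080 \
        --model-name glm4-chat \
        --model-format pytorch \
        --size-in-billions 9 \
        --n-gpu 4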
Does that mean the backend service needs to be restarted? @qinxuye
No, just stop the model and launch it again.
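In other words, the scale-up is a terminate-and-relaunch cycle, roughly as sketched below; the model UID glm4-chat is a placeholder, and xinference terminate is assumed here as the CLI counterpart of stopping a launched model.

    # stop the 2-GPU instance; the model is unavailable from here until the relaunch completes
    xinference terminate --endpoint http://localhost:8080 --model-uid glm4-chat
    # then relaunch with --n-gpu 4 as in the sketch above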
So the model is briefly unavailable during that, right?
Is there a way to scale up directly without stopping the model service?
The open-source version does not support that feature.