You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
File "/opt/conda/lib/python3.10/site-packages/trio-0.24.0-py3.10.egg/trio/_core/_io_epoll.py", line 203, in EpollIOManager
_epoll: select.epoll = attr.ib(factory=select.epoll)
AttributeError: module 'eventlet.green.select' has no attribute 'epoll'
还有就是:单卡模型推理服务 ,加载模型后隔一小段时间后就会崩溃,然后出现以下提示
ValueError: The model's max seq len (8192) is larger than the maximum number of tokens that can be stored in KV cache (5392). Try increasing gpu_memory_utilization or decreasing max_model_len when initializing the engine.
运行的环境是魔搭阿里云DSW,T_T。。。求大佬帮忙解决
The text was updated successfully, but these errors were encountered:
kagentsys --query="刘德华老婆是谁?" --llm_name="kagentlms_qwen_7b_mat" --use_local_llm --local_llm_host="localhost" --local_llm_port=8899 --lang="zh"
File "/opt/conda/lib/python3.10/site-packages/trio-0.24.0-py3.10.egg/trio/_core/_io_epoll.py", line 203, in EpollIOManager
_epoll: select.epoll = attr.ib(factory=select.epoll)
AttributeError: module 'eventlet.green.select' has no attribute 'epoll'
还有就是:单卡模型推理服务 ,加载模型后隔一小段时间后就会崩溃,然后出现以下提示
ValueError: The model's max seq len (8192) is larger than the maximum number of tokens that can be stored in KV cache (5392). Try increasing
gpu_memory_utilization
or decreasingmax_model_len
when initializing the engine.运行的环境是魔搭阿里云DSW,T_T。。。求大佬帮忙解决
The text was updated successfully, but these errors were encountered: