This repository consists of methods to run LLMs in PyTorch, ONNX and Llama.cpp with operators dispatch to NPU.
This is an early access flow, and expected to be upgraded in upcoming release.
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||
This repository consists of methods to run LLMs in PyTorch, ONNX and Llama.cpp with operators dispatch to NPU.
This is an early access flow, and expected to be upgraded in upcoming release.