This repository has been archived by the owner on Sep 18, 2024. It is now read-only.
Support customizing the type of quantization #3892
chenbohua3 started this conversation in New Feature Design Discussion
Replies: 0 comments
Different backends have different types of quantization, such as:
Currently, the quantization type is hard-coded in each quantizer, which is not flexible. Is there a need to let users customize it through their own settings? A potential way is to set it through `configure_list`. The keywords `dtype` and `qscheme`, and their corresponding values, are derived from PyTorch quantization, so they may be familiar to PyTorch users.
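The original example is not preserved here, so what follows is only a minimal sketch of how such a setting could look, not the proposed implementation. It assumes the `dtype` and `qscheme` keys sit next to the usual `quant_types`, `quant_bits` and `op_types` entries, and that values are plain strings named after PyTorch's quantized dtypes and schemes. The helper `resolve_quant_type` and the `DEFAULTS` table are hypothetical and exist only for illustration.

```python
# Names mirroring PyTorch's quantized dtypes and quantization schemes.
SUPPORTED_DTYPES = {"quint8", "qint8", "qint32"}
SUPPORTED_QSCHEMES = {
    "per_tensor_affine",
    "per_tensor_symmetric",
    "per_channel_affine",
    "per_channel_symmetric",
}

# Hypothetical defaults, standing in for what a quantizer hard-codes today.
DEFAULTS = {"dtype": "quint8", "qscheme": "per_tensor_affine"}


def resolve_quant_type(config):
    """Return (dtype, qscheme) for one configure_list entry.

    Keys that the user did not set fall back to DEFAULTS.
    """
    dtype = config.get("dtype", DEFAULTS["dtype"])
    qscheme = config.get("qscheme", DEFAULTS["qscheme"])
    if dtype not in SUPPORTED_DTYPES:
        raise ValueError(f"unsupported dtype: {dtype!r}")
    if qscheme not in SUPPORTED_QSCHEMES:
        raise ValueError(f"unsupported qscheme: {qscheme!r}")
    return dtype, qscheme


configure_list = [
    {
        # Entry that overrides the quantization type for weights.
        "quant_types": ["weight"],
        "quant_bits": {"weight": 8},
        "op_types": ["Conv2d", "Linear"],
        "dtype": "qint8",
        "qscheme": "per_channel_symmetric",
    },
    {
        # Entry without the new keys: the defaults apply.
        "quant_types": ["output"],
        "quant_bits": {"output": 8},
        "op_types": ["ReLU"],
    },
]

resolved = [resolve_quant_type(entry) for entry in configure_list]
```

Keeping the new keys optional would let existing configurations continue to work unchanged, while a backend that needs, say, symmetric per-channel weights could opt in per entry.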