ENH: `quantization=None` should be more intuitive when launching models rather than `quantization='none'`
Using `quantization=None` can indeed launch the model and run successfully. However, even when the model is loaded with `quantization=None`, the value is still converted to `quantization='none'` when the model instance is created. In this context, `None` represents a failure to correctly match the model.
For more details, you can refer to the code: https://github.com/xorbitsai/inference/blob/main/xinference/model/llm/core.py#L202.
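To illustrate the distinction being discussed, here is a minimal, hypothetical sketch (not the actual xinference source; the function name and behavior are assumptions) of why treating `None` as an alias for the string `'none'` can be convenient at the API surface while the internal matching logic still operates on strings:

```python
def normalize_quantization(quantization):
    """Hypothetical normalization step, sketching the pattern under discussion.

    At the user-facing API, passing None is the intuitive way to say
    "no quantization". Internally, model matching compares string spec
    values, so None is coerced to the sentinel string 'none'.
    """
    if quantization is None:
        # None from the caller means "no quantization requested";
        # map it to the string sentinel used by the matching logic.
        return "none"
    return quantization


# The user writes the intuitive form...
user_value = normalize_quantization(None)
# ...and the internal matcher sees the string sentinel.
print(user_value)  # 'none'
print(normalize_quantization("4-bit"))  # '4-bit' passes through unchanged
```

This keeps both spellings valid for callers while the rest of the codebase only ever deals with strings, which is one way to implement the enhancement requested in this issue.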
This issue is stale because it has been open for 7 days with no activity.
This issue was closed because it has been inactive for 5 days since being marked as stale.