
[DOUBT] Are the unofficial quantized models in the Hugging Face repo safe and reliable to use?

Open Greatz08 opened this issue 1 year ago • 2 comments

image

There are no official quantized models, so will the team officially support a quantized version or not? As you can see, the following are the only unofficial quantized models available, so I wanted to ask about them: are they safe to use? Will they provide performance similar to the original model? Has anyone from the team or the community tested them?

Greatz08, Nov 02 '24 15:11

Hello, the official AI model can be downloaded here: https://github.com/CodePhiliaX/Chat2DB-GLM?tab=readme-ov-file

tmlx1990, Nov 04 '24 02:11

@tmlx1990 Yes, I saw this too, but I want a quantized (compressed) version of the model. Most people won't be able to run the original model locally, since most have only 8-10 GB of VRAM, so I and hopefully many others will be looking for a quantized version of the original model. If possible, please create several quantized models that we can easily run on our local systems. The Q4 and Q5 K_L and K_M quantized variants would be best for running within 8-10 GB of VRAM without compromising too much on model quality.
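
To make the VRAM argument concrete, here is a rough back-of-the-envelope sketch of how much memory the weights alone would need at different precisions. The 7-billion parameter count is an assumption for illustration (the thread does not state the model size), and the bit widths are nominal: real quantized files store extra scaling metadata, and inference also needs memory for the KV cache and activations.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 1024**3


# Assumption: a 7-billion-parameter model; not stated anywhere in this thread.
NUM_PARAMS = 7e9

# Nominal bit widths only; actual quantization formats add some overhead.
for label, bits in [("FP16", 16), ("5-bit", 5), ("4-bit", 4)]:
    print(f"{label}: ~{weight_memory_gib(NUM_PARAMS, bits):.1f} GiB for weights")
```

Under these assumptions, the unquantized FP16 weights would not fit in 8-10 GB of VRAM, while the 4-bit and 5-bit variants would leave headroom for context.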

Greatz08, Nov 05 '24 01:11