Billy Cao
Yes, I meant the GPTQ HF branch. I figured it out myself by removing all the QuIP and Marlin code, and it works for me.
I have the same problem, and the inference results are completely wrong. CUDA 11.8, CUDNN 8.9, torch 2.1.0.dev20230425+cu118
It is because of the lack of AVX512; this has already been confirmed in https://github.com/PaddlePaddle/PaddleOCR/issues/10346 and https://github.com/PaddlePaddle/PaddleOCR/issues/10675. I reproduced it on both a Xeon and an AMD Ryzen 3, neither of which supports AVX512...
Well, as I said, it is not an Intel issue, as I have it on AMD too. But if by "Intel" you mean Intel x86, then sure.
> Considering this works on paddlepaddle 2.4.2

BTW, this doesn't work on 2.4.2 either; it just falls back to some alternate, unusably slow code path, which is https://github.com/PaddlePaddle/PaddleOCR/issues/10346. So something changed from...
Thanks for the fix
I just followed the configuration from the tutorial; even the latest 3.0 beta still works for me.
@NielsRogge I thought the wrong pad token id was already fixed, as you mentioned in https://huggingface.co/llava-hf/llava-v1.6-mistral-7b-hf/discussions/2
So this would close https://github.com/huggingface/transformers/issues/29832, right? Any idea whether this change could also solve https://github.com/huggingface/transformers/issues/29835?
Just got the Gradio web demo to work; see the linked PR.