reneix comments

Results 1 comments of


                                            reneix

> You can try to use the qwen2.5-14b model after INT4 quantization to reduce the GPU memory. got, will have a try