CodeZ-Hao comments

Results: 5 comments of CodeZ-Hao

convert_gpu_weights.py crashed by CUDA out of memory, even with --force_cpu

@ovowei I verified that this PR does not fix the problem. Whether I set the --max_gpu_memory parameter or pass --force_cpu, it still reports `CUDA out of memory`:
```
root@hao-Super-Server:/work/ktransformers/ktransformers/kt-kernel# python scripts/convert_gpu_weights.py --model_id /media/data/models/GLM-4.6/ --output_dir /models/ZhipuAI/GLM-4.6-GPTQ4 --trust_remote_code --force_cpu --quant_type W4A16
🔧 Forced CPU-only mode
🚀 Starting quantization process
Model: /media/data/models/GLM-4.6/
Output: /models/ZhipuAI/GLM-4.6-GPTQ4...
```

CodeZ-Hao

convert_gpu_weights.py crashed by CUDA out of memory, even with --force_cpu

[Bug] Version 0.3.2 hangs at "Getting inference context from sched_client. sched_rpc started with PID: xxx"

fix OOM when converting gpu weights

fix OOM when converting gpu weights

On Linux with the Vivaldi browser, last_tab cannot be found after the browser window is created, and the wait times out