verigle comments

Results 29 comments of


                                            verigle

CUDA out of memory issue during validation

I hava similar problem， the GPU memory always increase， and then Out of Memory. ``` Evaluation [ 0/5000] eta: 1:57:37 time: 1.4116 data: 0.3283 max mem: 9518 Evaluation [ 10/5000]...

BLIP2 Cuda out of memory issue

can you give some examples for how to modified code to finetuning a fraction of layers.

can I use the mirror of hf-mirror.com

I just want to replace the domain of url "huggingface.co" to "hf-mirror.com" for regiested model "[TabbyML/DeepseekCoder-6.7B](https://huggingface.co/deepseek-ai/deepseek-coder-6.7b-base)" https://tabby.tabbyml.com/docs/models/

Failed to fetch model 'TabbyML/DeepseekCoder-6.7B' due to 'Invalid mirror <modelscope.cn>

thank you for your reply, can i download it manually then put it on a directory？

integrity check failed, the download may be incomplete, please try again

这个问题有解决办法了吗？

关于repAdapter_Router 代码的疑问

感谢

PyTorch Implementation Coming

is there any plan to release finetune code of pytorch ?

PyTorch Implementation Coming

is there any plan to release samples code of covariates support and finetune support?

[Bug] DeepSeek R1 serve crash occasionally on 2*H100

Watchdog timeout (self.watchdog_timeout=300) +1

[Bug]: Fail to bind LLM used by RAPTOR

I try it with gpustack with qwen2.5 7b, the same thing happens.