南栖
> @NilanEkanayake So there are some pre-converted Qwen models on HuggingFace if you search for "qwen llama".
>
> In terms of Qwen 1.5 / 2 - if it's...
> @songkq Oh it should be supported if you use Llama-Factory's llamafy script. I.e. maybe try https://huggingface.co/models?search=qwen%20llama. On the other hand, if some don't exist, you can try out Llama-Factory's...
I think you can freeze the layers of the pretrained model and keep only the last layer trainable for downstream fine-tuning; that way the model's overall learned knowledge is not changed.
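A minimal PyTorch sketch of that freezing scheme; the three-layer `nn.Sequential` is just a hypothetical stand-in for a pretrained model:

```python
import torch.nn as nn

# Hypothetical three-layer stand-in for a pretrained model.
model = nn.Sequential(
    nn.Linear(16, 16),
    nn.Linear(16, 16),
    nn.Linear(16, 4),  # last layer: the head we keep trainable downstream
)

# Freeze all pretrained parameters...
for p in model.parameters():
    p.requires_grad = False

# ...then unfreeze only the last layer for downstream fine-tuning.
for p in model[-1].parameters():
    p.requires_grad = True
```

Only the last layer's parameters will receive gradients; the optimizer should be built from `filter(lambda p: p.requires_grad, model.parameters())` so frozen weights stay untouched.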
How well does it perform after converting to the Hugging Face format? Is there any quality loss?
My suggestion is to use the Stanford LoRA fine-tuning code, [tloen](https://github.com/tloen) / [alpaca-lora](https://github.com/tloen/alpaca-lora). I have used that code as well. The fine-tuning here cannot run in int8, and LoRA fine-tuning on a Tesla A40 runs out of VRAM, but Stanford Alpaca supports int8 LoRA fine-tuning; 13 GB of VRAM is enough to fine-tune 70B.
Yes, I noticed this problem too, so I later switched to the Stanford fine-tuning code; a single Tesla A100 can fine-tune the 65B LLaMA. [tloen](https://github.com/tloen) / [alpaca-lora](https://github.com/tloen/alpaca-lora)
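As a rough back-of-envelope check on why int8 LoRA fits in modest VRAM: int8 quantization stores the frozen base weights at about one byte per parameter (this ignores activations, the small LoRA adapters, and optimizer state, so real usage is somewhat higher):

```python
def int8_weights_gib(n_params: float) -> float:
    # int8 quantization: roughly 1 byte per parameter.
    return n_params / 2**30

for billions in (7, 13, 65):
    gib = int8_weights_gib(billions * 1e9)
    print(f"{billions}B parameters in int8 ~ {gib:.1f} GiB")
```

So a 13B model's int8 weights come to roughly 12 GiB, which is why LoRA fine-tuning at that scale can squeeze into a card with around 13 GB of free VRAM, while larger models need correspondingly more.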
OK, understood. Looking forward to the 13B Chinese model.
Same problem.
> I install auto_gptq 0.7.0

Change the auto_gptq version.
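One way to switch versions is to reinstall with a pinned release; the `0.6.0` below is only an illustrative pin, not a confirmed fix version:

```shell
# Remove the currently installed auto_gptq, then pin another release.
# 0.6.0 is an example version, not a confirmed working one.
pip uninstall -y auto-gptq
pip install "auto-gptq==0.6.0"
```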