fastllm chinese-llama-alpaca 模型 BUG

chinese-llama-alpaca 模型 BUG

Open levishen opened this issue 1 year ago • 3 comments

如题，会出现爆显存的问题，并打印如下错误： status = 7 2049 1 128 Error: cublas error. terminate called after throwing an instance of 'char const*' Aborted (core dumped)

Jul 09 '23 10:07 levishen

是不是输入长度超过2048了，早期的LLAMA好像限制了长度不超过2048 （其实就是rotary_embdding的时候位置编码只开到了2048），我之后把这个值开大应该就可以了

Jul 09 '23 11:07 ztxz16

输入文本是：北京有什么景点？

长度不会超过2048呀

Jul 09 '23 11:07 levishen

Jul 09 '23 12:07 levishen