
LongQLoRA: Extend Context Length of LLMs Efficiently

Results 5 LongQLoRA issues

Thanks for the work. I reproduced the training, but merging the LoRA modules raised an error: if q_len % group_size > 0: raise ValueError("q_len %d should be divisible by group size...
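For readers hitting the same error, here is a minimal sketch of what the quoted guard checks and how one might pick a sequence length that passes it. The helper names `check_group_divisible` and `next_divisible_length` are hypothetical and not part of the repository, and the error message text is my own because the original is truncated above. How `group_size` is actually derived in the project is not shown in this report, so the values below are assumptions for illustration only.

```python
def check_group_divisible(q_len: int, group_size: int) -> None:
    """Hypothetical helper: same kind of guard as the one quoted in the report."""
    if group_size <= 0:
        raise ValueError("group_size must be positive")
    if q_len % group_size > 0:
        # Message wording is illustrative, not the repository's.
        raise ValueError(
            f"q_len {q_len} is not divisible by group_size {group_size}"
        )


def next_divisible_length(q_len: int, group_size: int) -> int:
    """Hypothetical helper: smallest length >= q_len that is a multiple of group_size."""
    if group_size <= 0:
        raise ValueError("group_size must be positive")
    remainder = q_len % group_size
    return q_len if remainder == 0 else q_len + group_size - remainder


if __name__ == "__main__":
    # Assumed example values, not taken from the report.
    q_len, group_size = 10, 4
    padded = next_divisible_length(q_len, group_size)
    check_group_divisible(padded, group_size)  # passes after padding
    print(padded)
```

Padding or truncating the input to such a length before the attention call is one possible workaround; whether it suits the merge step described here would need confirmation from the author.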

generartion -> generation

LongLoRA can also be combined with QLoRA. What's the difference between LongQLoRA and LongLoRA + QLoRA? The model URL is as follows: https://huggingface.co/Yukang/LongAlpaca-7B-qlora-weights

Following the tutorial, a 15k context does not run on 8 V100 32G GPUs, but a 10k context does.

Hello author, could you please tell me how the pre-training dataset and the instruction fine-tuning dataset are made? Thank you very much!