Ruonan Wang
> Are you using any metric to check for model accuracy after QLoRA finetuning?
> I used my custom dataset for finetuning and my inference results are not good....
Hi @xiangyang-95 , could you please share your detailed code / error message and your `pip list` output with us, so that we can try to reproduce your error : )
By the way, based on my experience, if you want to finetune Mixtral, it is better to use transformers==4.36.1 to avoid an error when saving checkpoints.
I can't reproduce this issue. Based on my test, the code below will call `self.greedy_search`.

### code

```python
import torch
import intel_extension_for_pytorch as ipex
from bigdl.llm.transformers import AutoModelForCausalLM
from transformers import...
```
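For completeness, here is a minimal sketch of how such a snippet typically continues; the model path, prompt, and generation arguments are placeholders of mine, not from the original comment. With no sampling flags set, `generate()` takes the greedy-search path.

```python
import torch
import intel_extension_for_pytorch as ipex  # only needed in a GPU (xpu) env
from bigdl.llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

model_path = "meta-llama/Llama-2-7b-chat-hf"  # placeholder model path

# Load with BigDL-LLM 4-bit optimization and move to the Intel GPU
model = AutoModelForCausalLM.from_pretrained(model_path,
                                             load_in_4bit=True,
                                             trust_remote_code=True).to("xpu")
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

with torch.inference_mode():
    input_ids = tokenizer.encode("What is AI?", return_tensors="pt").to("xpu")
    # do_sample defaults to False, so generate() dispatches to greedy_search
    output = model.generate(input_ids, max_new_tokens=32)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```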
> Is `import intel_extension_for_pytorch as ipex` necessary? As the import does some init work.

@rnwang04 It's not necessary; I used ipex here because I validated in a GPU conda env. I...
Here is a LoRA finetuning script for Llama for your reference: https://github.com/intel-analytics/BigDL/blob/main/python/llm/example/GPU/QLoRA-FineTuning/alpaca-qlora/lora_finetune_llama2_7b_arc_1_card.sh

Based on it, if you want to finetune Baichuan-13B, some modifications are needed (one example is sketched below the list). 1. First you...
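Since the list above is cut off, here is a sketch of one such modification (my own illustration, not the author's actual list): Baichuan packs the q/k/v projections into a single `W_pack` module, so the LoRA `target_modules` differ from Llama's `q_proj`/`k_proj`/`v_proj`. A sketch using peft's `LoraConfig`:

```python
from peft import LoraConfig

# Llama-style config as used by the reference script
llama_lora_config = LoraConfig(
    r=8, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

# Adjusted for Baichuan-13B, whose attention uses a fused W_pack projection
baichuan_lora_config = LoraConfig(
    r=8, lora_alpha=32, lora_dropout=0.05,
    target_modules=["W_pack", "o_proj"],
    task_type="CAUSAL_LM",
)
```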
Hi @vmadananth , the PyTorch xpu profiler is not supported on Windows yet. You can use VTune to obtain some kernel-level profiling (https://www.intel.com/content/www/us/en/docs/vtune-profiler/get-started-guide/2023/windows-os.html). But for now there is no accurate way...
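As a rough stopgap (my own suggestion, not part of the original comment), you can at least wall-clock time individual operations on xpu by synchronizing around them, since kernel launches are asynchronous:

```python
import time
import torch
import intel_extension_for_pytorch as ipex

x = torch.randn(1024, 1024, device="xpu")
w = torch.randn(1024, 1024, device="xpu")

torch.xpu.synchronize()          # make sure prior work has finished
start = time.perf_counter()
y = x @ w
torch.xpu.synchronize()          # wait for the matmul kernel to complete
print(f"matmul took {(time.perf_counter() - start) * 1e3:.3f} ms")
```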
Hi @vmadananth , I'm not sure what you mean by graph optimization here. My understanding is that graph optimization applies to static graphs such as TF/ONNX, while IPEX-LLM [XPU]...
Hi @dan9070 & @Jrizzos , we have reproduced this error (`Error: [0] server cpu not listed in available servers map[]`) on Windows and we are trying to fix it. Once...
Hi all, this issue has been fixed in `ipex-llm[cpp]>=2.1.0b20240513` : )
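To pick up the fix, upgrading to a recent nightly with `pip install --pre --upgrade ipex-llm[cpp]` should be enough.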