Zach comments

Results 18 comments of


                                            Zach

text to text finetuning

not for text to text

Example for how I may continue fine tuning the peft model

The Path leads to what? Is it the hf_ckpt converted folder? To the previous fine tuned alpaca-lora folder?

A bug occurred when using the DDP mode

remove the os.environ, when you run your code do the following CUDA_VISIBLE_DEVICES=2,7 torchrun --nproc_per_node=2 --master_port=1234 fileName.py Its safer to do this instead of using os environ

How to train on multiple GPU (w/ small vRAM)

Have you tried using torchrun?

finetune 65B model on A100-80G with lora

How many gpus are needed to train 65B? I have been able to train 30B but I am pretty sure that was the limit for my capabilities.

finetune 65B model on A100-80G with lora

Do you have a guess for the amount of VRAM you'd need for 65B? I'd be curious to try it out

How to load a model pre-trained on a 52k dataset and continue fine-tuning with another dataset.json?

Just going to put a . in here as I'm facing the same issue. I've talked with T-Atlas a little bit over e-mail and we're hitting the exact same wall

How to use Chinese corpus for fine tuning?

What dataset are you using? Are you fine tuning the already fine tuned model?

How to use Chinese corpus for fine tuning?

> > What dataset are you using? Are you fine tuning the already fine tuned model? > > I tried to fine-tune it with some safe question and answer data...

Inference GPU requirements?

I got this error when I converted the weights to HF model (I'm assuming you ran the convert file, export_hf_checkpoint.py). You have two options if you're running inference on the...