sujithjoseph comments

Results: 28 comments of sujithjoseph

How to save / form the config.json after fine-tuning - Flan T5 11b

If I shard the XXL base model like this:
```
model.save_pretrained("sharded", max_shard_size="2000MB")
```
will that help with fine-tuning it at a larger batch size, or should I load it in int-8...
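For intuition about what a size cap such as `max_shard_size` controls, here is a minimal sketch of size-capped sharding. It is a simplified illustration, not the transformers implementation; the function name `plan_shards`, the tensor names, and the byte sizes are all hypothetical. The point it shows is that sharding groups weights into checkpoint files on disk.

```python
def plan_shards(tensor_sizes, max_shard_bytes):
    """Greedily group named tensors into shards no larger than max_shard_bytes.

    tensor_sizes: dict mapping a (hypothetical) tensor name to its size in bytes.
    A tensor larger than the cap still gets a shard of its own.
    """
    shards = []
    current = {}
    current_bytes = 0
    for name, size in tensor_sizes.items():
        # Start a new shard when adding this tensor would exceed the cap.
        if current and current_bytes + size > max_shard_bytes:
            shards.append(current)
            current = {}
            current_bytes = 0
        current[name] = size
        current_bytes += size
    if current:
        shards.append(current)
    return shards


# Hypothetical sizes, chosen only to make the grouping easy to follow.
sizes = {
    "encoder.block.0": 900,
    "encoder.block.1": 900,
    "decoder.block.0": 1500,
    "lm_head": 400,
}
shards = plan_shards(sizes, max_shard_bytes=2000)
for index, shard in enumerate(shards):
    print(index, list(shard), sum(shard.values()))
```

The plan only decides which weights land in which file; the total number of bytes across all shards is unchanged.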

How to save / form the config.json after fine-tuning - Flan T5 11b

Since I have the CUDA 11.6 driver installed (Vertex AI), I was using torch 1.12.1+cu116. During installation, I see this:
```
ERROR: pip's dependency resolver does not currently take into...
```

How to save / form the config.json after fine-tuning - Flan T5 11b

@pacman100, I am not able to import `prepare_model_for_training` from main. I ran `pip install -U git+https://github.com/huggingface/peft.git`. Should I install this branch instead: https://github.com/huggingface/peft/tree/younesbelkada-flan-t5-xl ? `ImportError: cannot import name 'prepare_model_for_training'...`

How to save / form the config.json after fine-tuning - Flan T5 11b

These commands helped to fix it:

`pip install --upgrade -e git+https://github.com/huggingface/peft.git#egg=peft`

`pip install --upgrade git+https://github.com/huggingface/peft.git`

How to save / form the config.json after fine-tuning - Flan T5 11b

```
from time import time

model.eval()
inputs = tokenizer(f'Explain Artificial Intelligence ', return_tensors="pt")
print(inputs)
times = []  # in ms
for i in range(100):
    with torch.no_grad():
        # with torch.cuda.amp.autocast():
        start = time()
        outputs...
```
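The timing loop in the comment above is cut off, so here is a minimal, self-contained sketch of the same measurement pattern (100 runs, times collected in milliseconds). The helper `measure_latency_ms` and the stand-in `dummy_workload` are hypothetical; in the original setting the timed call would be the model's generation step, which is not shown in the truncated snippet.

```python
from statistics import mean
from time import perf_counter


def measure_latency_ms(fn, n_runs=100):
    """Call fn() n_runs times and return the per-call wall-clock times in ms."""
    times = []
    for _ in range(n_runs):
        start = perf_counter()
        fn()
        times.append((perf_counter() - start) * 1000.0)
    return times


def dummy_workload():
    # Hypothetical stand-in for the model call being benchmarked.
    return sum(i * i for i in range(10_000))


times = measure_latency_ms(dummy_workload, n_runs=100)
print(f"mean latency: {mean(times):.3f} ms over {len(times)} runs")
```

When the timed call runs on a GPU, CUDA kernels execute asynchronously, so calling `torch.cuda.synchronize()` before reading the clock at the start and end of each run is generally needed for the numbers to reflect the actual compute time.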

How to save / form the config.json after fine-tuning - Flan T5 11b

This happens only when I load the model in 8-bit.
```
config = PeftConfig.from_pretrained(peft_model_id)
model = AutoModelForSeq2SeqLM.from_pretrained(
    config.base_model_name_or_path,
    device_map={'': 0},
    load_in_8bit=True,
    torch_dtype=torch.float16,
)
device = torch.device("cuda")
model.cuda()
model = prepare_model_for_training(model)
model = PeftModel.from_pretrained(model,...
```