stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
I am working on a project using this model, but I hit a roadblock: I don't know how to provide it with extra context from a PDF or text file....
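Alpaca itself has no built-in retrieval or file-loading mechanism, so the usual workaround is to read the file yourself and splice its text into the prompt. A minimal sketch, assuming `pypdf` for PDF extraction and the standard Alpaca instruction/input template (the file name and instruction are hypothetical):

```python
# Sketch: prepend file contents to an Alpaca-style prompt as extra context.
# `pypdf` is an assumption; the context must fit within the model's
# context window, since there is no retrieval mechanism.
from pypdf import PdfReader

def load_context(path: str) -> str:
    if path.endswith(".pdf"):
        reader = PdfReader(path)
        return "\n".join(page.extract_text() or "" for page in reader.pages)
    with open(path, encoding="utf-8") as f:
        return f.read()

PROMPT = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:"
)

context = load_context("notes.pdf")  # hypothetical file
prompt = PROMPT.format(instruction="Summarize the document.", input=context)
# feed `prompt` to the model with your usual generation code
```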
Hi, I'm trying to run train.py on Windows. Please help me solve the problem. # System parameters: 12th Gen Intel(R) Core(TM) i5-12600KF 3.70 GHz, 32 GB RAM, CUDA 11.8, Windows 11...
Has anyone been able to fine-tune any of the models larger than 7B successfully? I'm training on 8 A100s with 80 GB of memory each, which is more than enough space....
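For what it's worth, the README's 7B command shards the model with FSDP; for larger checkpoints, the same sharding with a smaller per-device batch and gradient checkpointing is a common starting point. A sketch of the relevant `TrainingArguments`, where the exact values are assumptions rather than tested 13B settings:

```python
# Sketch: FSDP settings along the lines of the README's torchrun command,
# adjusted for a larger model. The batch/accumulation values are guesses.
import transformers

training_args = transformers.TrainingArguments(
    output_dir="out-13b",
    per_device_train_batch_size=2,     # smaller than the 7B setting
    gradient_accumulation_steps=16,
    bf16=True,
    gradient_checkpointing=True,       # trades compute for activation memory
    fsdp="full_shard auto_wrap",
    fsdp_transformer_layer_cls_to_wrap="LlamaDecoderLayer",
)
```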
I don't understand why the default model_max_length is set to 512, and why the example training bash script in the main README doesn't pass an argument setting it to 2048...
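For context, `model_max_length` is an ordinary dataclass field in train.py's `TrainingArguments`, so it can be overridden on the command line even though the README example doesn't. A minimal sketch of that pattern (the help text is paraphrased):

```python
# Sketch of the pattern train.py uses: a dataclass field with default 512
# that HfArgumentParser lets you override with --model_max_length 2048.
from dataclasses import dataclass, field
import transformers

@dataclass
class TrainingArguments(transformers.TrainingArguments):
    model_max_length: int = field(
        default=512,
        metadata={"help": "Maximum sequence length; inputs are padded/truncated to this."},
    )

parser = transformers.HfArgumentParser(TrainingArguments)
# e.g. python train.py --output_dir out --model_max_length 2048
(args,) = parser.parse_args_into_dataclasses()
print(args.model_max_length)
```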
I may be wrong here, but from what I traced in their train.py file, it seems like they are training the model by passing both question + answer as the...
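That reading appears consistent with train.py: the prompt and the answer are concatenated into one sequence, and the prompt positions in `labels` are set to `IGNORE_INDEX` so the cross-entropy loss only covers the answer tokens. A minimal sketch of the idea (simplified; the real code also handles EOS tokens and padding):

```python
# Minimal sketch of the masking in train.py's preprocessing: tokenize
# prompt+answer together, then set the prompt positions in `labels` to
# IGNORE_INDEX so the loss ignores them.
import copy

IGNORE_INDEX = -100

def build_example(tokenizer, source: str, target: str):
    full = tokenizer(source + target, return_tensors="pt").input_ids[0]
    source_len = tokenizer(source, return_tensors="pt").input_ids.shape[1]
    labels = copy.deepcopy(full)
    labels[:source_len] = IGNORE_INDEX  # no loss on the question/prompt
    return {"input_ids": full, "labels": labels}
```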
Thanks for the repo. If I use torchrun as suggested by the webpage (see below), it fails due to an error while compiling cpuadam within the DeepSpeed library. The actual...
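One way to debug this class of failure is to trigger the JIT build of the op in isolation, outside torchrun, so the underlying compiler error is readable. A hedged diagnostic sketch using DeepSpeed's op builder API (prebuilding the op when installing DeepSpeed is the usual workaround; this snippet just reproduces the failure on its own):

```python
# Diagnostic sketch: force DeepSpeed to JIT-build the CPU Adam op outside
# of training so the underlying compiler/CUDA error is easier to read.
from deepspeed.ops.op_builder import CPUAdamBuilder

builder = CPUAdamBuilder()
print("compatible:", builder.is_compatible())  # checks toolchain prerequisites
op = builder.load()  # triggers the same compile step that fails under torchrun
print("loaded:", op)
```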
A month ago, I trained Alpaca with 4 A100 GPUs (80 GB each) and `per_device_train_batch_size=4`, with `transformers==4.28.1`. Today I retrained Alpaca with the same hardware and the same code,...
I have been trying to fine-tune for a while, and I succeeded using this repository. However, the next day I tried to fine-tune again using the same steps and it...
Can I fine-tune Alpaca on a machine with 4 × V100 GPUs (32 GB each)?
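As a rough feasibility check, full fine-tuning of a 7B model with Adam needs far more state than one 32 GB card holds, so it would hinge on ZeRO/FSDP-style sharding or offload. A back-of-envelope sketch under the stated assumptions:

```python
# Back-of-envelope estimate for full fine-tuning of a 7B model with Adam in
# mixed precision (assumptions: fp16 weights + grads, fp32 master weights
# and optimizer states; activations and buffers excluded).
params = 7e9
bytes_per_param = 2 + 2 + 4 + 4 + 4   # weights, grads, master copy, Adam m, Adam v
total_gib = params * bytes_per_param / 2**30
print(f"~{total_gib:.0f} GiB of training state")       # ~104 GiB
print(f"vs {4 * 32} GB total across 4 x V100 (32 GB)") # needs sharding/offload
```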
I cannot find any code in train.py that sets up wandb to visualize the training process. Can anyone answer my question?
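There is no explicit wandb code in train.py because logging goes through the Hugging Face Trainer's built-in integrations; enabling it comes down to the `report_to` argument. A minimal sketch (the run name is hypothetical):

```python
# Sketch: wandb logging comes from the HF Trainer integration, not from
# explicit code in train.py. Passing report_to (or --report_to wandb on
# the command line) is enough, assuming `wandb` is installed and logged in.
import transformers

training_args = transformers.TrainingArguments(
    output_dir="out",
    report_to=["wandb"],         # Trainer then logs losses/metrics to wandb
    run_name="alpaca-finetune",  # hypothetical run name
    logging_steps=10,
)
```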