
Code and documentation to train Stanford's Alpaca models, and generate the data.

Results: 224 stanford_alpaca issues

Now that transformers supports LLaMA, it might be a good idea to simplify the README/installation instructions for finetuning, instead of having to pick out a specific commit from this PR: https://github.com/huggingface/transformers/pull/21955

Tested with my own fine-tuned 7B Alpaca model:

```
python inference.py \
    --model_name_or_path {model_path}
```

```
Instruction: Tell me about alpacas. | 2499 | Al | -15.960 | 0.00% | 29886...
```

Can someone modify train.py to support training on CPU? Thank you in advance.

I know that in Markdown, `### ` marks a section/subsection title. Is there any relevance here? Edit to make the question clearer: in the training script, the inputs...
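For context on this question: the `### ` prefixes in Alpaca's prompts are not rendered as Markdown headings; they serve as plain-text field delimiters the model learns to associate with instruction, input, and response. A minimal sketch of the with-input template (wording follows the prompt dictionary published in the stanford_alpaca repo; the helper function name is mine):

```python
# Alpaca-style prompt template. The "### " markers are literal delimiters
# in the training text, not Markdown structure.
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:"
)

def build_prompt(instruction: str, context: str) -> str:
    """Format one training example into the full prompt string."""
    return PROMPT_WITH_INPUT.format(instruction=instruction, input=context)

print(build_prompt("Tell me about alpacas.", "Alpacas are domesticated camelids."))
```

At inference time the model's completion is everything generated after the trailing `### Response:` marker.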

I'm worried this code doesn't run when using pre-trained BLOOMZ or mT0 [https://github.com/bigscience-workshop/xmtf]. Has anyone fine-tuned these?

[E ProcessGroupNCCL.cpp:455] Some NCCL operations have failed or timed out. Due to the asynchronous nature of CUDA kernels, subsequent GPU operations might run on corrupted/incomplete data. [E ProcessGroupNCCL.cpp:460] To avoid...
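Timeouts like this are usually easier to diagnose with NCCL's verbose logging and async error handling turned on before the process group initializes. A stdlib-only sketch (the environment variables are standard NCCL/PyTorch settings; the `torch.distributed` call is left as a comment since it needs a GPU cluster, and the timeout value is illustrative, not a recommendation):

```python
import os
from datetime import timedelta

# Enable verbose NCCL logging so the failing collective is identifiable.
os.environ["NCCL_DEBUG"] = "INFO"
# Surface asynchronous NCCL failures instead of hanging until timeout.
os.environ["NCCL_ASYNC_ERROR_HANDLING"] = "1"

# If initializing torch.distributed directly, a longer collective timeout
# can be passed at init time (commented out to keep this sketch runnable
# without GPUs):
# import torch.distributed as dist
# dist.init_process_group("nccl", timeout=timedelta(hours=1))

print(os.environ["NCCL_DEBUG"], os.environ["NCCL_ASYNC_ERROR_HANDLING"])
```

These must be set before the process group is created; setting them afterward has no effect on an already-initialized backend.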

Hi, I am trying to check out the commit mentioned in the FINETUNING section; however, the mentioned hash is not present: `fatal: bad object 68d640f7c368bcaaaecfc678f11908ebbd3d6176`. Is it available in an archive...

Hi. I hope you are having a great new year so far. I can't make heads or tails of what I am supposed to do to be able to train...

Someone told me there is a DeepSpeed training option in the code; can I ask why it's not the default? Do we know if it's far faster, and if so,...
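For readers unfamiliar with the option being asked about: DeepSpeed training is enabled by passing a JSON config to the launcher (e.g. via a `--deepspeed` flag when using the HuggingFace `Trainer`). The fragment below is an illustrative ZeRO stage-3 configuration, not the repo's own config; the keys are standard DeepSpeed settings and the values are placeholders:

```json
{
  "bf16": { "enabled": true },
  "zero_optimization": {
    "stage": 3,
    "overlap_comm": true,
    "stage3_gather_16bit_weights_on_model_save": true
  },
  "gradient_accumulation_steps": "auto",
  "train_micro_batch_size_per_gpu": "auto"
}
```

ZeRO-3 shards optimizer state, gradients, and parameters across GPUs, trading extra communication for lower per-GPU memory, which is why it is typically opt-in rather than the default.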