stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

228 stanford_alpaca issues

Thanks for this important work in pushing open LLMs forward. You mention that you deviate from `self-instruct` by adding an explicit (fixed) prompt to each and every instruction/input/output...
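For context, the fixed prompt in question is the Alpaca-style template that wraps each record before training. A minimal sketch of that formatting is below; the exact template wording is reproduced from memory of the repo and should be treated as an approximation, and `build_prompt` is a hypothetical helper name.

```python
# Alpaca-style fixed prompt templates (wording is an approximation of the
# repo's templates, not copied verbatim).
PROMPT_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)

def build_prompt(example: dict) -> str:
    """Format one instruction/input/output record into the fixed prompt."""
    if example.get("input"):
        return PROMPT_WITH_INPUT.format(
            instruction=example["instruction"], input=example["input"]
        )
    return PROMPT_NO_INPUT.format(instruction=example["instruction"])
```

The model is then trained to continue each formatted prompt with the record's `output` field.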

This repo is awesome. Please let me know the steps to use LLaMA 13B to train on JSON data similar to alpaca_data.json. I have my own custom data content and want to train....
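For anyone preparing custom data for this: alpaca_data.json is a JSON list of objects with `instruction`, `input`, and `output` fields, where `input` may be empty. A sketch of writing data in that shape (the records and file name below are made-up examples, not from the repo):

```python
import json

# Hypothetical records in the alpaca_data.json format: a JSON list of
# objects with "instruction", "input", and "output" fields.
records = [
    {
        "instruction": "Classify the sentiment of the sentence.",
        "input": "I loved this movie.",
        "output": "Positive",
    },
    {
        "instruction": "Name three primary colors.",
        "input": "",  # "input" is left empty when no context is needed
        "output": "Red, blue, and yellow.",
    },
]

with open("my_custom_data.json", "w") as f:
    json.dump(records, f, indent=2)
```

A file in this shape can then be passed to the training script in place of alpaca_data.json.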

Hi. I got a free quota for Google Cloud TPU and tried to run the training on it over the past two days. I did the following: 1. Create a TPU...

ChatGPT's continuous conversation capability impresses me a lot. So I wonder: does Alpaca perform well on this?

Sorry to see the demo go dark. Hope you guys are doing ok. Wondering if you could run benchmarks with the weights you have against BIG-Bench Hard and share the...

I got this error:

```
python3.10/site-packages/transformers-4.27.0.dev0-py3.10.egg/transformers/trainer.py", line 1460, in _wrap_model
    self.model = model = FSDP(
TypeError: FullyShardedDataParallel.__init__() got an unexpected keyword argument 'forward_prefetch'
```

[torch 1.12](https://pytorch.org/docs/1.12/search.html?q=forward_prefetch&check_keywords=yes&area=default) does not support `forward_prefetch`....
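One possible workaround is to gate the `forward_prefetch` argument on the installed torch version, since FSDP in torch 1.12 does not accept it. A minimal sketch (the 1.13 version cutoff and the helper name are assumptions, not from the repo):

```python
def fsdp_kwargs_for(torch_version: str) -> dict:
    """Return FSDP keyword arguments supported by the given torch version.

    forward_prefetch is not accepted by FullyShardedDataParallel in
    torch 1.12, so only pass it on newer versions (the >= 1.13 cutoff
    here is an assumption).
    """
    major, minor = (int(x) for x in torch_version.split(".")[:2])
    kwargs = {}
    if (major, minor) >= (1, 13):
        kwargs["forward_prefetch"] = True
    return kwargs

# Usage sketch, inside the training setup:
# model = FSDP(model, **fsdp_kwargs_for(torch.__version__), ...)
```

Alternatively, upgrading torch to a version whose FSDP supports `forward_prefetch` avoids the guard entirely.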

In the current [fine-tuning implementation](https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/tokenization_llama.py#L59), the EOS token `` is not automatically appended to the end of the input ids. Therefore, the model is never trained to produce EOS after...
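If the tokenizer does not append EOS, one fix is to add the EOS id manually after encoding each example. A sketch of that, where `input_ids` and `eos_id` stand in for a real tokenizer's output and its `eos_token_id` (the helper name is hypothetical):

```python
def add_eos(input_ids: list, eos_id: int) -> list:
    """Append the EOS id to a sequence of token ids if it is missing,
    so the model sees EOS as the training target at the end of each
    example."""
    if not input_ids or input_ids[-1] != eos_id:
        return input_ids + [eos_id]
    return input_ids
```

Without this, the model never observes EOS as a target and tends to generate until the length limit instead of stopping.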