stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
```
Traceback (most recent call last):
  File "/home/ubuntu/stanford_alpaca/train.py", line 231, in <module>
    train()
  File "/home/ubuntu/stanford_alpaca/train.py", line 225, in train
    trainer.train()
  File "/home/ubuntu/anaconda3/envs/lama/lib/python3.10/site-packages/transformers-4.27.0.dev0-py3.10.egg/transformers/trainer.py", line 1628, in train
    return inner_training_loop(
  File "/home/ubuntu/anaconda3/envs/lama/lib/python3.10/site-packages/transformers-4.27.0.dev0-py3.10.egg/transformers/trainer.py", line...
```
https://github.com/tatsu-lab/stanford_alpaca/blob/73cac8be49a66ca5d159ee9199428804e1e6aabe/generate_instruction.py#L155
How to use the training data to fine-tune the open-source ChatGLM?
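Reusing `alpaca_data.json` with another model mostly comes down to reproducing the prompt format before tokenization. A minimal sketch, with templates matching `PROMPT_DICT` in this repo's `train.py` (the sample record below is illustrative):

```python
# Prompt templates matching PROMPT_DICT in this repo's train.py.
PROMPT_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Input:\n{input}\n\n### Response:"
)
PROMPT_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)


def build_example(record: dict) -> dict:
    """Turn one alpaca_data.json record into a (source, target) pair."""
    template = PROMPT_INPUT if record.get("input") else PROMPT_NO_INPUT
    # str.format ignores unused keys, so both templates accept a full record.
    source = template.format(**record)
    return {"source": source, "target": record["output"]}


# Illustrative record in the alpaca_data.json schema.
record = {
    "instruction": "Give three tips for staying healthy.",
    "input": "",
    "output": "1. Eat a balanced diet...",
}
example = build_example(record)
print(example["source"])
```

From here, the usual recipe is to tokenize `source + target`, mask the `source` tokens out of the loss, and feed the pairs to whichever trainer the target model (e.g. ChatGLM) provides.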
``` F:\!DEV\stanford_alpaca>torchrun --nproc_per_node=4 --master_port=2233 train.py --model_name_or_path "facebook/opt-6.7b" --data_path ./alpaca_data.json --bf16 True --output_dir results --num_train_epochs 3 --per_device_train_batch_size 4 --per_device_eval_batch_size 4 --gradient_accumulation_steps 8 --evaluation_strategy "no" --save_strategy "steps" --save_steps 2000 --save_total_limit 1 --learning_rate...
As mentioned in the repo, Hugging Face's transformers library must be installed from a particular fork (i.e. https://github.com/huggingface/transformers/pull/21955, not yet merged). Updating requirements.txt to install that version of transformers...
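One way to pin that pre-merge code is pip's VCS install syntax, which can fetch a pull request's head ref directly (a sketch; once the PR is merged, a normal `transformers` release is the better pin):

```shell
# Install transformers from the head of the referenced pull request.
pip install "git+https://github.com/huggingface/transformers.git@refs/pull/21955/head"
```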
Uses link references to fix the links to the two cited papers. Keeps the syntax similar to LaTeX; no changes to the rendered document except for the links inside "[1]" and...
Hi there, if I understand correctly, Alpaca is a pretrained, locally running AI. Is it possible to pass new input to Alpaca, e.g. have Alpaca summarize a new text?
I just got access to the Meta LLaMA model parameters. What should I do next? Thanks!
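Before this repo's `train.py` can use the raw Meta checkpoint, it needs to be converted to the Hugging Face format. A sketch using the conversion script that ships with recent `transformers` releases (the paths and model size here are illustrative):

```shell
# Convert the raw Meta LLaMA checkpoint into a Hugging Face-format
# directory that can then be passed to train.py via --model_name_or_path.
python -m transformers.models.llama.convert_llama_weights_to_hf \
    --input_dir /path/to/downloaded/llama \
    --model_size 7B \
    --output_dir /path/to/llama-7b-hf
```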
- instrucitons -> instructions
- necssary -> necessary
- "a instr.." -> "an instr..."
Is there a way to visualize the activation status of each layer in the LLaMA model? I want to observe how the language model responds to different types of problems,...
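One common approach is PyTorch forward hooks, which capture each layer's output during a forward pass. A minimal sketch on a toy module; for a converted LLaMA checkpoint the same pattern applies, iterating over its decoder layers instead (attribute names depend on the conversion):

```python
import torch
import torch.nn as nn

# Tiny stand-in model; replace with the per-layer modules of a real
# LLaMA checkpoint to inspect its activations.
model = nn.Sequential(nn.Linear(8, 16), nn.ReLU(), nn.Linear(16, 4))

activations = {}


def make_hook(name):
    def hook(module, inputs, output):
        # Record a cheap per-layer summary (mean absolute activation)
        # rather than storing the full tensor.
        activations[name] = output.detach().abs().mean().item()
    return hook


handles = [
    layer.register_forward_hook(make_hook(f"layer_{i}"))
    for i, layer in enumerate(model)
]

with torch.no_grad():
    model(torch.randn(2, 8))

# Remove hooks so later forward passes are unaffected.
for h in handles:
    h.remove()

print(activations)
```

The captured summaries can then be plotted (e.g. as a heatmap over layers and prompts) to compare how different problem types light up the network.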