stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

224 stanford_alpaca issues

Thanks for the great work. I reproduced the training, but at inference time it tends to generate shorter text. I am using: `generated = model.generate(batch["input_ids"], max_length=512)` Does the interface on the...
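A plausible cause, sketched under the assumption of Hugging Face `generate` semantics: `max_length` caps the total sequence (prompt plus completion), so long prompts leave little room for new tokens, whereas `max_new_tokens` budgets the completion alone. The helper below is hypothetical, just to illustrate the arithmetic:

```python
# Assumption: HF-style generate, where max_length counts prompt + new tokens.
def budget_new_tokens(prompt_len: int, max_length: int) -> int:
    """Tokens left for generation when max_length caps the TOTAL sequence."""
    return max(0, max_length - prompt_len)

# A 480-token prompt under max_length=512 leaves only 32 new tokens;
# passing max_new_tokens=512 instead would always allow up to 512 new tokens.
print(budget_new_tokens(480, 512))  # 32
```

If this is the issue, switching the call to `model.generate(batch["input_ids"], max_new_tokens=512)` should produce longer completions for long prompts.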

Added explicit instructions.

```
pip uninstall transformers
pip install git+https://github.com/zphang/transformers.git@68d640f7c368bcaaaecfc678f11908ebbd3d6176
```

Hello, first of all thank you for releasing the training code for Alpaca, we really appreciate it. I am running the fine-tuning script on a 4xA100-SXM4-80GB node, and currently getting an...

In the provided training command:

```bash
torchrun --nproc_per_node=4 --master_port= train.py \
    --model_name_or_path \
    --data_path ./alpaca_data.json \
    --bf16 True \
    --output_dir \
    --num_train_epochs 3 \
    --per_device_train_batch_size 4 \
    --per_device_eval_batch_size 4 \
    ...
```

Once you collected the 52k synthetic dataset, how did you plot the pie chart [here](https://github.com/tatsu-lab/stanford_alpaca/blob/main/assets/parse_analysis.png)? Thanks!
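While waiting for the authors to confirm their exact pipeline, here is a minimal sketch of one plausible approach: group instructions by a leading verb and compute proportions. The naive "first word" heuristic and the sample instructions below are assumptions for illustration only (the actual chart likely came from a proper dependency parse of the root verb):

```python
from collections import Counter

# Hypothetical sample instructions; the real data would be the 52k entries
# loaded from alpaca_data.json.
instructions = [
    "Give three tips for staying healthy.",
    "Describe the structure of an atom.",
    "Give an example of a metaphor.",
]

# Naive heuristic: treat the first word as the verb (a parser would be
# more robust), then turn counts into fractions for a pie chart.
counts = Counter(inst.split()[0].lower() for inst in instructions)
total = sum(counts.values())
fractions = {verb: n / total for verb, n in counts.items()}
print(fractions)
```

The resulting `fractions` dict can be passed straight to `matplotlib.pyplot.pie(fractions.values(), labels=fractions.keys())`.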

Hi, is there any chance there will be a version of the data generator that supports the gpt-3.5-turbo model?

Can this fine-tuning script fit on an A10, which only has 24GB of GPU memory? I am trying to fine-tune the model on 4 A10 GPUs using a batch size of 1,...
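A rough back-of-envelope estimate suggests this is very tight. Assuming full fine-tuning with AdamW (bf16 weights and gradients, fp32 optimizer moments and master weights, roughly 16 bytes per parameter, activations ignored), a 7B model needs on the order of 100GB of training state, or ~26GB per GPU when sharded across four, already above a 24GB A10 before activations:

```python
# Back-of-envelope training memory estimate (assumptions: bf16 weights and
# grads ~4 bytes/param, fp32 Adam moments + master copy ~12 bytes/param;
# activations, which scale with batch size and sequence length, are ignored).
def train_memory_gb(n_params: float, bytes_per_param: int = 16) -> float:
    return n_params * bytes_per_param / 1024**3

print(round(train_memory_gb(7e9)))  # ~104 GB total before activations
```

So without CPU offloading, gradient checkpointing, or a parameter-efficient method, 4x24GB is unlikely to suffice for full fine-tuning.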

My first run of the trainer could not save the model because the evaluate() call failed. I have removed that method call and would now like to resume from the...