stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
need help!
I encountered a CUDA OOM on a single A100 80GB using your training code. Can I fix this by changing anything?
Dear @all, due to the OOM mentioned in previous issues: has anyone fine-tuned LLaMA with bitsandbytes in an 8-bit setting on a single 3090? If so, please share your experiments...
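For context on why 8-bit loading matters on a 24 GB card: the weight footprint alone roughly halves going from fp16 to int8. A back-of-the-envelope sketch (the 7e9 parameter count is an approximation; full Adam fine-tuning additionally needs gradients and fp32 optimizer moments, which is why 8-bit loading is usually paired with a small set of trainable adapter weights rather than full fine-tuning):

```python
# Rough weight-memory estimate for LLaMA-7B at different precisions.
# 7e9 is an approximate parameter count (assumption, not an exact figure).

def weight_gb(n_params: float, bytes_per_param: float) -> float:
    """Memory in GB occupied by the raw weights alone."""
    return n_params * bytes_per_param / 1e9

print(weight_gb(7e9, 2))  # fp16: 14.0 GB -- tight on a 24 GB 3090 once activations are added
print(weight_gb(7e9, 1))  # int8: 7.0 GB -- leaves headroom for activations and adapters
```

This ignores activations and the KV cache, so treat it as a lower bound, not a fit/no-fit answer.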
Environment: 6x A6000 48GB with Ubuntu 22.04, PyTorch 1.13.0. I ran into a generation problem after following your instructions to convert the LLaMA-7B weights using your attached script. I simply used the...
prompt_batches: 0%| | 0/1 [00:00<?, ?it/s]
As the name implies, can you share the training log?
I am currently training the model, and I am hoping to compare it with others. I am using only 2 A100-80GB. Here is my wandb log: https://wandb.ai/charliezjw/huggingface/runs/hil1q6lt
Hi, what are the steps to train it with this specific Bible content? Example: https://raw.githubusercontent.com/tushortz/variety-bible-text/master/bibles/kjv.txt Can you show me the steps to train it? And the other question is: The...
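One possible route for custom text like this is to reshape it into the `{"instruction", "input", "output"}` records that the Alpaca training script consumes. A minimal sketch, assuming each line of kjv.txt looks like `<verse text> - Book chapter:verse` (the line format and the instruction template here are assumptions, not something defined by the repo):

```python
# Sketch: turn plain-text verses into Alpaca-style instruction records.
import json

def verses_to_alpaca(lines):
    records = []
    for line in lines:
        line = line.strip()
        if not line or " - " not in line:
            continue  # skip blanks and lines that don't match the assumed layout
        text, ref = line.rsplit(" - ", 1)
        records.append({
            "instruction": f"Quote the Bible verse {ref} (KJV).",
            "input": "",
            "output": text,
        })
    return records

sample = ["In the beginning God created the heaven and the earth. - Genesis 1:1"]
print(json.dumps(verses_to_alpaca(sample), indent=2))
```

The resulting list can be dumped to a JSON file and passed to train.py in place of alpaca_data.json.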
LLaMA-13B (HF) fails with OOM on dual A100-80GB. For those who managed to run Alpaca with the 13B model, what specs and torchrun settings did you use? `torchrun --nproc_per_node=2...
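One plausible reason for the dual-GPU OOM: even with optimizer and gradient state fully sharded across two ranks, the per-GPU share of fp16 weights + fp16 gradients + two fp32 Adam moments for ~13B parameters already sits near the 80 GB limit, before any activations. A rough sketch (13e9 and the dtype mix are approximations):

```python
# Per-GPU training-state estimate under full sharding across n_gpus ranks.
def per_gpu_state_gb(n_params: float, n_gpus: int) -> float:
    """fp16 weights (2 B) + fp16 grads (2 B) + fp32 Adam m and v (4 B each)."""
    bytes_total = n_params * (2 + 2 + 4 + 4)
    return bytes_total / n_gpus / 1e9

print(per_gpu_state_gb(13e9, 2))  # ~78 GB per A100-80GB, before activations
```

Numbers like this are why CPU offload of optimizer state, or more GPUs, tends to be needed for 13B full fine-tuning.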
1. Write the new requirements file for training only: `requires.train.txt` 2. Split the old utils.py into utils.py and openai_utils.py 3. Apply the changes to generate_instruction.py and the README