FastChat issues

Unable to save the model weights - GPU OOM

9

I am fine-tuning Vicuna on four A100-80G GPUs. I ran into a problem after training finished: {'loss': 1.3641, 'learning_rate': 4.815273327803183e-08, 'epoch': 0.97} {'loss': 1.35, 'learning_rate': 2.7095433213097933e-08, 'epoch': 0.97} {'loss':...

Jeffwan

Got a huggingface validation error: Repo id must use alphanumeric char

1

HFValidationError: Repo id must use alphanumeric chars or '-', '_', '.', '--' and '..' are forbidden, '-' and '.' cannot start or end the name, max length is 96: 'vicuna-7B/'....

bdutta19
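
The rules quoted in the error message above can be sketched as a small check. This is a minimal illustration built only from the wording of that message, not the `huggingface_hub` implementation; the helper names `looks_like_valid_repo_name` and `strip_trailing_slash` are hypothetical, and the check covers a single name segment only. Whether the trailing slash in `'vicuna-7B/'` is the actual cause in this issue is an assumption, since the snippet is truncated.

```python
import re

# Hypothetical helper: mirrors only the rules quoted in the error text,
# not the real huggingface_hub validator.
_ALLOWED = re.compile(r"[A-Za-z0-9._-]+")


def looks_like_valid_repo_name(name: str) -> bool:
    """Check one repo-name segment against the rules in the error message."""
    if not name or len(name) > 96:
        return False  # empty, or longer than the quoted max length
    if _ALLOWED.fullmatch(name) is None:
        return False  # rejects '/', spaces, and any other character
    if "--" in name or ".." in name:
        return False  # '--' and '..' are forbidden
    if name[0] in "-." or name[-1] in "-.":
        return False  # '-' and '.' cannot start or end the name
    return True


def strip_trailing_slash(path_or_id: str) -> str:
    """Drop trailing slashes, e.g. from a shell-completed directory name."""
    return path_or_id.rstrip("/")


if __name__ == "__main__":
    raw = "vicuna-7B/"
    print(looks_like_valid_repo_name(raw))
    print(looks_like_valid_repo_name(strip_trailing_slash(raw)))
```

If the argument was meant to be a local directory rather than a Hub repo, it is also worth confirming that the path exists, since a missing local path can be interpreted as a repo id.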

Loss jumps around but does not go down in local fine-tuning: a few problems

1

Thanks a lot for the great contribution! Here are the training logs: ...... ...... {'loss': 0.6356, 'learning_rate': 1.9382459105399634e-05, 'epoch': 0.42} {'loss': 0.6391, 'learning_rate': 1.938015964960626e-05, 'epoch': 0.42} {'loss': 0.5389, 'learning_rate': 1.9377856057588756e-05,...

mikeda100

finetune with lora error

4

finetune with lora CUDA_VISIBLE_DEVICES="2,3,4,5,6,7" torchrun --nnodes=1 --nproc_per_node=6 \ fastchat/train/train_lora.py \ --model_name_or_path vicuna/vicuna-7b \ --data_path vicuna/data/data.json \ --fp16 \ --report_to none \ --output_dir ./checkpoints \ --num_train_epochs 3 \ --per_device_train_batch_size 1 \...

llplay

Support FLAN-T5

4

- [ ] Support cli inference of Flan-T5 - [ ] Support web UI serving of Flan-T5 - [ ] Support fine-tuning of Flan-T5

zhisbug

good first issue

CUDA OOM When Using Flash Attention

4

Hello, Thank you for sharing your awesome work! I'm trying to train Vicuna on my own dataset. I walked through the installation process from source. I had to install `pytorch`...

HaniItani

Expected is_sm80 to be true, but got false.

6

Hi there, I am trying to fine-tune vicuna-7b with 2 RTX 3090 cards. torchrun --nnodes=1 --nproc_per_node=2 \ fastchat/train/train_mem.py \ --model_name_or_path vicuna-7b \ --data_path playground/data/alpaca-data-conversation.json \ --bf16 True \...

dstsmallbird

UI contains cross-site scripting (XSS) vulnerabilities

9

The UI does not filter input/output appropriately.

lts-rad

good first issue

help wanted
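
As a generic illustration of the kind of filtering this issue asks for, user-supplied text can be HTML-escaped before it is placed into a page. This is a minimal sketch using Python's standard `html.escape`; the helper name `render_message` is hypothetical, it is not FastChat's code, and a real web UI may also need sanitization specific to its rendering framework (for example, if it renders Markdown).

```python
import html


def render_message(text: str) -> str:
    """Hypothetical helper: escape text before inserting it into HTML.

    html.escape replaces &, <, > and (with quote=True) both quote
    characters, so the browser shows the text instead of parsing it.
    """
    return html.escape(text, quote=True)


if __name__ == "__main__":
    print(render_message("<script>alert(1)</script>"))
```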

Memory leak, Windows

2

When using CUDA, there appears to be a memory leak on Windows systems with either the CLI or UI. Any messages sent to the model will cause the GPU memory...

Aemon-Algiz

ERROR of Flash_attn when finetuning with Deepspeed

1

I tried to fine-tune Vicuna on my own data with DeepSpeed, but I hit the following error: ![error](https://user-images.githubusercontent.com/128484317/230714825-8a47be2b-ecaa-4c03-a802-24bf7cfc6c69.PNG) I tried to solve this error by changing torch and deepspeed version,...

VVNMA
