Angainor Development

Results: 70 comments of Angainor Development

Gradio magic, yep: it comes with a "free" API, you just need to set the functions up properly to make them visible.
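
For context, a minimal sketch of what "making a function visible" looks like (assuming a recent gradio install; the function name `generate` and the endpoint name are just illustrative):

```python
import gradio as gr

def generate(prompt: str) -> str:
    # Placeholder for the actual model call.
    return f"Echo: {prompt}"

with gr.Blocks() as demo:
    inp = gr.Textbox(label="Prompt")
    out = gr.Textbox(label="Output")
    btn = gr.Button("Generate")
    # Naming the endpoint makes it show up in the auto-generated API docs
    # (the "Use via API" link at the bottom of the Gradio page).
    btn.click(generate, inputs=inp, outputs=out, api_name="generate")

demo.launch()
```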

I'm training 7B and 13B on a 3090. This repo is supposed to load the models in 8-bit, so the 13B model is about 13 GB of VRAM on the GPU. Your error seems to...
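
Roughly what the 8-bit load looks like (a sketch assuming transformers with bitsandbytes installed; the model id is just an example, not necessarily the one you used):

```python
import torch
from transformers import LlamaForCausalLM

# 8-bit weights are ~1 byte per parameter, so a 13B model needs roughly 13 GB of VRAM.
model = LlamaForCausalLM.from_pretrained(
    "decapoda-research/llama-13b-hf",  # example base model id
    load_in_8bit=True,
    torch_dtype=torch.float16,
    device_map="auto",
)
```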

Are you sure you used the right (model, peft) pair? Check your PEFT config JSON, `adapter_config.json`, to make sure you used the correct base model for inference.
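
A quick way to check (a sketch assuming the peft library; the adapter path is hypothetical):

```python
from peft import PeftConfig

config = PeftConfig.from_pretrained("path/to/your-lora-adapter")
# This must match the base model you load with from_pretrained at inference time.
print(config.base_model_name_or_path)
```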

This model is not a standard one: it comes with its own tokenizer (https://huggingface.co/ziqingyang/chinese-alpaca-lora-7b/tree/main) and has a dedicated repo with related documentation (https://github.com/ymcui/Chinese-LLaMA-Alpaca). The error is likely related to the...

A single epoch is too low. First-generation rank-8 LoRAs were trained for 3 epochs. The current params, with 4 LoRA modules and rank 16, use closer to 10 epochs. Also,...
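
For reference, the kind of config that comment describes looks roughly like this (a sketch assuming the peft library; the exact target modules and dropout are illustrative, not the repo's defaults):

```python
from peft import LoraConfig

lora_config = LoraConfig(
    r=16,                     # rank 16
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # 4 LoRA modules
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
)
```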

See just below: https://github.com/tloen/alpaca-lora#official-weights. As for --group_by_length, I do not recommend it, however.

The tokenizer warning you can ignore. Since you have mixed GPUs, I'd begin by trying one kind of GPU at a time and see what that gives.
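
One way to try a single GPU type at a time (a sketch; the device ids are just an example and must be set before CUDA is initialized):

```python
import os

# Example: expose only the two 3090s (ids 0 and 1) and hide the other cards.
os.environ["CUDA_VISIBLE_DEVICES"] = "0,1"

import torch
print(torch.cuda.device_count())  # should now report only the selected GPUs
```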

How many GPUs? What command line did you use? (Full params, please.)

Apart from the --group_by_length param, it's very similar to how I run it, and your batch params seem consistent. A loss of 0 definitely is weird. I'd try clearing the .hf...

Yeah, 10 items is not a training dataset. That's way too few tokens to move the LoRA weights, unless you train for long enough that you end up breaking the model.