
QLoRA: Efficient Finetuning of Quantized LLMs

Results: 229 qlora issues (sorted by recently updated)

Aims to fix #38 and #41. Currently, we get extremely small adapter files on checkpoint. This seems to be due to an issue in the PEFT library. One of the...

`max_memory=max_memory` raises `NameError: name 'max_memory' is not defined`. For testing, running on a smaller machine (128 GB RAM, 48 GB GPU RAM). Maybe the example could include a suggestion, e.g. `max_memory={0: "44GiB", "cpu": "110GiB"}`.
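The error above means `max_memory` is used before it is ever assigned. A minimal sketch of what the reporter suggests, assuming a small helper (hypothetical, not part of `qlora.py`) that builds the mapping `from_pretrained` expects, with one cap per GPU index plus a `"cpu"` key for offload:

```python
# Hypothetical helper, not from the repo: build the max_memory dict
# that transformers' from_pretrained accepts for device placement.
def build_max_memory(n_gpus, gpu_cap="44GiB", cpu_cap="110GiB"):
    """One memory cap per GPU index, plus a 'cpu' entry for offload."""
    mem = {i: gpu_cap for i in range(n_gpus)}
    mem["cpu"] = cpu_cap
    return mem

# For the single-GPU 48 GB machine described above:
# max_memory = build_max_memory(1)  # {0: "44GiB", "cpu": "110GiB"}
```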

`python qlora.py --model_name_or_path decapoda-research/llama-13b-hf` (I have updated `tokenizer_config.json` and `config.json` as per the various discussions [here](https://huggingface.co/decapoda-research/llama-13b-hf/discussions/): `tokenizer_class`: `LlamaTokenizer` and `architectures`: `LlamaForCausalLM`). Output: adding LoRA modules... trainable params: 125173760.0 ||...

The saved adapter_model.bin is only 441 KB. https://github.com/artidoro/qlora/issues/38

Check for `LlamaTokenizerFast` rather than inferring the type from the path name. Fixes cases where non-standard llama model path names get bypassed in the tokenizer check. The tokenizer is initialized with `use_fast=True` and...
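The fix described above can be sketched as follows; checking the tokenizer's actual class rather than the checkpoint path means renamed llama checkpoints are not bypassed. The function name is illustrative, not the PR's actual code:

```python
# Hypothetical sketch of the check described in this PR: decide based on
# the tokenizer instance's class, not on the model path string.
def is_llama_tokenizer(tokenizer):
    """True for LlamaTokenizer and LlamaTokenizerFast instances,
    regardless of what the checkpoint directory is called."""
    return type(tokenizer).__name__ in ("LlamaTokenizer", "LlamaTokenizerFast")
```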

I am using the `finetune.py` script with default params it works perfectly but when I get the adapter model in the checkpoint directory, the size of adapter model is just...

pip package versions: transformers 4.30.0.dev0, accelerate 0.20.0.dev0, bitsandbytes 0.39.0, peft 0.3.0.dev0. Python command: `python qlora.py --model_name_or_path /home/bmb/models/facebook/opt-125m`. Error: Traceback (most recent call last): File "/home/bmb/projects/qlora/qlora.py", line 766, in train() File "/home/bmb/projects/qlora/qlora.py", line...

Trained a vicuna-13b-1.1 LoRA in 4-bit. Now trying to merge it for running generations, but it fails with the following error: ``` python3.11/site-packages/peft/tuners/lora.py", line 352, in merge_and_unload raise ValueError("Cannot merge LORA... ```
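PEFT refuses to merge an adapter into quantized weights. A common workaround (an assumption on my part, not confirmed in this thread) is to reload the base model unquantized in fp16, attach the adapter, and merge there. Paths and the function name are placeholders:

```python
# Hedged sketch of a workaround, not code from this repo: merge_and_unload
# works once the base weights are loaded without 4-bit quantization.
def merge_adapter_fp16(base_model_path, adapter_dir, out_dir):
    import torch
    from transformers import AutoModelForCausalLM
    from peft import PeftModel

    # Reload base weights WITHOUT quantization (fp16 on CPU here).
    base = AutoModelForCausalLM.from_pretrained(
        base_model_path, torch_dtype=torch.float16, device_map="cpu"
    )
    # Attach the trained LoRA adapter, fold it into the base weights, save.
    model = PeftModel.from_pretrained(base, adapter_dir)
    merged = model.merge_and_unload()
    merged.save_pretrained(out_dir)
    return merged
```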

I'd like to fine-tune using unlabelled data, i.e. causal language modeling, for instance to adapt a model to a new domain or language. Which parts of the training code...
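For causal language modeling on raw text, no separate labels are needed: the labels are the input ids themselves, and HF-style models shift them by one position internally when computing the loss. A minimal sketch, with a hypothetical helper name:

```python
# Hedged sketch: for causal-LM finetuning on unlabelled text, "labels"
# are simply a copy of the input ids; the model handles the shift.
def make_causal_lm_features(token_ids):
    return {
        "input_ids": list(token_ids),
        "attention_mask": [1] * len(token_ids),
        "labels": list(token_ids),  # identical to input_ids for causal LM
    }
```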