
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

106 LoRA issues

Hi, I made some changes to layers.py; however, when running the MRPC task on roberta_base, it seems that __init__() and forward() of the linear layers are not being called...
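For anyone debugging this, one minimal check (my own sketch, not part of the issue) is to confirm that the model actually contains modules carrying LoRA parameters; if nothing is printed, the custom layers were never instantiated, so their __init__() and forward() will not run:

```python
import torch.nn as nn
import loralib as lora

# Toy stand-in for the fine-tuned model; in the real run this would be the
# RoBERTa model produced by the NLU example scripts.
model = nn.Sequential(lora.Linear(8, 8, r=4), nn.ReLU(), nn.Linear(8, 2))

# List the modules that actually register LoRA parameters. An empty listing
# means the LoRA layers from layers.py never made it into the model.
for name, module in model.named_modules():
    if hasattr(module, "lora_A") and hasattr(module, "lora_B"):
        print(name, type(module).__name__)
```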

Hi, thanks for the great work. I am trying to reproduce the results of RoBERTa-large on the NLU tasks; however, I got a CoLA score = 0 and MNLI =...

My steps:
```
git clone https://github.com/microsoft/LoRA.git
cd LoRA
pip install -e .
cd examples/NLU
pip install -e .
```
Change `export num_gpus=8` to `export num_gpus=1` in `roberta_large_cola.sh`, then `CUDA_VISIBLE_DEVICES=0 bash...

In Table 3 of the paper "LoRA: Low-Rank Adaptation of Large Language Models" it says that fine-tuning the top 2 layers of GPT-2 medium requires 25.19M trainable parameters. How to...
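For reference, a rough back-of-the-envelope count (my own sketch, not from the paper or the repo), assuming GPT-2 medium's hidden size of 1024 and counting the attention, MLP, and LayerNorm parameters of the last two transformer blocks, lands very close to that 25.19M figure:

```python
# Back-of-the-envelope parameter count for fine-tuning only the top 2
# transformer blocks of GPT-2 medium (hidden size d = 1024, MLP width 4d).
d = 1024
attn  = 4 * (d * d + d)                          # Q, K, V, O projections with biases
mlp   = (d * 4 * d + 4 * d) + (4 * d * d + d)    # up- and down-projections with biases
norms = 2 * (2 * d)                              # two LayerNorms (weight + bias each)
per_block = attn + mlp + norms                   # ~12.6M per block
print(2 * per_block)                             # 25_192_448, i.e. ~25.19M
```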

The recent commit https://github.com/microsoft/LoRA/commit/a0a92e0f26c067cf94747bdbf1ce73793fa44d19 flipped `A` and `B` in the comment for the LoRA `Linear` module: https://github.com/microsoft/LoRA/blob/a0a92e0f26c067cf94747bdbf1ce73793fa44d19/loralib/layers.py#L119-L125 The LoRA `Embedding` module similarly has the initialization flipped (not sure if this...
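For context, the convention the paper and the code comments describe is that the low-rank update ΔW = BA starts at zero because one of the two factors is zero-initialized; below is a minimal sketch of that convention for a `Linear`-style adapter (not the repository's exact code). Which factor plays which role is what the `Embedding` module swaps, as the issue points out.

```python
import math
import torch
import torch.nn as nn

# Sketch of the LoRA init convention for a Linear-style adapter:
# A gets a random (Kaiming) init, B is all zeros, so the update B @ A is zero
# at the start and the adapted layer initially matches the frozen weights.
d_in, d_out, r = 768, 768, 8
lora_A = nn.Parameter(torch.empty(r, d_in))
lora_B = nn.Parameter(torch.zeros(d_out, r))
nn.init.kaiming_uniform_(lora_A, a=math.sqrt(5))

delta_W = lora_B @ lora_A        # shape (d_out, d_in), all zeros at init
assert torch.all(delta_W == 0)
```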

Hi authors, I want to reproduce the results, but the performances reported in the paper are estimated over several different random seeds. Can I know how you determined the random seed...

In the following code, the comment says weight A is initialized as usual (Kaiming init, as in other places in the code) and B as zeros. However, the behavior is...

We use LoRA; is the output the whole model?
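In case it helps, loralib is designed so that only the LoRA parameters need to be saved; here is a small sketch using `lora.lora_state_dict`, with hypothetical checkpoint file names:

```python
import torch
import torch.nn as nn
import loralib as lora

# Toy model standing in for the fine-tuned network.
model = nn.Sequential(lora.Linear(16, 16, r=4), nn.Linear(16, 2))

# Save the full pretrained weights once, then only the LoRA parameters after
# training ('ckpt_pretrained.pt' and 'ckpt_lora.pt' are hypothetical names).
torch.save(model.state_dict(), 'ckpt_pretrained.pt')
torch.save(lora.lora_state_dict(model), 'ckpt_lora.pt')   # much smaller file

# To rebuild the full model later: load the pretrained weights first, then the
# LoRA weights; strict=False lets each checkpoint cover only its own subset.
model.load_state_dict(torch.load('ckpt_pretrained.pt'), strict=False)
model.load_state_dict(torch.load('ckpt_lora.pt'), strict=False)
```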

I want to reproduce the performance of RoBERTa-base fine-tuning. Changing `apply_lora` from True to False in `roberta_base_cola.sh` does not produce the proper performance. What else should I...
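One thing worth checking (my own suggestion, not from the repo) is that the full fine-tuning run actually trains every parameter, i.e. that `lora.mark_only_lora_as_trainable` is not applied when `apply_lora` is False; a tiny sanity check:

```python
import torch.nn as nn
import loralib as lora

# Toy stand-in; in the real run this would be the RoBERTa-base classifier.
model = nn.Sequential(lora.Linear(8, 8, r=4), nn.Linear(8, 2))

# A LoRA run typically freezes everything except the LoRA matrices via
#   lora.mark_only_lora_as_trainable(model)
# For a full fine-tuning baseline that call must be skipped, so the two
# counts below should match.
n_total = sum(p.numel() for p in model.parameters())
n_train = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"trainable parameters: {n_train} / {n_total}")
```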