pyreft issues

forward() got an unexpected keyword argument 'unit_locations'

I have an error on trainer.train(). Please help me! ## Error ``` TypeError: LlamaForCausalLM.forward() got an unexpected keyword argument 'unit_locations' ``` ## code ```python import pyreft import torch import transformers...

xerkey
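
A `TypeError` of this shape generally means a keyword meant for a wrapper reached a plain `forward()` that does not declare it. The code in this issue is truncated, so the actual cause here is unknown. The sketch below only illustrates the mechanism with toy classes; `ToyBaseModel` and `ToyInterventionWrapper` are hypothetical names and are not part of pyreft or transformers.

```python
class ToyBaseModel:
    """Stand-in for a plain model whose forward() knows nothing about interventions."""

    def forward(self, input_ids):
        return [token_id * 2 for token_id in input_ids]


class ToyInterventionWrapper:
    """Stand-in for a wrapper that accepts the extra keyword and does not pass it on."""

    def __init__(self, model):
        self.model = model

    def forward(self, input_ids, unit_locations=None):
        # The wrapper consumes unit_locations itself; the base model never sees it.
        outputs = self.model.forward(input_ids)
        return outputs, unit_locations


base = ToyBaseModel()
wrapped = ToyInterventionWrapper(base)

# Passing the extra keyword straight to the base model fails.
try:
    base.forward([1, 2, 3], unit_locations=[[0]])
    error_message = None
except TypeError as exc:
    error_message = str(exc)

# Passing it to the wrapper works, because the wrapper declares the keyword.
wrapped_outputs, seen_locations = wrapped.forward([1, 2, 3], unit_locations=[[0]])
```

If the same pattern applies here, one thing worth checking is whether the object handed to the trainer is the wrapped model or the underlying base model.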

[P1] Loading REFT for RoBERTa Models

4

I was training and saving REFT modules for the RoBERTa model. However, loading them does not seem to be possible with the current implementation. I get the following error: ``` Traceback...

hSterz

question

[P0] Make `make_last_position_supervised_data_module` parallelizable to speed up processing!

2

Hey team, I am having issues with large datasets (~10k samples or more). Calling the `make_last_position_supervised_data_module` function is slower than the training itself. The root cause is that the function...

truskovskiyk

enhancement
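
The root cause in this issue is truncated, so the following is only a generic sketch of parallelizing per-example preprocessing, assuming each example can be processed independently. `build_example`, `build_examples_parallel`, and `toy_tokenize` are hypothetical helpers, not pyreft functions, and the whitespace tokenizer stands in for a real one.

```python
from concurrent.futures import ThreadPoolExecutor


def build_example(text, tokenize):
    """Hypothetical per-example step: tokenize and record the last token position."""
    token_ids = tokenize(text)
    return {"input_ids": token_ids, "last_position": len(token_ids) - 1}


def build_examples_parallel(texts, tokenize, max_workers=4):
    """Process examples concurrently; pool.map returns results in input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda text: build_example(text, tokenize), texts))


def toy_tokenize(text):
    # Whitespace split used as a stand-in for a real tokenizer.
    return text.split()


examples = build_examples_parallel(["a b c", "d e"], toy_tokenize)
```

Threads only help when the per-example work releases the GIL. For CPU-bound pure-Python work, a process pool or the `num_proc` argument of `datasets.Dataset.map` may be a better fit.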

[P1] Convert reft model to hf model

1

I want to fine-tune the base model on code datasets for coding tasks. However, for evaluation, I would like to convert the generated reft...

thu-yn

question

[P1] QLoReFT

big model go brrrr

aryamanarora

enhancement

[P1] ReFT+PEFT by using ReftModel to wrap PeftModel

2

**Descriptions:** From a quick look at the PEFT library, it wraps an nn.Module as a PEFT nn.Module that accepts gradients, is trainable, and behaves like any other nn.Module. This is highly...

frankaging

enhancement

[P0] Additional intervention arguments are not saved correctly, e.g. `add_bias`

**Descriptions:** For customizable interventions, people might want to save the interventions with their customized arguments. For instance, the dropout ratio, the activation function type or whether to add a special...

frankaging

bug
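
One way to picture the requirement is a config that round-trips its custom arguments through serialization. The sketch below is a minimal illustration and does not reflect how pyreft or pyvene store interventions; `ToyInterventionConfig`, `save_config`, and `load_config` are hypothetical, and the values are made up.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class ToyInterventionConfig:
    """Hypothetical config: fixed fields plus a dict for customized arguments."""
    embed_dim: int
    low_rank_dimension: int
    extra_kwargs: dict = field(default_factory=dict)


def save_config(config):
    # Serialize every field, including the customized arguments.
    return json.dumps(asdict(config), sort_keys=True)


def load_config(payload):
    # Rebuild the config from the serialized fields.
    return ToyInterventionConfig(**json.loads(payload))


original = ToyInterventionConfig(
    embed_dim=768,
    low_rank_dimension=4,
    extra_kwargs={"add_bias": True, "dropout": 0.1, "act_fn": "relu"},
)
restored = load_config(save_config(original))
```

Keeping the custom arguments in a single dict means new options do not need a schema change, at the cost of no per-argument validation.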

[P1] TGI and vLLM support

7

1. Are there plans for inference support? This is needed if it is to be used by devs in production. 2. Is fine-tuning much faster than LoRA? - Optimization and...

RonanKMcGovern

question

[P1] Location of code for "LM training and serving with ReFT"

2

The README mentions the ability to serve at scale with continuous batching. Even if not vLLM or TGI, is there some work that someone could point me to on this?...

RonanKMcGovern

enhancement

[P2] Pyreft tensorboard integration

As shown by issue #69, Pyreft did not work well with tensorboard callbacks. We may need to modify Pyvene to remove the serialization of "types" in configs.

PinetreePantry

bug
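
To illustrate why class objects in a config can break JSON-based logging, the sketch below shows `json.dumps` failing on a `type` value and one possible workaround that stores the class's qualified name instead. This is an alternative to the removal the issue proposes, not the fix itself; `ToyIntervention` and `type_to_name` are hypothetical.

```python
import json


class ToyIntervention:
    """Hypothetical intervention class stored directly in a config."""


def type_to_name(obj):
    # Called by json.dumps for objects it cannot serialize natively.
    if isinstance(obj, type):
        return f"{obj.__module__}.{obj.__qualname__}"
    raise TypeError(f"Object of type {type(obj).__name__} is not JSON serializable")


config = {"intervention_type": ToyIntervention, "layer": 8}

# A plain dump fails because a class object is not JSON serializable.
try:
    json.dumps(config)
    plain_dump_failed = False
except TypeError:
    plain_dump_failed = True

# With a default hook, the class is written out as its qualified name.
serialized = json.dumps(config, default=type_to_name)
```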
