trlx icon indicating copy to clipboard operation
trlx copied to clipboard

T5 FLAN 11B Config for a consumer GPU

Open LouisCastricato opened this issue 2 years ago • 3 comments

🚀 The feature, motivation, and pitch

Now that @jon-tow @ethankim00 have merged LoRA and 8bit adam, we can create an example where one RLHF tunes T5 FLAN 11B on a consumer GPU with minimal CPU offloading.

If we can get sentiment ppo working in a 24GB of VRAM constrained environment (like a 3090), I think that would be a great demo to show people who want to run trlX at home.

@reciprocated said that he has gotten CPU offloading working before.

Alternatives

We could also do 6B with no offloading on a 3090 now as well probably.

Additional context

No response

LouisCastricato avatar Jan 09 '23 18:01 LouisCastricato

I thiiink for LoRA, t5 needs an entry in utils.modeling.py::MODIFIED_MODULES_DICT

Or at least one would be nice to have.

aaronrmm avatar Jan 23 '23 15:01 aaronrmm

Do you want add that?

LouisCastricato avatar Jan 23 '23 20:01 LouisCastricato

I'm down to try

aaronrmm avatar Jan 24 '23 03:01 aaronrmm