trlx
trlx copied to clipboard
Support for CodeGenForCausalLM
🚀 The feature, motivation, and pitch
I'm trying to apply RL in a code generation LM: https://huggingface.co/docs/transformers/model_doc/codegen, unfortunately getting the error below:
ValueError: Unsupported architecture: `CodeGenForCausalLM`. The following architectures are available for model
branching:
['GPTJForCausalLM', 'GPT2LMHeadModel', 'GPTNeoForCausalLM', 'GPTNeoXForCausalLM', 'OPTForCausalLM', 'BloomModel',
'BloomForCausalLM', 'LlamaModel', 'LlamaForCausalLM']
I wonder if it would be possible to add support for code modes. Thanks!
Alternatives
No response
Additional context
No response
@jon-tow
mark
Also there is neeed of Support for GPTBigCodeForCausalLM