mergekit
mergekit copied to clipboard
Add support for GPTBigCodeForCausalLM
Addresses #80.
Also does some plumbing adjustments to more robustly handle GPT 2 based models. Going to keep this as a draft for a while until I can test it sufficiently.