Composable SFT

Open haileyschoelkopf opened this issue 2 years ago • 12 comments

Paper: https://arxiv.org/pdf/2110.07560.pdf
Code: https://github.com/cambridgeltl/composable-sft

TODOs:

  • Determine the hyperparameters to use for a comparable test. This will likely mean x train steps + 50k rewound steps with one iteration of their method, or maybe 5 iterations + 10k train steps twice. Not decided yet.
  • Add support for loading an SFT from a path (NOT MAIN PRIORITY)
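
For reference when picking step counts, here is a rough sketch of one train/rewind iteration as I understand it from the paper: fully fine-tune, keep the k parameters that moved the most, rewind to the pretrained values, then train only those k. It uses plain Python lists rather than real model tensors, and every function name here is made up for illustration; none of this is the API of the composable-sft repo.

```python
# Toy sketch of one "train, select, rewind, retrain" iteration.
# Parameters are flat lists of floats; all names are hypothetical.

def select_topk_mask(pretrained, finetuned, k):
    """Indices of the k parameters with the largest absolute change."""
    diffs = [abs(f - p) for p, f in zip(pretrained, finetuned)]
    ranked = sorted(range(len(diffs)), key=lambda i: diffs[i], reverse=True)
    return set(ranked[:k])


def masked_update(params, grads, mask, lr):
    """One SGD step that only touches indices in the mask."""
    return [
        p - lr * g if i in mask else p
        for i, (p, g) in enumerate(zip(params, grads))
    ]


def sparse_diff(pretrained, trained, mask):
    """The sparse fine-tuning as a dict: index -> difference from pretrained."""
    return {i: trained[i] - pretrained[i] for i in mask}


def apply_sft(pretrained, diff):
    """Compose a sparse difference onto the pretrained parameters."""
    out = list(pretrained)
    for i, d in diff.items():
        out[i] += d
    return out
```

The first phase would produce `finetuned`, `select_topk_mask` picks what to keep, and the second phase restarts from `pretrained` and calls `masked_update` for the rewound steps. The two step counts in the TODO above correspond to those two phases.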

Training both adapters and Composable SFT at once will require some extra code. Probably not TOO bad, but it would need extra testing to make sure all the right parameters are frozen.
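
A possible shape for that extra testing, as a sketch only: snapshot the parameters before and after a training step and report any entry that changed but should have been frozen. It assumes adapter parameters can be recognized by a substring in their name and that SFT masks are stored per parameter name; both are assumptions for illustration, not how our code or the composable-sft repo is organized.

```python
# Toy freeze check for joint adapter + SFT training.
# Parameters are dicts of name -> flat list of floats; names are hypothetical.

def allowed_entries(name, size, sft_masks, adapter_marker="adapter"):
    """Indices of a parameter that are allowed to change during training."""
    if adapter_marker in name:
        # Assumption: adapter parameters are trained in full.
        return set(range(size))
    # Base-model parameters may only change at SFT-selected indices.
    return set(sft_masks.get(name, set()))


def frozen_violations(before, after, sft_masks, adapter_marker="adapter"):
    """List (name, index) pairs that changed even though they were frozen."""
    violations = []
    for name, old in before.items():
        allowed = allowed_entries(name, len(old), sft_masks, adapter_marker)
        for i, (o, n) in enumerate(zip(old, after[name])):
            if o != n and i not in allowed:
                violations.append((name, i))
    return violations
```

An empty list from `frozen_violations` after a step means nothing outside the adapters and the SFT mask moved.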

haileyschoelkopf, Jun 28 '22 01:06