minions
minions copied to clipboard
Add train_minions utils for training memory estimate
Added PR to create a train_minions.py file in utils which automatically detects the underlying hardware and selects the best model + training config (Full/LoRA). There are a number of todos
support for quanitzationsupport fdsp- support QLoRA
- support MoE
Assumes a sequence length of 512 (should take this as input?)- Overhead for parallelization methods