BigsnarfDude

Results 24 comments of BigsnarfDude

EXAMPLE https://github.com/ml-explore/mlx-examples/tree/main/lora SAVING if you look at this file and grok the usage https://github.com/ml-explore/mlx-examples/blame/main/lora/lora.py#L327 you see it being used in training loop. docs here: https://ml-explore.github.io/mlx/build/html/python/_autosummary/mlx.core.savez.html LOADING loading weights from npz...

@sandeepimpressico looks like framework has got new code for checkpoint. just the npz file is all that is needed for checkpoints save and weights_load. here is the code: https://github.com/ml-explore/mlx-examples/commit/d8680a89f986492dbc27c36af3294034db26458f

``` in shakespeare directory -rw-r--r-- 1 user staff 1.1M 12 Apr 18:21 input.txt -rw-r--r-- 1 user staff 1.1K 5 Apr 20:55 prepare.py -rw-r--r-- 1 user staff 161B 5 Apr 20:55...

I've trained GPT2 from scratch on single RTX 4070 16GB card with no issues. Takes a few days and you have to tune training configuration to make sure its small....