Awni Hannun comments

Results: 1014 comments of Awni Hannun

Convert Musicgen to MLX

We have a discord link here: https://github.com/ml-explore/mlx/discussions/733 You shouldn't need to implement weight norm in C++. That can all be done in Python using existing ops.
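As a rough illustration of why no C++ is needed: weight normalization, as usually defined, reparameterizes a weight as `w = g * v / ||v||`, which takes only a square, a sum, a square root, a multiply, and a divide. The sketch below spells out that arithmetic in plain Python so each step is visible; each step corresponds to an ordinary elementwise op or reduction in an array library. The names `weight_norm`, `v`, `g`, and `eps` are illustrative, and normalizing per row is an assumption (the axis depends on the layer being converted).

```python
import math

def weight_norm(v, g, eps=1e-12):
    """Return w with rows w[i] = g[i] * v[i] / ||v[i]||_2.

    v: list of rows (one row per output unit), g: one scale per row.
    eps guards against division by zero for an all-zero row.
    """
    w = []
    for row, scale in zip(v, g):
        norm = math.sqrt(sum(x * x for x in row))  # L2 norm of the row
        w.append([scale * x / (norm + eps) for x in row])
    return w

# Hypothetical values: the first row has norm 5, so scaling by g=10 doubles it.
v = [[3.0, 4.0], [0.0, 2.0]]
g = [10.0, 1.0]
w = weight_norm(v, g)
```

After the call, each row of `w` has an L2 norm equal to the matching entry of `g`, which is the property the reparameterization is meant to give.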

Stable diffusion xl

Looks nice! I requested a review from @angeloskath as I'm not that familiar with this example.

After finished fine-tune training.. Invalid header in file issue..!

That is definitely not expected. The adapter file should have a `npz` extension (our `savez` automatically adds this even if you don't specify it). Could you double check that the...
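One quick way to narrow down an "invalid header" error is to check whether the file is a readable archive at all. The sketch below assumes the adapter file follows the NumPy `.npz` convention (a zip archive of `.npy` members) and uses only the standard library. `describe_npz` and the file names are hypothetical, and the demo writes throwaway files rather than real adapter weights.

```python
import os
import tempfile
import zipfile

def describe_npz(path):
    """Return the member names if `path` is a readable zip archive, else None."""
    if not zipfile.is_zipfile(path):
        return None
    with zipfile.ZipFile(path) as archive:
        return archive.namelist()

# Demo with throwaway files: one valid archive and one file with a bad header.
with tempfile.TemporaryDirectory() as tmp:
    good = os.path.join(tmp, "good.npz")
    with zipfile.ZipFile(good, "w") as archive:
        archive.writestr("weight.npy", b"placeholder bytes, not a real array")

    bad = os.path.join(tmp, "bad.npz")
    with open(bad, "wb") as f:
        f.write(b"not a zip archive")

    good_members = describe_npz(good)
    bad_members = describe_npz(bad)
```

A `None` result for the real adapter file would suggest it was truncated, or written in a different format than its extension implies.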

Prompt Lookup Decoding - merged under Speculative example

> @awni perhaps we can leave this as T5 and then make an attempt at swapping to Llama in a new PR?

Yea that sounds like a great plan to...

Prompt Lookup Decoding - merged under Speculative example

Sorry for the delay!! I will review and get this in early next week.

mistral example seems to hang on Loading model from disk.

Strange... I just ran it with no problem on an M1 Max. Is it possible the weights are ill-formatted? Maybe the conversion didn't work?

mistral example seems to hang on Loading model from disk.

If this is still an issue, please reopen.

mistral example seems to hang on Loading model from disk.

@Diniden we've updated MLX and fixed a bug with loading from NumPy so try `pip install --upgrade mlx` and see if that helps. I would also double check the weights...

mistral example seems to hang on Loading model from disk.

Hmm, I wonder why it's so slow. I assume you are using the Metal back-end?

```
python -c "import mlx.core as mx; print(mx.default_device())"
```

Should give `Device(gpu, 0)`. If it's...

mistral example seems to hang on Loading model from disk.

I have a 32 GB machine and the Mistral example runs pretty quickly. It seems like somewhere between 16 GB and 32 GB is the cutoff right now to get good perf, but...
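To see which side of that rough cutoff a machine falls on, total physical memory can be read from the standard library. This is a minimal sketch, assuming a platform that exposes the `SC_PHYS_PAGES` and `SC_PAGE_SIZE` sysconf names; `physical_memory_gb` is an illustrative helper that returns `None` where they are unavailable.

```python
import os

def physical_memory_gb():
    """Return total physical RAM in GiB, or None if the platform does not expose it."""
    try:
        pages = os.sysconf("SC_PHYS_PAGES")
        page_size = os.sysconf("SC_PAGE_SIZE")
    except (ValueError, OSError, AttributeError):
        # Unknown sysconf name, or os.sysconf missing on this platform.
        return None
    if pages <= 0 or page_size <= 0:
        return None
    return pages * page_size / 2**30

mem = physical_memory_gb()
if mem is None:
    print("Could not determine physical memory on this platform")
else:
    print(f"Physical memory: {mem:.1f} GiB")
```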