Markus Enzweiler comments

Results 18 comments of


                                            Markus Enzweiler

Updated CIFAR-10 ResNet example to use BatchNorm instead of LayerNorm

I have noticed a significantly decreased throughput vs. the info in the original README. Both on a 16 MB M1 Macbook Pro. I am also seeing that without this PR...

Updated CIFAR-10 ResNet example to use BatchNorm instead of LayerNorm

@awni I did not record the "Previous" results. It is taken from the original ```README.md```. I have tried to reproduce it with earlier mlx versions (down to 0.0.2) but am...

Updated CIFAR-10 ResNet example to use BatchNorm instead of LayerNorm

Updated requirements.txt to require mlx>=0.0.9. Rebased and force-pushed.

Updated CIFAR-10 ResNet example to use BatchNorm instead of LayerNorm

Tested with mlx 0.0.9. Ready to be merged.

PyTorch (MPS) is faster than MLX for training and inference for ResNets and Transformers (tested on 2 tasks)

Adding another data point here (M1 16 GB, first gen MB Pro) with mlx 0.0.7, PyTorch 2.1.0 and the ```benchmarking``` branch from https://github.com/SarthakYadav/mlx-examples. ## MNIST num_layers = 4 hidden_dim =...

PyTorch (MPS) is faster than MLX for training and inference for ResNets and Transformers (tested on 2 tasks)

@SunnyBeike In my experiments with small (image) CNNs, see above, I am not seeing different GPU frequencies according to ```asitop```. For both torch and mlx, I am getting near 100%...

[BUG] cifar example fails with ValueError when replacing LayerNorm with BatchNorm with default params

> I was able to repro using the code @menzHSE provided. If you pass `retain_graph=True` to the `eval()` method it suppresses the error, but it trains much slower, as expected....

Example of a Convolutional Variational Autoencoder (CVAE) on MNIST

Edit: Fixed in https://github.com/ml-explore/mlx/pull/409 There is one existing issue that I am not sure how to handle. When loading the model with ```strict=True``` I am getting a shape error for...

Example of a Convolutional Variational Autoencoder (CVAE) on MNIST

Awesome! I'll keep working on it and will wait for the next mlx release anyway. Regarding the batch norm issue: I tried but could not reproduce it with a minimal...

Example of a Convolutional Variational Autoencoder (CVAE) on MNIST

Verified that https://github.com/ml-explore/mlx/pull/409 fixed the loading in strict mode with mlx@1d90a76