Brian Park comments

Results 5 comments of


                                            Brian Park

assert len(args) >= len(self.undetermined) AssertionError

I have the same issue when doing densenet121 from torchvision's pretrained models.

What is the Expected Inference Performance

@tcapelle I tried the LLaMA example on my M1 Pro 32GB. It's indeed slow, and I think that's mostly due to the weights being FP32. I haven't checked Mistral example...

What is the Expected Inference Performance

@rovo79 Correct. As per #18, ANE API is closed source and not publicly accessible. I believe the only way to touch ANE today is via CoreML.

It would be possible to add other kind of graphs to your analysis?

Hi Raidell, Thank you for your praise and suggestions. The graphs and analysis you show in the link are definitely interesting! I stopped developing and updating this repository a while...

Optimization Plans for Conv2D CPU Execution

Has there been an attempt or discussion to port in BNNS conv into MLX? [It's listed as TODO](https://github.com/ml-explore/mlx/blob/e6fecbb3e1e2ecb51247280f3738d670241777db/mlx/backend/accelerate/conv.cpp#L17). I've looked into it personally, but I'm noticing some limitations with BNNS...