2 comments by Oliver Schiessl
> I think the problem is that NumPy doesn't support bf16, and in [sharded_inference_engine.py](https://github.com/exo-explore/exo/blob/main/exo/inference/mlx/sharded_inference_engine.py) the program tries to convert MLX tensors to NumPy arrays, which causes the error. ...
> [@lordoliver](https://github.com/lordoliver) Can you check whether it slows down non-bf16 models too, or whether it breaks the code in the worst case? I am not much into it, maybe its...
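
On the bf16 point in the first comment: a common workaround for this kind of error is to upcast to float32 before handing data to NumPy (in MLX that would presumably be an `astype(mx.float32)` call ahead of the conversion; the excerpts above do not show whether exo does this). The sketch below is not code from exo. It uses only the Python standard library, with hypothetical helper names, to illustrate why such an upcast should not change values: a bf16 number is the upper 16 bits of an IEEE 754 float32, so widening just pads the low bits with zeros.

```python
import struct


def float32_to_bf16_bits(value: float) -> int:
    """Return the bf16 bit pattern of a float32 by truncation (round toward zero)."""
    (bits32,) = struct.unpack(">I", struct.pack(">f", value))
    return bits32 >> 16  # keep sign, 8 exponent bits, top 7 mantissa bits


def bf16_bits_to_float32(bits16: int) -> float:
    """Widen a bf16 bit pattern to float32 by zero-filling the low 16 bits."""
    (value,) = struct.unpack(">f", struct.pack(">I", (bits16 & 0xFFFF) << 16))
    return value


def roundtrips_exactly(bits16: int) -> bool:
    """Check that bf16 -> float32 -> bf16 gives back the same bit pattern."""
    return float32_to_bf16_bits(bf16_bits_to_float32(bits16)) == bits16


if __name__ == "__main__":
    widened = bf16_bits_to_float32(0x4049)  # bf16 approximation of pi
    print(widened, hex(float32_to_bf16_bits(widened)))
```

The widening itself is lossless, so any slowdown would come from the extra cast and the doubled memory per element, not from changed values. Whether that cost matters for non-bf16 models is the open question in the second comment, and this sketch does not answer it.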