Alex Cheema issues

Results 117 issues of


                                            Alex Cheema

[BOUNTY - $100] DummyInferenceEngine

- We support several `InferenceEngine` implementations including MLX, tinygrad and PyTorch - Sometimes you want to test everything **except** the inference itself - It would be helpful to have a...

[BOUNTY - $100] Vision Model Integration Test

- As expected since we don't have tests for vision models, they broke unexpectedly: https://github.com/exo-explore/exo/issues/327 - Create a test similar to the existing one (`chatgpt_api_integration_test_mlx`) we have for text in...

benchmark for different inference engines

[BOUNTY - $1000] Get exo Python node running on iOS

# Background I experimented with a rust-based exo implementation that used [UniFFI](https://mozilla.github.io/uniffi-rs/latest/) for foreign language bindings so I could run it from a Swift iOS app. I didn't like this...

[BOUNTY - $100] Support running any model from huggingface

Like this: https://x.com/reach_vb/status/1846545312548360319 ```sh exo run hf. co/bartowski/Llama-3.2-1B-Instruct-GGUF:Q8_0 ``` This should work out of the box with #139

Auto selection of inference engine

Right now, inference engines are not interoperable e.g. you can't run one machine on `MLX` and another on `tinygrad` - they need to both be using the same inference engine....

[BOUNTY - $100] Support Llama 3.2 1B on tinygrad

- Currently we support Llama 3.2 1B on MLX but not tinygrad - Add support for Llama 3.2 1B - Might just work out of the box, if not I...

[BOUNTY - $100] Support changing manual configuration while running

#368 + #381 added support for manual configuration but if you update the configuration file while exo is running it won't update. This should be a simple fix, hence $100...

simple Dockerfile for rurnning mlx

on hold

Fix prompt caching

- Tried adding this recently - Commented out because it was causing issues - No reason this shouldn't work, should be a simple fix