Alex Cheema issues

Results 117 issues of


                                            Alex Cheema

Add debug panel to terminal ui with a flag

- Tokens per second - Time to first token - In-flight requests - List of all models downloaded already

Fix configure_mlx.sh for different devices

The limits set in `configure_mlx.sh` should be dynamic.

Add support for this model with the MLX backend: https://huggingface.co/black-forest-labs/FLUX.1-dev There's already an example of FLUX using MLX here: https://github.com/ml-explore/mlx-examples/blob/main/flux/flux/model.py

just testing

Just testing.

tinygrad threading issue: SQLite objects created in a thread can only be used in that same thread

How to reproduce: - Run a cluster on 2 nodes - Run a request - Restart one node - Run another request ``` Traceback (most recent call last): File "/Users/alex/exo/exo/api/chatgpt_api.py",...

migrate from circleci to github actions

test

TUI Improvements

- Make the TUI updates asynchronous. Probably just a framerate e.g. once per second - The TUI shouldn't update every time a node is active - we should just show...

Alex Cheema