mlx-examples
mlx-examples copied to clipboard
mlx_lm.batch
Can you add the functionality of MLX batches similar to ParaLLM?
https://github.com/willccbb/mlx_parallm/issues/9#issuecomment-2646443736
There is an open PR to add batch support for mlx-lm: https://github.com/ml-explore/mlx-examples/pull/948
Will try and get it landed soon if possible.
There is an open PR to add batch support for mlx-lm: #948
Will try and get it landed soon if possible.
Really need it, we definitely will face concurrent calling from users.