mlx-vlm
Batch inference support (self-assigning)
It would be nice to have batch inference support similar to mlx_parallm; I'm happy to try adding it soon. @Blaizzy, can you assign this to me?
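For context, batched generation typically requires padding variable-length prompts to a common length so they can be stacked into one tensor, plus an attention mask to ignore the padding. Below is a minimal, library-agnostic sketch of that step; `PAD_ID` and `pad_batch` are illustrative names, not part of the mlx-vlm or mlx_parallm API:

```python
# Illustrative sketch of batch preparation for batched inference.
# PAD_ID and pad_batch are hypothetical, not mlx-vlm API.
PAD_ID = 0

def pad_batch(prompts):
    """Left-pad token lists to a common length and build an attention mask.

    Left-padding keeps the most recent tokens aligned at the right edge,
    which is the usual convention for batched autoregressive decoding.
    """
    max_len = max(len(p) for p in prompts)
    padded, mask = [], []
    for p in prompts:
        pad = max_len - len(p)
        padded.append([PAD_ID] * pad + list(p))
        mask.append([0] * pad + [1] * len(p))
    return padded, mask

# Example: two prompts of different lengths become one rectangular batch.
padded, mask = pad_batch([[5, 6], [7, 8, 9]])
# padded → [[0, 5, 6], [7, 8, 9]]
# mask   → [[0, 1, 1], [1, 1, 1]]
```

After this step, the padded batch can be run through the model in a single forward pass, with the mask zeroing out attention to pad positions.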
Hey Will,
Yes, that would be awesome!
I have assigned the task to you 😀
Please comment on #40 so I can assign it to you there and we can discuss the details in a single issue.