lmql icon indicating copy to clipboard operation
lmql copied to clipboard

Using RayServe instead of lmql serve-model

Open ranipakeyur opened this issue 2 years ago • 1 comments

Is it possible to serve model through Ray Serve Instead of lmql serve-model ? If so how to modify "from" clause to access the Ray Serve API endpoint ?

ranipakeyur avatar Aug 06 '23 15:08 ranipakeyur

It looks like Ray Serve does offer a relatively flexible access to the model. Most importantly, for support, we need access to the next-token distribution and a way to shift it according to a logit bias. @charles-dyfis-net is working on Replicate support, there may be some parallels here.

lbeurerkellner avatar Aug 17 '23 08:08 lbeurerkellner