Joshua Rosenkranz
Joshua Rosenkranz
Is there any way to speed that up (lambda x: x + 1), as the function being passed to java from python is stateless? Or, if not, is there a...
> Do you have link to the actual models so we can try this out and see how is the performance ? > > I made a lot of comments...
Closing as this has been merged via https://github.com/huggingface/text-generation-inference/pull/1865
> Hi! What's the status on this PR? I'd like to train a few speculator models, but I'm not sure how to get started, due to a lack of documentation......
> Is this expected to be merged soon? @philschmid We are expecting to have speculator training merged sometime in next 2 weeks.
@philschmid This has been finished and merged in #114. @philschmid The speculator training implementation is now available in main. Please let us know if you have any feedback or questions....
Closing in favor of #114