Sheng Liu
> The model can be re-used within the machine. We typically recommend creating a Predictor per thread. So if you are re-loading the model multiple times, you can avoid it...
> @oreo-yum > > The `predictor` for PyTorch is pretty light-weighted. Only the model takes significant memory. You can use a global `Predictor` (assume you use PyTorch) per JVM if...
@skeretna what is the working langchain version? I updated my langchain package and got the same failure, but it worked before the update. What's sad is I couldn't change the...
@XReyRobert I guess https://github.com/hyperonym/basaran might be such a project. It works directly with huggingface models.
@jstzwj Thanks for the fix, happy to see it in next version.
That would be great! Vicuna is the only self-hosted model I've tried that could execute complex prompts within langchain with very few changes. If we had `OpenVicuna`, there would be...
> > > > What do you mean? OpenLlaMA is an open source model trained on the RedPajama dataset. You mean fine tune it using the same method Vicuna was...
Thanks to the openai API, it's quite easy to integrate with langchain. One small difference is that langchain can pass multiple strings as the 'stop' param, while in fschat it's a single...
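One way to bridge that gap client-side, assuming the backend only honours a single stop string: truncate the returned text at the earliest occurrence of any of langchain's stop strings. A minimal sketch (the function name is mine, not from either library):

```python
def truncate_at_stops(text, stops):
    """Cut `text` at the earliest occurrence of any stop string,
    emulating a multi-string `stop` parameter on top of an API
    that only supports a single one."""
    cut = len(text)
    for s in stops:
        i = text.find(s)
        if i != -1:
            cut = min(cut, i)
    return text[:cut]
```

This post-processes the completion, so the backend still generates up to its own (single) stop or token limit; it only guarantees the caller never sees text past any stop string.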
@paolorechia That's great! We could select embeddings based on https://huggingface.co/spaces/mteb/leaderboard. Since the OpenAI model text-embedding-ada-002 is also listed, we could find some open-source models with similar performance by ranking...
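The selection step could look something like this: take the leaderboard scores, use text-embedding-ada-002 as the reference, and keep models within some tolerance of it. The scores below are made-up placeholders, not real MTEB numbers, and the helper name is mine:

```python
def similar_models(leaderboard, reference="text-embedding-ada-002", delta=2.0):
    """leaderboard: dict mapping model name -> average benchmark score.
    Returns model names within `delta` points of the reference model,
    sorted best-first. Scores passed in are assumed, not real data."""
    ref = leaderboard[reference]
    return sorted(
        (name for name, score in leaderboard.items()
         if name != reference and abs(score - ref) <= delta),
        key=lambda n: -leaderboard[n],
    )
```

With the real leaderboard one would also filter on licence and model size before picking a replacement.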