Woojun Jeong
Results
1
issues of
Woojun Jeong
adding an API example that provides responses similar to OpenAI's chat completion and completion. This example is about 30% faster than existing similar examples because they are based on llama-cpp-python,...