Results 1 issues of Woojun Jeong

adding an API example that provides responses similar to OpenAI's chat completion and completion. This example is about 30% faster than existing similar examples because they are based on llama-cpp-python,...