Oleh Kuznetsov
Oleh Kuznetsov
### Issue you'd like to raise. Looks like the inference time of the LLamacpp model is a lot slower when using LlamaCpp wrapper (compared to the llama-cpp original wrapper). Here...
# Update llamacpp demonstration notebook Add instructions to install with BLAS backend, and update the example of model usage. Fixes #5071. However, it is more like a prevention of similar...
### Please check that this issue hasn't been reported before. - [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports. ### Expected Behavior Evaluations should happen at the...
[x] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug. **Describe the bug** Unable to instantiate `HuggingfaceEmbeddings`, because it doesn't implement asynchronous embed methods of base...
According to [example](https://github.com/ollama/ollama-python/blob/main/examples/tools.py) and definition of Message model, it seems like there is no way to actually represent executed tool message as a Message model, only using dictionary? I'm arriving...