llama-stack
llama-stack copied to clipboard
Composable building blocks to build Llama Apps
### 🚀 Describe the new functionality needed Users may want to precompute embeddings outside of the Stack using a variety of other tools (e.g., https://github.com/meta-llama/llama-stack/pull/1563, https://github.com/meta-llama/llama-stack/pull/1866, https://github.com/meta-llama/llama-stack/pull/1290). Allowing users to...
# What does this PR do? Adds a new endpoint that is compatible with OpenAI for embeddings api. `/openai/v1/embeddings` Added providers for OpenAI, LiteLLM and SentenceTransformer. ## Test Plan ```...
### 🚀 Describe the new functionality needed Merge queue: https://docs.github.com/en/repositories/configuring-branches-and-merges-in-your-repository/configuring-pull-request-merges/managing-a-merge-queue ### 💡 Why is this needed? What if we don't build it? This is to avoid the situation where CI...
### System Info Latest llama-stack from main commit 2603f10f95fcd302297158adb709d2a84c9f60af ### 🐛 Describe the bug When testing the responses API with the ollama provider, I'm getting frequent errors with unique id...
### 🚀 Describe the new functionality needed Implement Embedding, keyword, and hybrid search for available providers. ### 💡 Why is this needed? What if we don't build it? We need...
### System Info python -m "torch.utils.collect_env" /Users/bmurdock/.pyenv/versions/3.10.16/lib/python3.10/runpy.py:126: RuntimeWarning: 'torch.utils.collect_env' found in sys.modules after import of package 'torch.utils', but prior to execution of 'torch.utils.collect_env'; this may result in unpredictable behaviour warn(RuntimeWarning(msg))...
# What does this PR do? Add support for hybrid search mode in SQLite-vec provider, which combines keyword and vector search for better results. The implementation: - Adds hybrid search...
# What does this PR do? [Provide a short summary of what this PR does and why. Link to relevant issues if applicable.] Removes the ability to run llama stack...
I think the implementation needs more simplification. Spent way too much time trying to get the tests pass with models not co-operating :( Finally had to switch claude-sonnet to get...
## What does this PR do? Adds comprehensive test suite for the synthetic data kit provider implementation, including both unit and integration tests. This ensures the provider's functionality, configuration handling,...