llama-stack
llama-stack copied to clipboard
Composable building blocks to build Llama Apps
# What does this PR do? Scheduler: cancel tasks on shutdown. Otherwise the currently running tasks will never exit (before they actually complete), which means the process can't be properly...
Replace png image with mermaid diagram. Benefits: - Scalability - Maintainability - Mermaid diagrams allows hyperlinks. [Preview](https://github.com/makelinux/llama-stack/tree/250501%2B052645?tab=readme-ov-file#overview) Screeenshoot of the page with the diagram: 
### 🚀 Describe the new functionality needed When tracing is enabled and llama-stack makes a request to an external service, it should propagate the trace header. ([OpenTelemetry docs - Context...
### System Info Ubuntu 24.04 CUDA version: 12.8 NVIDIA A40 GPU GPU driver: 570.86.10 ### Information - [ ] The official example scripts - [x] My own modified scripts ###...
Our current provider configurations in templates use absolute paths (e.g., `~/.llama/distributions/...`) for storage locations, creating a tight coupling between configuration and deployment environment. This makes e.g. containerized deployments, particularly in...
### 🚀 Describe the new functionality needed We're missing the API endpoint to delete Responses from the conversation state - see https://platform.openai.com/docs/api-reference/responses/delete . ### 💡 Why is this needed? What...
### System Info watsonx Inference is not having Embedding Model in run.yaml ### 🐛 Describe the bug Currently , there is no embedding model support added in watsonx inference. Hence,...
### System Info all ### Information - [x] The official example scripts - [ ] My own modified scripts ### 🐛 Describe the bug It looks like there is a...
### System Info - Mac os Silicone - Conda ### Information - [x] The official example scripts - [ ] My own modified scripts ### 🐛 Describe the bug Stack...
### 🚀 Describe the new functionality needed Model Context Protocol allows applications to provide context for LLMs in a standardized way. It follows a server-client architecture. MCP servers standardize external...