llama-stack icon indicating copy to clipboard operation
llama-stack copied to clipboard

Composable building blocks to build Llama Apps

Results 360 llama-stack issues
Sort by recently updated
recently updated
newest added

# What does this PR do? Scheduler: cancel tasks on shutdown. Otherwise the currently running tasks will never exit (before they actually complete), which means the process can't be properly...

CLA Signed

Replace png image with mermaid diagram. Benefits: - Scalability - Maintainability - Mermaid diagrams allows hyperlinks. [Preview](https://github.com/makelinux/llama-stack/tree/250501%2B052645?tab=readme-ov-file#overview) Screeenshoot of the page with the diagram: ![image](https://github.com/user-attachments/assets/16e45ea8-a1d4-4b95-9fd8-21a749ac7332)

CLA Signed

### 🚀 Describe the new functionality needed When tracing is enabled and llama-stack makes a request to an external service, it should propagate the trace header. ([OpenTelemetry docs - Context...

enhancement

### System Info Ubuntu 24.04 CUDA version: 12.8 NVIDIA A40 GPU GPU driver: 570.86.10 ### Information - [ ] The official example scripts - [x] My own modified scripts ###...

bug
stale

Our current provider configurations in templates use absolute paths (e.g., `~/.llama/distributions/...`) for storage locations, creating a tight coupling between configuration and deployment environment. This makes e.g. containerized deployments, particularly in...

### 🚀 Describe the new functionality needed We're missing the API endpoint to delete Responses from the conversation state - see https://platform.openai.com/docs/api-reference/responses/delete . ### 💡 Why is this needed? What...

enhancement

### System Info watsonx Inference is not having Embedding Model in run.yaml ### 🐛 Describe the bug Currently , there is no embedding model support added in watsonx inference. Hence,...

bug

### System Info all ### Information - [x] The official example scripts - [ ] My own modified scripts ### 🐛 Describe the bug It looks like there is a...

bug
stale

### System Info - Mac os Silicone - Conda ### Information - [x] The official example scripts - [ ] My own modified scripts ### 🐛 Describe the bug Stack...

bug
stale

### 🚀 Describe the new functionality needed Model Context Protocol allows applications to provide context for LLMs in a standardized way. It follows a server-client architecture. MCP servers standardize external...

enhancement
stale