llama-stack

Composable building blocks to build Llama Apps

Results 360 llama-stack issues

# What does this PR do? Introduce a new workflow to run integration and agent tests against the OpenAI provider, and a subset of inference tests against Fireworks AI. Responses...

CLA Signed

### 🚀 Describe the new functionality needed I would like to use OpenTelemetry-Tracing to track my agent's activities. I created an agent as follows and registered a local 'calculator' and...

enhancement

### 🚀 Describe the new functionality needed As I was writing an inference provider that needs to dynamically register models while starting (not through static declaration in run.yaml), I got...

enhancement

### 🚀 Describe the new functionality needed - Spec and build what is needed to build the v1 version of the playground ### 💡 Why is this needed? What if...

enhancement

### 🚀 Describe the new functionality needed All endpoints need a way to be tested and sent back to the endpoint to see what is not working. (25 examples) ###...

enhancement
stale

### 🚀 Describe the new functionality needed We want to be able to keep track of message passing flow in multi-agent workflows. e.g. https://github.com/meta-llama/llama-stack/blob/main/docs/notebooks/Llama_Stack_Agent_Workflows.ipynb 1/ expose api to keep track...

enhancement
stale

### 🚀 Describe the new functionality needed We need to refactor llm-as-judge to make it easy for users to perform CRUD operations. - unregister scoring functions - persisting judge prompts...

enhancement
stale

### System Info :128: RuntimeWarning: 'torch.utils.collect_env' found in sys.modules after import of package 'torch.utils', but prior to execution of 'torch.utils.collect_env'; this may result in unpredictable behaviour Collecting environment information... PyTorch...

bug
stale

### System Info Collecting environment information... PyTorch version: 2.6.0+cu124 Is debug build: False CUDA used to build PyTorch: 12.4 ROCM used to build PyTorch: N/A OS: Debian GNU/Linux 12 (bookworm)...

bug
stale

### 🚀 Describe the new functionality needed add token usage metadata from inference steps to agents when getting turn execution results ### 💡 Why is this needed? What if we...

enhancement