llama-stack
Composable building blocks to build Llama Apps
### 🚀 Describe the new functionality needed
Support the "stop" parameter: https://platform.openai.com/docs/api-reference/completions/create#completions-create-stop

### 💡 Why is this needed? What if we don't build it?
Only the vLLM inference provider is supported/tested through https://github.com/meta-llama/llama-stack/pull/1715...
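A minimal sketch of what the requested parameter looks like from a client's point of view, using the OpenAI-compatible completions surface. The base URL, API key, and model name below are placeholders, not the project's actual defaults.

```python
from openai import OpenAI

# Placeholder endpoint/model; point these at your running Llama Stack (or vLLM) server.
client = OpenAI(base_url="http://localhost:8321/v1/openai/v1", api_key="none")

response = client.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",
    prompt="List three colors:\n1.",
    max_tokens=64,
    # "stop" accepts a string or a list of up to 4 strings; generation is cut
    # off before any of these sequences would be emitted.
    stop=["\n\n", "4."],
)
print(response.choices[0].text)
```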
# What does this PR do?
[Provide a short summary of what this PR does and why. Link to relevant issues if applicable.]
[//]: # (If resolving an issue, uncomment...
Goals:
* remove the need for a custom tool to install a collection of Python packages, AKA `llama stack build`
* use the power of `uv`, which was designed to...
### 🚀 Describe the new functionality needed
Develop acceptance criteria for llama-stack (MCP with auth) and scope other requirements to enable MCP servers with both server-side and client-side auth.

###...
# What does this PR do?
This PR contains two sets of notebooks that serve as reference material for developers getting started with Llama Stack using the NVIDIA Provider. Developers...
### 🚀 Describe the new functionality needed

# Overview
The goal for Llama Stack v1 is to enable ISVs and enterprise developers to build AI applications in on-prem and VPC...
# What does this PR do?
Prototype of a new feature to allow new APIs to be plugged into Llama Stack. Opened for early feedback on the approach and test...
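To make "plugging in a new API" concrete, here is a heavily hypothetical sketch: a third-party API surface defined as a Protocol plus a toy provider implementing it. The names (`RerankAPI`, `NaiveRerankProvider`) and the registration idea are illustrative assumptions, not the PR's actual design.

```python
from typing import Protocol, runtime_checkable

# Hypothetical: an example "external API" a third party might want to add to the stack.
@runtime_checkable
class RerankAPI(Protocol):
    async def rerank(self, query: str, documents: list[str]) -> list[int]:
        """Return document indices ordered by relevance to the query."""
        ...


class NaiveRerankProvider:
    """Toy provider implementing the hypothetical API via token-overlap scoring."""

    async def rerank(self, query: str, documents: list[str]) -> list[int]:
        tokens = query.lower().split()
        scores = [sum(tok in doc.lower() for tok in tokens) for doc in documents]
        return sorted(range(len(documents)), key=lambda i: scores[i], reverse=True)


# A distribution could then map the new API name to a provider implementation,
# e.g. {"rerank": NaiveRerankProvider()}, alongside the built-in APIs.
```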
### 🚀 Describe the new functionality needed
The Llama Stack distro server has several stores for state:
1. kvstore - for routing info
2. sqlstore - to store chat completion...
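For readers unfamiliar with why the server carries both kinds of store, here is an illustrative sketch (not the actual llama-stack interfaces) of the distinction: a key-value store answers point lookups such as routing info, while a relational store supports filtering and ordering over records such as chat completions.

```python
import sqlite3
from typing import Optional

# Illustrative only; names and schemas are assumptions for this sketch.
class KVStore:
    """Point lookups by key, e.g. routing info."""

    def __init__(self, path: str = ":memory:"):
        self.conn = sqlite3.connect(path)
        self.conn.execute("CREATE TABLE IF NOT EXISTS kv (key TEXT PRIMARY KEY, value TEXT)")

    def set(self, key: str, value: str) -> None:
        self.conn.execute("INSERT OR REPLACE INTO kv VALUES (?, ?)", (key, value))
        self.conn.commit()

    def get(self, key: str) -> Optional[str]:
        row = self.conn.execute("SELECT value FROM kv WHERE key = ?", (key,)).fetchone()
        return row[0] if row else None


class SQLStore:
    """Relational records that need filtering/ordering, e.g. chat completions."""

    def __init__(self, path: str = ":memory:"):
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS chat_completions "
            "(id TEXT PRIMARY KEY, model TEXT, created INTEGER)"
        )

    def insert(self, completion_id: str, model: str, created: int) -> None:
        self.conn.execute(
            "INSERT OR REPLACE INTO chat_completions VALUES (?, ?, ?)",
            (completion_id, model, created),
        )
        self.conn.commit()

    def list_by_model(self, model: str) -> list[tuple]:
        return self.conn.execute(
            "SELECT id, model, created FROM chat_completions WHERE model = ? ORDER BY created",
            (model,),
        ).fetchall()
```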
### 🚀 Describe the new functionality needed
Add higher-level APIs, e.g. list chat completions.

### 💡 Why is this needed? What if we don't build it?
Enable more efficient and...
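A rough sketch of what such a higher-level call could look like from a client, assuming the server mirrors OpenAI's "list stored chat completions" endpoint (`GET /v1/chat/completions`) with cursor-based pagination. The base URL and response shape are assumptions, not a confirmed Llama Stack route.

```python
from typing import Optional

import requests

BASE_URL = "http://localhost:8321/v1/openai/v1"  # placeholder


def list_chat_completions(limit: int = 20, after: Optional[str] = None) -> dict:
    """List stored chat completions, newest-first pages of `limit` items."""
    params = {"limit": limit}
    if after:
        params["after"] = after  # cursor: id of the last item from the previous page
    resp = requests.get(f"{BASE_URL}/chat/completions", params=params, timeout=30)
    resp.raise_for_status()
    return resp.json()


page = list_chat_completions(limit=10)
for item in page.get("data", []):
    print(item.get("id"), item.get("model"), item.get("created"))
```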
### 🚀 Describe the new functionality needed
Log view for chat completions.

### 💡 Why is this needed? What if we don't build it?
Implement a log view for chat...