llama-stack Use inference APIs for executing Llama Guard

Use inference APIs for executing Llama Guard

Open ashwinb opened this issue 1 year ago • 0 comments

We should use Inference APIs to execute Llama Guard instead of directly needing to use HuggingFace APIs. The actual inference consideration is handled by Inference.

Sep 26 '24 21:09 ashwinb

llama-stack llama-stack copied to clipboard

Use inference APIs for executing Llama Guard

llama-stack
llama-stack copied to clipboard