llama-stack
Use inference APIs for executing Llama Guard
We should execute Llama Guard through the Inference APIs instead of calling the HuggingFace APIs directly. How the model is actually run then becomes the responsibility of Inference.
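A rough sketch of what this could look like, to make the proposal concrete. Everything here is an assumption for illustration: `InferenceClient`, `chat_completion`, `LlamaGuardShield`, `ShieldResult`, and the `"llama-guard"` model name are hypothetical and are not the actual llama-stack interfaces. The only outside fact relied on is that Llama Guard replies with `safe`, or with `unsafe` followed by a line of category codes.

```python
from dataclasses import dataclass
from typing import Protocol


class InferenceClient(Protocol):
    """Hypothetical stand-in for the Inference API (not the real llama-stack interface)."""

    def chat_completion(self, model: str, messages: list[dict]) -> str:
        ...


@dataclass
class ShieldResult:
    is_safe: bool
    violated_categories: list[str]


def parse_guard_output(raw: str) -> ShieldResult:
    """Parse a Llama Guard style reply: 'safe', or 'unsafe' plus a line of category codes."""
    lines = [ln.strip() for ln in raw.strip().splitlines() if ln.strip()]
    if lines and lines[0].lower() == "safe":
        return ShieldResult(is_safe=True, violated_categories=[])
    if lines and lines[0].lower() == "unsafe":
        codes = lines[1].split(",") if len(lines) > 1 else []
        return ShieldResult(False, [c.strip() for c in codes if c.strip()])
    raise ValueError(f"unexpected Llama Guard output: {raw!r}")


class LlamaGuardShield:
    """Safety check that delegates model execution to an injected inference client."""

    def __init__(self, inference: InferenceClient, model: str = "llama-guard"):
        # The model name is a placeholder; the shield never loads weights itself.
        self.inference = inference
        self.model = model

    def run(self, messages: list[dict]) -> ShieldResult:
        # No HuggingFace calls here: the inference client decides how the model runs.
        raw = self.inference.chat_completion(self.model, messages)
        return parse_guard_output(raw)


class FakeInference:
    """Test double that returns a canned reply and records what it was asked."""

    def __init__(self, reply: str):
        self.reply = reply
        self.calls: list[tuple[str, list[dict]]] = []

    def chat_completion(self, model: str, messages: list[dict]) -> str:
        self.calls.append((model, messages))
        return self.reply
```

With this shape, the shield only builds the request and interprets the reply, so swapping the backend that serves Llama Guard would not require touching the safety code.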