llama-stack
llama-stack copied to clipboard
completion() for tgi
test plan:
PROVIDER_ID=test-tgi PROVIDER_CONFIG=tests/inference/provider_config_example.yaml MODEL_IDS="Llama3.2-1B-Instruct" pytest -s tests/inference/test_inference.py --tb=short --disable-warnings -rs