llama-stack
llama-stack copied to clipboard
Composable building blocks to build Llama Apps
This would help us catch pre-commit issues like the ones fixed in https://github.com/meta-llama/llama-stack/pull/236.
# DevX Flow ##### Step 1. Register Eval Dataset ``` python -m llama_stack.apis.datasets.client ``` ##### Step 2. Run Eval Scorer ``` python -m llama_stack.apis.evals.client ``` - (benchmark) run full preprocess->generation->postprocess->score...
We need to have a capability to add new tools or disable/remove tools sometimes after an agent has been deployed. Similar to current methods with `@webmethod` decorators for agents, could...
Failed to run stack through conda:llama stack run stack-3.2-1B --port 5000 --disable-ipv6 https://github.com/meta-llama/llama-stack/issues/194 ,I don't know why stack needs to link it to the address [: ffff: 0.0.2.208] Failed to...
# Script Usage ``` $ cd scripts/docker/tgi $ ls compose.yaml tgi-run.yaml $ docker compose up ``` **Expected output** - TGI container is spawn up in port 5009 - Llama Stack...
This is not used since we are processing the response, not the request.
I used the official Docker image and downloaded the weight file from Meta. The md5sum test proved that the file was fine, but it still failed to run, which left...
Hello, I encountered some problems when loading the Llama3.2-90B-Vision-Instruct model with FP8. Can you help me take a look? Version of llama_stack and llama_models: ``` llama_models == 0.0.41 llama_stack ==...
Address issue #233 , allowing users to pass a comma-separated list of model IDs with the --model-ids argument and matching meta URLs via --meta-url.
Hi Expert, I just tried to to install llama-stack and run the test with **Llama3.2-1B** but I found the response is really weird. Since my GPU RAM is only 6GB,...