fastertransformer_backend
Error when running end_to_end_test_llama.py
Running python3 tools/end_to_end_test_llama.py fails with the following error: [400] HTTP end point doesn't support models with decoupled transaction policy
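For context, Triton serves decoupled models only over gRPC streaming, which is why an HTTP client gets this 400 response. The following is a minimal sketch, not taken from the test script, of how a client might detect the decoupled policy and switch to the gRPC streaming calls in tritonclient. The helper names (is_decoupled, choose_protocol, stream_infer) are hypothetical, and the config dict layout is assumed to mirror the model_transaction_policy block in config.pbtxt.

```python
import queue


def is_decoupled(model_config: dict) -> bool:
    """Return True if the config dict enables the decoupled transaction policy.

    Assumption: the dict mirrors config.pbtxt, i.e.
    {"model_transaction_policy": {"decoupled": True}}.
    """
    policy = model_config.get("model_transaction_policy", {})
    return bool(policy.get("decoupled", False))


def choose_protocol(model_config: dict) -> str:
    """Pick "grpc" for decoupled models, "http" otherwise."""
    return "grpc" if is_decoupled(model_config) else "http"


def stream_infer(url: str, model_name: str, inputs: list) -> list:
    """Send one request over a gRPC stream and return the responses received.

    url, model_name, and inputs are placeholders supplied by the caller;
    inputs is a list of tritonclient.grpc.InferInput objects.
    """
    # Imported lazily so the helpers above work without tritonclient installed.
    import tritonclient.grpc as grpcclient

    results = queue.Queue()
    client = grpcclient.InferenceServerClient(url=url)
    # The callback receives (result, error) for every streamed response.
    client.start_stream(callback=lambda result, error: results.put((result, error)))
    client.async_stream_infer(model_name=model_name, inputs=inputs)
    # Close the stream; assumed here to wait for outstanding responses.
    client.stop_stream()

    responses = []
    while not results.empty():
        result, error = results.get()
        if error is not None:
            raise error
        responses.append(result)
    return responses
```

If the served model's config reports the decoupled policy, choose_protocol returns "grpc", and the request would go through stream_infer rather than an HTTP client.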