transformer-deploy
transformer-deploy copied to clipboard
Llama support
Would it be possible to run llama using this? Is the gpt2 example hackable to run llama on tensorrt?
My question also^ - very much interested in trying to get LLMs like Llama to work on Triton CC @pommedeterresautee
Also very interested