transformer-deploy
transformer-deploy copied to clipboard
Llama support
Would it be possible to run llama using this? Is the gpt2 example hackable to run llama on tensorrt?
Would it be possible to run llama using this? Is the gpt2 example hackable to run llama on tensorrt?