TensorRT
Llama distributed example
This PR adds:
- A Llama3 end-to-end example with complex graph lowering
- Removal of hardcoded components from the rotary embedding example
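
For context, here is a minimal sketch of a rotary embedding in which the head dimension, sequence length, and base frequency are parameters rather than hardcoded values. It is not the code from this PR: it uses NumPy instead of the PR's framework, and the function names `precompute_freqs_cis` and `apply_rotary_emb`, along with the default `theta=10000.0`, are assumptions chosen for illustration.

```python
import numpy as np


def precompute_freqs_cis(head_dim: int, seq_len: int, theta: float = 10000.0) -> np.ndarray:
    """Return unit complex rotations of shape (seq_len, head_dim // 2)."""
    # One rotation frequency per pair of channels.
    freqs = 1.0 / (theta ** (np.arange(0, head_dim, 2, dtype=np.float64) / head_dim))
    positions = np.arange(seq_len, dtype=np.float64)
    angles = np.outer(positions, freqs)
    return np.exp(1j * angles)


def apply_rotary_emb(x: np.ndarray, freqs_cis: np.ndarray) -> np.ndarray:
    """Rotate x of shape (batch, seq_len, n_heads, head_dim) by position."""
    batch, seq_len, n_heads, head_dim = x.shape
    # View adjacent channel pairs as complex numbers.
    pairs = x.reshape(batch, seq_len, n_heads, head_dim // 2, 2)
    x_complex = pairs[..., 0] + 1j * pairs[..., 1]
    # Broadcast the rotations over the batch and head axes.
    rotated = x_complex * freqs_cis[None, :, None, :]
    # Convert back to interleaved real channels.
    out = np.stack([rotated.real, rotated.imag], axis=-1)
    return out.reshape(batch, seq_len, n_heads, head_dim)


if __name__ == "__main__":
    x = np.random.default_rng(0).standard_normal((2, 5, 3, 8))
    freqs_cis = precompute_freqs_cis(head_dim=8, seq_len=5)
    y = apply_rotary_emb(x, freqs_cis)
    print(y.shape)
```

Because every shape is derived from the inputs, the same functions work for any even head dimension and sequence length. The rotation is a multiplication by unit complex numbers, so it preserves the norm of each head vector and leaves position 0 unchanged.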