ABQ-LLM
ABQ-LLM copied to clipboard
How to reproduce the results of end-to-end throughput experiment?
Thanks for your great work!
I want to know how to reproduce the results of end-to-end throughput experiments? That is e2e_speed. png. Can you provide the complete code integrated into FastTransformer?
Thanks!