ipex-llm
ipex-llm copied to clipboard
starcoder2 optimization results
With the starcoder2-3B the 2nd+ token latency is not well-performed, do you have any ideas about it? Thanks!
Hi @RobinJing,
We recently applied further optimization to starcoder2-3b. Please have a try again with ipex-llm >= 2.1.0b20240405 and let us know for any further problems :)