Results: 42 comments of Li Hui
> Does FlashInfer have better performance? Because the radix cache is disabled in the current version, performance is reduced for general inputs and outputs. FlashInfer is effective for long...
> [@lambert0312](https://github.com/lambert0312) [@ProphetPeng](https://github.com/ProphetPeng) Please try pulling the latest main branch; `--enable-flashinfer-mla` and the radix cache can now be used together. @Fridge003 I've verified it, and there is no problem.