FasterTransformer

Published 20 hours ago


gptneox & gptj int8 quantization & share context

Open rahuan opened this issue 1 year ago • 0 comments

Support gptneox int8 & share context, and gptj int8. Running in int8 mode currently throws an exception, which needs to be fixed.

Jun 07 '23 08:06 rahuan