FasterTransformer icon indicating copy to clipboard operation
FasterTransformer copied to clipboard

gptneox & gptj int8 quantization & share context

Open rahuan opened this issue 1 year ago • 0 comments

support gptneox int8 & share context gptj int8 - will throw exception when running in int8 mode, will need to be fixed

rahuan avatar Jun 07 '23 08:06 rahuan