lorax
FP6 quantization from DeepSpeed
Feature request
https://github.com/huggingface/text-generation-inference/issues/1633
Motivation
Throughput and latency
Your contribution
@tgaddair what do you think?
Hey @flozi00, looks promising! Do you have bandwidth to open a PR for this?
Yes, probably in the middle or at the end of the week.