byt5
How to convert a ByT5 model to ONNX format?
Hi,
Exporting transformers models to ONNX makes it possible to compress them (for example, through quantization) and to speed up inference on CPU and GPU.
Could anyone share code or a notebook for converting mT5 and ByT5 models to ONNX format?
The fastT5 library handles T5 conversion (great!), but it has not been updated to the latest version of transformers, so it does not yet accept mT5 or ByT5 models.
Thanks.