runbooks icon indicating copy to clipboard operation
runbooks copied to clipboard

ONNX for Model saving

Open nstogner opened this issue 2 years ago • 3 comments

ONNX provides an open standard for saving models (portable b/t tensorflow, pytorch, etc.).

Should this come into play in how Models are saved?

https://onnx.ai/

nstogner avatar Jul 04 '23 11:07 nstogner

Perhaps we could provide a way of importing Models from various hubs and all models would be saved using ONNX. This would then allow for decoupling where the Model was pulled from and what kind of libraries the developer might use to work with those models.

https://huggingface.co/docs/transformers/serialization

This would have implications on how Models are saved. Perhaps it makes sense to save the models outside of a container image (i.e. not bundled with code).

nstogner avatar Jul 04 '23 11:07 nstogner

I think this makes a lot of sense especially once we have our own serving layer

samos123 avatar Jul 04 '23 22:07 samos123

+1 - I didn't realize this emerging standard existed. Reading up a bit, it sounds like it makes for a better interop story when deploying.

brandonjbjelland avatar Jul 05 '23 04:07 brandonjbjelland