mesh
mesh copied to clipboard
Mesh TensorFlow: Model Parallelism Made Easier
Hi, I've been trying to deploy an mtf model to the NVIDIA Triton Inference Server by converting the SavedModel (output of model.export()) to an onnx file with no luck. I've...
Hi , Thanks for your great work. I am reading your implementation of switch transformers(MoE), I am wondering if you have released the code for finetuning the MoE models on...
Automated refactoring to make code Python 3 compatible.
Decode Unicode strings in inference mode. This is already done in eval mode. This CL applies the same logic for inference.
Use multiple target objectives for distillation. Also see cl/356382304
Add loss functions for multiple-target objectives for distillation.