How to support different system configurations or backend framework versions between the conversion and execution servers
In practice, the system configuration or backend framework versions can differ between the conversion and execution servers. For example, one might convert the model on a CPU machine and run the converted model on a GPU machine. Do we want to suggest a general mechanism to support this scenario?
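As one possible starting point for discussion (a minimal sketch, not a proposed spec): the conversion server could write a small environment manifest next to the converted model, and the execution server could compare it against its own environment before loading. The function names `capture_environment` and `find_mismatches`, the manifest keys, and the framework name and versions in the usage example are all hypothetical.

```python
import json
import platform


def capture_environment(framework_versions, device):
    """Describe the current environment as a JSON-serializable dict.

    `framework_versions` maps framework name -> version string and
    `device` is a free-form label such as "cpu" or "gpu". Both are
    supplied by the caller; this sketch does not detect them.
    """
    return {
        "python": platform.python_version(),
        "os": platform.system(),
        "machine": platform.machine(),
        "device": device,
        "frameworks": dict(framework_versions),
    }


def find_mismatches(conversion_env, execution_env, keys=("device", "frameworks")):
    """Return {key: (conversion_value, execution_value)} for differing keys."""
    mismatches = {}
    for key in keys:
        conv_value = conversion_env.get(key)
        exec_value = execution_env.get(key)
        if conv_value != exec_value:
            mismatches[key] = (conv_value, exec_value)
    return mismatches


# Hypothetical usage: the conversion server serializes the manifest,
# and the execution server compares it with its own environment.
conversion_env = capture_environment({"example_framework": "1.0.0"}, "cpu")
manifest_text = json.dumps(conversion_env)

execution_env = capture_environment({"example_framework": "1.1.0"}, "gpu")
mismatches = find_mismatches(json.loads(manifest_text), execution_env)
for key, (conv_value, exec_value) in mismatches.items():
    print(f"{key}: converted with {conv_value!r}, executing with {exec_value!r}")
```

Whether a mismatch should be treated as a warning or a hard failure is a policy question this sketch leaves open; a CPU-to-GPU difference may be acceptable while a framework version difference may not be.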