onnxsim_large_model
Model size is not reduced after simplification
I tried to simplify TinyLlama with the code, but the simplified ONNX file is almost the same size as the non-simplified one. I would appreciate it if you could provide the sizes of the original Llama ONNX model and of the simplified one.
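For anyone comparing numbers, here is a minimal sketch of one way to measure the two sizes. It uses only the Python standard library. The helper names `path_size` and `size_reduction` are hypothetical and are not part of onnxsim. It assumes each model is either a single `.onnx` file or a directory that also holds external weight files, in which case everything in the directory is counted.

```python
import os


def path_size(path):
    """Return the size in bytes of a file, or of all files under a directory."""
    if os.path.isfile(path):
        return os.path.getsize(path)
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            total += os.path.getsize(os.path.join(root, name))
    return total


def size_reduction(original_path, simplified_path):
    """Return (bytes_before, bytes_after, fraction_saved) for the two models."""
    before = path_size(original_path)
    after = path_size(simplified_path)
    # Guard against an empty or missing original to avoid dividing by zero.
    saved = (1.0 - after / before) if before else 0.0
    return before, after, saved
```

Calling `size_reduction` with the paths of the original and simplified models returns both byte counts and the fraction saved. Pointing it at directories rather than single files may matter for large models, where the weights can sit in separate files next to the `.onnx` graph.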