Wenbing Li
> For my own case, I would love to be able to have my data preprocessing directly available in Triton, such as tokenization. I can do it locally if I...
Check there for a complete GPT-2 model inference example: https://github.com/microsoft/onnxruntime-extensions/blob/main/tutorials/gpt2bs.py
This is too general, and it doesn't look relevant to this repo.
Thanks for reporting, @turneram. Models earlier than opset 7 lack maintenance, and I suppose they will be removed in the future.
@YuhengHuang42, thanks for reporting the issue. Most of these models are valid, but ONNX Runtime doesn't support some old opsets for the sake of its binary size. So it's actually an ONNX Runtime issue....
Check this PR for an end-to-end GPT-2 model: https://github.com/onnx/models/pull/445
What's that? An ONNX-to-TensorRT conversion tool?
This looks like a TensorFlow model. Can you try this tool https://github.com/onnx/tensorflow-onnx to convert it to an ONNX model?
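For a TensorFlow SavedModel, the conversion with tensorflow-onnx (tf2onnx) is typically a one-line command; the paths and opset below are placeholders you would adjust for your own model:

```shell
# Install the converter (pulls in the onnx package as a dependency).
pip install tf2onnx

# Convert a TensorFlow SavedModel directory to an ONNX file.
# --saved-model : path to the SavedModel directory (placeholder here)
# --output      : where to write the converted .onnx model
# --opset       : target ONNX opset; newer opsets need a newer ONNX Runtime
python -m tf2onnx.convert \
    --saved-model ./my_saved_model \
    --output model.onnx \
    --opset 13
```

tf2onnx also accepts frozen graphs (`--graphdef`) and TFLite files (`--tflite`) as input formats; see the repo's README for the full option list.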
I didn't see any pre/post-processing steps for the AlexNet model: https://github.com/onnx/models/tree/master/vision/classification/alexnet . Can you share more information about the pre/post-processing?
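For reference, a minimal sketch of the ImageNet-style pre/post-processing commonly used with AlexNet-class classifiers; the exact mean/std values and input layout are assumptions here, and the model's page in onnx/models is the authoritative source:

```python
import numpy as np

# Commonly used ImageNet normalization constants (an assumption for
# this sketch; check the specific model card for the real values).
IMAGENET_MEAN = np.array([0.485, 0.456, 0.406], dtype=np.float32)
IMAGENET_STD = np.array([0.229, 0.224, 0.225], dtype=np.float32)

def preprocess(image_hwc_uint8):
    """HWC uint8 RGB image (already resized/cropped to 224x224)
    -> NCHW float32 tensor ready to feed to the model."""
    x = image_hwc_uint8.astype(np.float32) / 255.0  # scale to [0, 1]
    x = (x - IMAGENET_MEAN) / IMAGENET_STD          # per-channel normalize
    x = x.transpose(2, 0, 1)                        # HWC -> CHW
    return x[np.newaxis, ...]                       # add batch dim -> NCHW

def postprocess(logits):
    """Softmax over class logits; return (top class id, probability)."""
    e = np.exp(logits - logits.max())               # numerically stable softmax
    probs = e / e.sum()
    top = int(probs.argmax())
    return top, float(probs[top])
```

`preprocess` produces a `(1, 3, 224, 224)` float32 array, which is the input shape most ImageNet classifiers in onnx/models expect.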
Check this model for an example of how to decode the output: https://github.com/onnx/models/tree/master/text/machine_comprehension/gpt2-bs
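To illustrate what "decoding the output" means for a text-generation model: the raw output is per-step logits over a vocabulary, which you map back to text. The toy vocabulary below is a stand-in; a real GPT-2 model uses a BPE tokenizer (e.g. via onnxruntime-extensions), not this:

```python
def greedy_decode(logits_per_step, vocab):
    """Pick the highest-scoring token at each step and join the text pieces.

    logits_per_step: list of per-step score lists, one entry per vocab token
    vocab: toy id -> text-piece table (a real model uses a BPE tokenizer)
    """
    pieces = []
    for logits in logits_per_step:
        token_id = max(range(len(logits)), key=lambda i: logits[i])
        pieces.append(vocab[token_id])
    return "".join(pieces)

# Toy example: 3 decoding steps over a 4-token vocabulary.
vocab = ["<pad>", "Hel", "lo", "!"]
logits = [
    [0.1, 0.9, 0.0, 0.0],  # step 1 -> "Hel"
    [0.0, 0.2, 0.7, 0.1],  # step 2 -> "lo"
    [0.0, 0.1, 0.1, 0.8],  # step 3 -> "!"
]
print(greedy_decode(logits, vocab))  # -> Hello!
```

The gpt2-bs model linked above builds beam search into the graph itself, so its output is already token ids for the best hypothesis; this sketch only shows the final ids-to-text step in its simplest (greedy) form.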