Jiarui Fang(方佳瑞)

Results: 220 comments by Jiarui Fang(方佳瑞)

The README covers this: when compiling, you should run `cmake .. -DWITH_GPU=ON`. Alternatively, you may have downloaded the CPU-only image.
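For reference, a minimal sketch of an out-of-source CMake build with the GPU option enabled. Only the `-DWITH_GPU=ON` flag comes from the comment above; the directory layout and the `make` invocation are assumptions for illustration.

```shell
# From the TurboTransformers source root (path assumed for illustration)
mkdir -p build && cd build

# Enable the GPU build; without this flag you get the CPU-only build
cmake .. -DWITH_GPU=ON

# Compile with all available cores
make -j"$(nproc)"
```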

I want to compare Turbo with ONNXRT on ALBERT. However, I found that some PyTorch ops do not support ONNX export.

I have noticed this issue and left a comment on the PyTorch issue.

Accuracy is checked in the unit tests, so the benchmark is only for performance evaluation. The scripts in ./python/example compare the accuracy results.

Please recompile the latest code inside the image: the example code you are using is up to date, but the code inside the image may not be. Also, you are using Turbo as the backend but invoking it the ONNXRT way: https://github.com/Tencent/TurboTransformers/blob/f43f35b792/example/python/bert_example.py#L65

Why not just try the image I provide? Build on top of `FROM thufeifeibear/turbo_transformers_cpu:latest` and compile inside that image. If you want to build everything from scratch yourself, check inside the container which specific conda package fails to install, then change it.
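A minimal Dockerfile sketch of that suggestion. Only the base image name comes from the comment; the clone URL is taken from the repository link above, and the build steps are hypothetical and may need adjusting to your setup.

```dockerfile
# Base image suggested in the comment
FROM thufeifeibear/turbo_transformers_cpu:latest

# Hypothetical build steps; adjust paths and flags to your environment
RUN git clone https://github.com/Tencent/TurboTransformers.git /workspace/TurboTransformers
WORKDIR /workspace/TurboTransformers
RUN mkdir -p build && cd build && cmake .. && make -j"$(nproc)"
```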

Maybe `AutoModel` is a new interface. Is `RobertaForSequenceClassification` OK for you?

Turbo is not ready for transformers 3.4.0. You can maintain a local Turbo version yourself.

Hey, we noticed that Transformers are becoming popular in CV. We currently have no plan to add new models. However, we believe it is easy to adopt in...

Motivation: we use FBGEMM in order to keep accuracy consistent with PyTorch dynamic quantization. Since TurboTransformers' optimizations focus on non-GEMM operations, we can reuse PyTorch QLinear code as much...