The inference module only works on Linux.
What's your CUDA driver version? The driver may not support these two cards. You could also try a different CUDA version.
If the model was trained with Hugging Face, you can refer to the [inference example](https://github.com/bytedance/lightseq/blob/master/examples/inference/python/README.md) to speed up inference.
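The usual pattern in those examples is: export the Hugging Face checkpoint to a LightSeq model file, then load it with the Python inference API. A minimal sketch of that second step (the model path is a placeholder, and the exact class name `lsi.Transformer` and its arguments should be checked against the linked README):

```python
# Hypothetical sketch of running LightSeq inference on a model exported
# from a HuggingFace checkpoint. Paths and class names are assumptions
# based on the linked examples, not guaranteed API.
import numpy as np

def pad_batch(sequences, pad_id=0):
    """Right-pad variable-length token-id lists into one int32 array,
    the batched input shape the inference API expects."""
    max_len = max(len(s) for s in sequences)
    return np.array(
        [s + [pad_id] * (max_len - len(s)) for s in sequences],
        dtype=np.int32,
    )

def run_lightseq(model_path, batch):
    # Requires a GPU and a model file produced by one of the export
    # scripts under examples/inference/python.
    import lightseq.inference as lsi  # assumed module layout
    model = lsi.Transformer(model_path, batch.shape[0])
    return model.infer(batch)

if __name__ == "__main__":
    batch = pad_batch([[101, 2023, 102], [101, 102]])
    print(batch.shape)  # (2, 3)
```

The export step (HF checkpoint → LightSeq model file) is covered by the per-model export scripts in the same directory.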
You can try downgrading the driver or switching to a different CUDA version.
It's on the roadmap. For now, you can use quantization training and inference by [building from source](https://github.com/bytedance/lightseq/blob/master/docs/inference/build.md) and running the quant examples.
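For intuition, the core idea behind the quant examples is scale-based int8 quantization. This sketch is generic symmetric fake quantization, not LightSeq's actual kernels:

```python
# Generic symmetric int8 quantization sketch -- illustrates the idea
# behind quant training/inference, not LightSeq's implementation.
import numpy as np

def quantize(x, scale):
    """Map real values to int8 with a per-tensor scale."""
    return np.clip(np.round(x / scale), -127, 127).astype(np.int8)

def dequantize(q, scale):
    """Recover approximate real values from int8."""
    return q.astype(np.float32) * scale

if __name__ == "__main__":
    x = np.random.randn(4).astype(np.float32)
    scale = np.abs(x).max() / 127
    err = np.abs(dequantize(quantize(x, scale), scale) - x).max()
    print(bool(err <= scale))  # error bounded by one quantization step
```

Quant training inserts this quantize/dequantize round trip into the forward pass so the weights adapt to the reduced precision; quant inference then runs the matmuls directly in int8.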
For now, LightSeq inference does not support modifying the model structure. You can describe your network and we will evaluate whether it can be supported.
Thanks for the kind words.
We have tested the output consistency of ViT; you can refer to the [inference example](https://github.com/bytedance/lightseq/blob/master/examples/inference/python/test/ls_vit.py) to check that your usage is correct.
Yes, it's Hugging Face's modeling_vit. ViT and BERT have the same structure except for the embedding layer.
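Concretely, where BERT looks up token ids in an embedding table, ViT splits the image into fixed-size patches and linearly projects each flattened patch to the hidden size. A minimal NumPy sketch of that patch-embedding step (dimensions are illustrative, and it omits the class token and position embeddings):

```python
import numpy as np

def patch_embed(image, patch, proj):
    """Split an (H, W, C) image into non-overlapping patch x patch tiles,
    flatten each tile, and project it to the hidden size -- ViT's
    replacement for BERT's token-embedding lookup."""
    h, w, c = image.shape
    assert h % patch == 0 and w % patch == 0
    patches = (
        image.reshape(h // patch, patch, w // patch, patch, c)
        .transpose(0, 2, 1, 3, 4)          # group tiles together
        .reshape(-1, patch * patch * c)    # one flat row per patch
    )
    return patches @ proj  # (num_patches, hidden)

if __name__ == "__main__":
    img = np.random.rand(224, 224, 3)           # ViT-Base style input
    proj = np.random.rand(16 * 16 * 3, 768)     # 16x16 patches -> 768 dims
    print(patch_embed(img, 16, proj).shape)     # (196, 768)
```

After this step the sequence of patch embeddings flows through the same transformer encoder blocks as BERT's token embeddings, which is why the rest of the two models can share one implementation.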
Swin Transformer is not supported for now.