lightseq icon indicating copy to clipboard operation
lightseq copied to clipboard

how to use lightseq inference engine

Open lyzKF opened this issue 2 years ago • 7 comments

hello, I have 1 question about how to use lightseq inference engine

I trained an en2de model based on fairseq, which is a variant of the transformer, i.e. I modified the FFN layer. Can I use the lightseq inference engine to speed up the model? If not, what should I do

looking forward for your reply, thank you!

lyzKF avatar Jun 18 '22 01:06 lyzKF

For now, lightseq inference cannot support structure modification. You can describe your network and we will evaluate if it can be supported.

zjersey avatar Jun 21 '22 11:06 zjersey

@zjersey yep, get it. if i modify it by myself, i should modify the model.proto, encoder.cc.cu, decoder.cc.cu and some kernel function files, then i compile source codes. is that. right?

lyzKF avatar Jun 21 '22 12:06 lyzKF

Yes, but there are a lot of details that can be confusing for third-party developers.

lyzKF

hexisyztem avatar Jun 22 '22 02:06 hexisyztem

@zjersey @hexisyztem yep, get it, thank you.👍👍👍

lyzKF avatar Jun 22 '22 03:06 lyzKF

Compile environment you can refer to this: https://github.com/bytedance/lightseq/blob/master/docker/Tritonserver/Dockerfile.

hexisyztem avatar Jun 22 '22 03:06 hexisyztem

If you have any questions, you can ask me and I will try my best to help. In the future, we will consider providing users with fine-grained operators to facilitate self-assembly of models

hexisyztem avatar Jun 22 '22 03:06 hexisyztem

@hexisyztem thank you👍,i will email you, if i was confusing the details about "lightseq" inference engine.

lyzKF avatar Jun 22 '22 04:06 lyzKF