Support for mBART models? / Could we get the decoder's output logits before beam search?

Open leoozy opened this issue 2 years ago • 5 comments

The mBART model in question is the one implemented by Hugging Face.

leoozy avatar Jul 28 '21 04:07 leoozy

That requires modifying the source code.

byshiue avatar Jul 28 '21 04:07 byshiue

@byshiue Thank you for your reply. If I only use the Python interface, can I get the decoder's output logits?

leoozy avatar Jul 28 '21 04:07 leoozy

If you use the decoder op, the output is the result of the transformer block. If you use the decoding op, you need to modify the op and the FT source code.
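
To illustrate the decoder-op route: since that op returns the transformer block output (hidden states) rather than logits, the logits have to be computed outside the op by applying the model's output projection. The following is a minimal sketch in plain Python, not FasterTransformer code. The names `project_to_logits`, `lm_head_weight`, and `bias` are hypothetical, and the sketch assumes the output projection is a single linear layer of shape `[vocab_size][hidden_size]`, as is typical for a Hugging Face `lm_head`.

```python
import math


def project_to_logits(hidden_states, lm_head_weight, bias=None):
    """Map hidden states [seq_len][hidden] to logits [seq_len][vocab].

    lm_head_weight is assumed to be [vocab][hidden] (hypothetical layout).
    """
    logits = []
    for h in hidden_states:
        # Dot product of the hidden vector with each vocabulary row.
        row = [sum(w_i * h_i for w_i, h_i in zip(w, h)) for w in lm_head_weight]
        if bias is not None:
            row = [r + b for r, b in zip(row, bias)]
        logits.append(row)
    return logits


def log_softmax(row):
    """Numerically stable log-softmax over one row of logits."""
    m = max(row)
    lse = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - lse for x in row]


# Toy data: 2 positions, hidden size 2, vocabulary size 3.
hidden_states = [[1.0, 0.0], [0.0, 2.0]]
lm_head_weight = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

logits = project_to_logits(hidden_states, lm_head_weight)
log_probs = [log_softmax(r) for r in logits]
```

In a real setup the hidden states would come from the decoder op and the projection weights from the checkpoint. The log-probabilities computed this way are the per-step scores that a beam search would normally consume.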

byshiue avatar Jul 28 '21 04:07 byshiue

@byshiue Thank you. I will try to implement the mBART model.

leoozy avatar Jul 28 '21 04:07 leoozy

@leoozy Hi, I also want to use the mBART model implemented by Hugging Face. Were you able to implement it for mBART? If so, could you share it with me? Thank you very much.

xiuzhilu avatar Oct 22 '21 09:10 xiuzhilu