duke

Results 2 comments of duke

this picture is for training. in inference, MTP draft worker takes same input and previous hidden state, then outputs draft token

> > this picture is for training. in inference, MTP draft worker takes same input and previous hidden state, then outputs draft token > > Thanks for the reply, that...