duke
Results
2
comments of
duke
this picture is for training. in inference, MTP draft worker takes same input and previous hidden state, then outputs draft token
> > this picture is for training. in inference, MTP draft worker takes same input and previous hidden state, then outputs draft token > > Thanks for the reply, that...