Greg

Results 1 comments of Greg

I have read articles about Flux and noticed that the paper mentions a ​​`TP+SP` approach in Transformer, not pure `TP`. To confirm: During the ​​decoding phase of the inference stage​​,...