xFasterTransformer
Add new engine for MoE using Expert&Tokens parallelism