FasterTransformer TP=2， Loss of accuracy

TP=2， Loss of accuracy

Open coderchem opened this issue 2 years ago • 2 comments

hello,I carried out TP=2,multi-gpu operation on llama model of 7b, and found that the comparison accuracy of the result was lost by 5%. TP=2 as far as I know should not change the accuracy. Why?

Jul 28 '23 03:07 coderchem

hi, can you post reproduce step.

Jul 29 '23 10:07 hurun

FasterTransformer does not support llama officially and FasterTransformer development has transitioned to TensorRT-LLM. TensorRT-LLM has supported LLaMa, please take a try.

Oct 20 '23 07:10 byshiue

FasterTransformer FasterTransformer copied to clipboard

TP=2， Loss of accuracy

FasterTransformer
FasterTransformer copied to clipboard