llama3
Scaling configurations (Table 4) in the paper "The Llama 3 Herd of Models"
In Table 4 of the paper, the total GPU count of 16,384 does not match the parallelism configuration [8, 16, 16, 4], whose product is only 8 × 16 × 16 × 4 = 8,192. Is this a mistake in the paper?
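For reference, the mismatch can be checked directly by taking the product of the listed parallelism degrees (this is just the arithmetic from the question; the variable names are illustrative):

```python
import math

# Parallelism degrees as listed in Table 4 of "The Llama 3 Herd of Models"
parallelism = [8, 16, 16, 4]

# The total GPU count implied by a parallelism configuration is the
# product of its degrees.
implied_gpus = math.prod(parallelism)
reported_gpus = 16384

print(implied_gpus)                   # 8192
print(reported_gpus // implied_gpus)  # 2 -- off by exactly a factor of two
```

The implied count is 8,192, exactly half of the reported 16,384, which suggests either a typo in one of the degrees or a factor-of-two dimension omitted from the table.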