llama2.c
llama2.c copied to clipboard
fix allocation of scaling factors
allocate only one scaling factor per group
huh. i'm only doing a quick skim atm. did i mess up the sizing of this oops
you allocate dim elements for the quants and also dim elements for the scaling factors. but there are only dim / gs scaling factors :)