ctq

Results: 2 comments by ctq

I also found it challenging to integrate the loglikelihood and generate functions into lm-evaluation-harness. If the author could open-source the relevant parts of lm-evaluation-harness, it would be greatly helpful to...

@Qubitium I also encountered the same problem. I believe that after packing a layer, deleting the corresponding FP32 fake-quantized weights and releasing CPU memory could help when packing large...
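A minimal sketch of the idea in the comment above, assuming the fake-quantized FP32 weights sit in a plain dict keyed by layer name. The names `pack_int4` and `pack_model`, the 4-bit layout, and the `scale` value are all hypothetical and are not taken from the project under discussion. It only illustrates dropping each FP32 copy right after its layer is packed, so peak CPU memory stays near one layer's FP32 weights plus the packed output:

```python
import gc
from array import array


def pack_int4(values, scale):
    """Quantize floats to unsigned 4-bit codes and pack two codes per byte.

    Hypothetical layout: code = round(v / scale) + 8, clamped to [0, 15].
    """
    codes = [max(0, min(15, round(v / scale) + 8)) for v in values]
    if len(codes) % 2:
        codes.append(8)  # pad with the zero-point code
    # Low nibble holds the even-indexed code, high nibble the odd-indexed one.
    return bytes(lo | (hi << 4) for lo, hi in zip(codes[0::2], codes[1::2]))


def pack_model(fp32_layers, scale=0.1):
    """Pack layer by layer, dropping each FP32 copy once it is packed.

    `fp32_layers` is an assumed dict: layer name -> array('f') of
    fake-quantized weights. It is emptied in place.
    """
    packed = {}
    for name in list(fp32_layers):  # copy the keys; the dict shrinks below
        packed[name] = pack_int4(fp32_layers[name], scale)
        del fp32_layers[name]  # remove the dict's reference to the FP32 copy
        gc.collect()           # prompt collection of anything left in cycles
    return packed


layers = {
    "layer0": array("f", [0.0, 0.1, -0.1, 0.7]),
    "layer1": array("f", [0.3, -0.8, 0.2]),
}
packed = pack_model(layers)
print(len(layers), {name: len(blob) for name, blob in packed.items()})
```

The memory is only released if nothing else still references the FP32 weights, so any other holder (a module attribute, a cached state dict) would have to be cleared as well.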