quantLm14
Results
2
comments of
quantLm14
@cuichenx Any update on long seq optimisations? Or for this branch to be merged in main?
> Yes. Git as big of a single gpu as you possibly can. Ram you need 2TB @Qubitium I am trying to quantize on h200. With single gpu ram of...