Samip Dahal

1 issue by Samip Dahal

Hi, I was trying out the compression library for ZeroQuant quantization (for the GPT-J model). While I was able to compress the model, I didn't see any throughput or latency gain from the...