Samip Dahal
Results
1
issues of
Samip Dahal
Hi, I was trying out the compression library for ZeroQuant quantization (for GPT-J model). While I was able to compress the model, I didn't see any throughput/latency gain from the...