fast-danksharding
fast-danksharding copied to clipboard
Optimize NTT calls
Currently, the main flow uses ntt_end2end which generates and copies the twiddle factors to the gpu on every run.
This can be optimized by: