Aleksa Gordić
Hey @bitmarkcc! Did you follow the README? You should first run the Python code; it'll generate all the necessary bin/state files before you run the C/CUDA code. If something is not...
Yeah, we're focused on using the exact setup of GPT-2/3 right now. The idea is not to make this as configurable as possible; many things are hardcoded, which goes against...
I won't be able to help you with Modal, but I'll just say that our goal is ultimately to have custom CUDA kernels that outperform cuDNN. So you should, ideally,...