minGPT
minGPT copied to clipboard
#71 use config n_head instead of hardcoded 4 heads
use config n_head instead of hardcoded 4 heads in model attention block