Little-GPT
Little-GPT copied to clipboard
GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!
GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more!