efficient-kan
Can KAN replace the MLP in a transformer?
Hello author, I would like to know whether this efficient KAN implementation can replace the MLP module in a transformer. What would the advantages and disadvantages be?
https://github.com/akaashdash/kansformers
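For anyone who wants to try the swap themselves, here is a minimal sketch of a transformer block whose feed-forward sublayer is pluggable. It is not taken from the linked repository. `Block`, `Flatten2D`, and `mlp` are hypothetical names made up for illustration, and the commented-out KAN line assumes that `efficient_kan` exposes `KAN(layers_hidden)` as in this repo's README. Only the plain MLP path is actually run here.

```python
import torch
import torch.nn as nn


class Block(nn.Module):
    """Pre-norm transformer block with a pluggable feed-forward sublayer (hypothetical)."""

    def __init__(self, d_model, n_heads, feed_forward):
        super().__init__()
        self.norm1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm2 = nn.LayerNorm(d_model)
        self.ff = feed_forward  # any module mapping (..., d_model) -> (..., d_model)

    def forward(self, x):
        h = self.norm1(x)
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        x = x + attn_out                    # residual around attention
        return x + self.ff(self.norm2(x))   # residual around feed-forward


class Flatten2D(nn.Module):
    """Hypothetical adapter: folds (batch, seq, d) into (batch*seq, d) for
    layers that only accept 2-D input, then restores the sequence axis."""

    def __init__(self, inner):
        super().__init__()
        self.inner = inner

    def forward(self, x):
        b, t, d = x.shape
        return self.inner(x.reshape(b * t, d)).reshape(b, t, -1)


def mlp(d_model, hidden):
    """The usual two-layer feed-forward network."""
    return nn.Sequential(
        nn.Linear(d_model, hidden), nn.GELU(), nn.Linear(hidden, d_model)
    )


d_model, n_heads, hidden = 32, 4, 128
block = Block(d_model, n_heads, mlp(d_model, hidden))

# Assumed usage of this repo's package (not executed here):
# from efficient_kan import KAN
# block = Block(d_model, n_heads, Flatten2D(KAN([d_model, hidden, d_model])))

x = torch.randn(2, 10, d_model)
out = block(x)  # same shape as x
```

One trade-off to keep in mind: every input-output connection in a KAN layer carries several spline coefficients rather than a single weight, so at the same hidden width the KAN variant has more parameters than the MLP it replaces.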