curated-transformers
curated-transformers copied to clipboard
Convert QKV projection splitting methods into Torch modules