ao
ao copied to clipboard
[WIP] Apply SuperBlock to GPT-Fast
Still work in progress. To run:
cd torchao/_models/llama
python generate.py --checkpoint_path ${CHECKPOINT_PATH}/model.pth --superblock