gpt-fast
gpt-fast copied to clipboard

Published 20 hours ago •

Reame
Issues

flex_attention ver.

Open joydddd opened this issue 1 year ago • 2 comments

Implement gpt-fast using flex_attention HOP.

replies on this PR: https://github.com/pytorch/pytorch/pull/132157

Jul 30 '24 22:07 joydddd