gpt-fast
                                
                                 gpt-fast copied to clipboard
                                
                                    gpt-fast copied to clipboard
                            
                            
                            
                        flex_attention ver.
Implement gpt-fast using flex_attention HOP.
replies on this PR: https://github.com/pytorch/pytorch/pull/132157