Bharat Prabhakar
Yup, facing the same issue. I'm trying to train on a different speech dataset though and the attention plot is blank even after 200k steps. Any help would be appreciated.
I think the issue might be with the TensorFlow version. @Jinex2012 can you tell me which TF version you are using? It's failing for me on v1.8.
So it was weird in my case. I was training with TensorFlow 1.8 (GPU) when I hit this issue. Basically, when I tried printing the attention values that are being...
Yep, getting the same error
```
NotImplementedError: No operator found for this attention: AttentionOpDispatch(dtype=torch.float32, device=device(type='cuda', index=0), k=80, has_dropout=False, attn_bias_type=, kv_len=1024, q_len=1024, kv=80, batch_size=8, num_heads=1, requires_grad=True)
```
when trying to run...
@patil-suraj Can you share which commit you built xformers from?
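Since both questions above come down to "which version are you running?", here is a minimal sketch for collecting that information in one place. It uses only the standard library (`importlib.metadata`, Python 3.8+). The helper name `installed_versions` and the package list are illustrative assumptions, not something from this thread; older TF 1.x GPU installs may be registered under `tensorflow-gpu` instead of `tensorflow`, and a version string alone will not necessarily identify the commit of a source build.

```python
from importlib import metadata


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if absent.

    `installed_versions` is a hypothetical helper written for this example.
    """
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The distribution is not installed under this exact name.
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Assumed package names; adjust to match your environment.
    for name, version in installed_versions(["tensorflow", "torch", "xformers"]).items():
        print(f"{name}: {version or 'not installed'}")
```

Pasting the printed lines alongside the error makes it easier to compare setups across replies.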