dilated-attention-pytorch
(Unofficial) Implementation of dilated attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" (https://arxiv.org/abs/2307.02486)
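For context, here is a minimal self-contained sketch of the dilated-attention idea from the LongNet paper: the sequence is split into segments, each segment is sparsified by keeping every r-th position, and ordinary attention runs within each sparsified segment. The function name and arguments below are illustrative only, not this repo's actual API; a full implementation mixes several (segment length, dilation rate) pairs across heads so that every position is covered.

```python
import torch
import torch.nn.functional as F

def dilated_attention(q, k, v, segment_length, dilation_rate):
    """Single-head dilated attention sketch (illustrative, not this repo's API).

    q, k, v: (batch, seq_len, dim). seq_len must be divisible by
    segment_length, and segment_length by dilation_rate.
    """
    b, n, d = q.shape
    s, r = segment_length, dilation_rate
    # Split the sequence into contiguous segments of length s.
    q = q.view(b, n // s, s, d)
    k = k.view(b, n // s, s, d)
    v = v.view(b, n // s, s, d)
    # Dilation: within each segment, keep every r-th row.
    q, k, v = q[:, :, ::r], k[:, :, ::r], v[:, :, ::r]
    # Dense attention inside each sparsified segment
    # (the leading dims act as batch dims).
    out = F.scaled_dot_product_attention(q, k, v)
    # Scatter attended rows back to their original positions; skipped rows
    # stay zero in this sketch, whereas LongNet combines multiple (s, r)
    # pairs so every position is attended by at least one configuration.
    full = torch.zeros(b, n // s, s, d, dtype=out.dtype, device=out.device)
    full[:, :, ::r] = out
    return full.view(b, n, d)

# Example: 8k tokens, segments of 2048, keeping every 4th position.
x = torch.randn(1, 8192, 64)
y = dilated_attention(x, x, x, segment_length=2048, dilation_rate=4)
assert y.shape == x.shape
```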
I got this during the benchmark: ```python # assert len(unknown_axes) == 1, 'this is enforced when recipe is created, so commented out' --> 186 if isinstance(length, int) and isinstance(known_product, int)...
Hi @fkodom, I really like your implementation, and I wanted to use dilated attention in a vanilla transformer model to see how things work. Right now, I am facing a...
Hi! First of all, thanks for your great implementation. I think it is awesome, and I like it a lot. I was wondering if you have also implemented a backward...
Hello Frank! I love what you have created, and I am having a great time reading through and parsing your implementation of the paper. It appears you have nailed the...