Amit Moryossef (279 comments)

The problem is with small images. Here's a `vit-pytorch`-only implementation:

```py
import time
import statistics
import torch
from vit_pytorch.vit import ViT
from vit_pytorch.na_vit import NaViT as NaViT_orig
from vit_pytorch.na_vit_nested_tensor...
```
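The script above is truncated, but its imports (`time`, `statistics`) suggest a repeated-timing comparison. A minimal sketch of such a timing harness, with illustrative names not taken from the original script, might look like:

```py
import time
import statistics

def benchmark(fn, warmup=3, iters=20):
    """Time fn() repeatedly and report mean/stdev in milliseconds."""
    for _ in range(warmup):
        fn()  # warmup runs are discarded (caches, lazy init, etc.)
    times_ms = []
    for _ in range(iters):
        t0 = time.perf_counter()
        fn()
        times_ms.append((time.perf_counter() - t0) * 1000)
    return statistics.mean(times_ms), statistics.stdev(times_ms)
```

For GPU models one would also synchronize (e.g. `torch.cuda.synchronize()`) before reading the clock, since CUDA kernels launch asynchronously.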

Now that #353 is merged, here's a better benchmark script with more images (512 variable-width images, 16px tall, 32-80px wide):

```py
import time
import statistics
import random
import torch
from...
```
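The script is cut off, but generating the described input set (512 variable-width images, 16px tall, 32-80px wide) can be sketched as follows; channel count and random seed are assumptions, not from the original:

```py
import random
import torch

random.seed(0)  # assumed seed for reproducibility

# 512 RGB images, all 16px tall, widths drawn uniformly from 32..80
images = [
    torch.randn(3, 16, random.randrange(32, 81))
    for _ in range(512)
]
```

A padded-ViT baseline would then pad every image to the maximum width in the batch, while NaViT packs the variable-width patches directly.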

With #354 merged:

```
pip install "vit-pytorch==1.16.3"
```

| Model        | Time per batch | vs ViT        |
|--------------|----------------|---------------|
| ViT (padded) | 6.1±0.1ms      | 1x (baseline) |

NaViT...

I agree; we should at the very least update support to Python 3.12.

@AI-Guru I have the same problem. Did you manage to convert lakh dataset to tfrecord? If so, could you please share how?

Thanks @mhoangvslev; however, my attention mask is `[batch_size, 1, max_seqlen, max_seqlen]`.
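A `[batch_size, 1, max_seqlen, max_seqlen]` mask of this kind is typically built from per-sequence lengths, with the singleton dimension broadcasting across attention heads. A hedged sketch (the helper name and construction are illustrative, not from the thread):

```py
import torch

def make_4d_padding_mask(lengths, max_len):
    """Build a boolean (B, 1, L, L) mask from valid sequence lengths.

    True means the (query, key) pair may attend; the size-1 dim
    broadcasts over the head dimension.
    """
    # (B, L): True where the position index is within the valid length
    valid = torch.arange(max_len)[None, :] < lengths[:, None]
    # (B, L, L): a pair is valid only if both positions are valid
    pair = valid[:, None, :] & valid[:, :, None]
    return pair[:, None, :, :]  # (B, 1, L, L)
```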

Hi @tridao - any progress on this? Sorry, I am not technical enough to understand all the low-level details here...

I had Claude run a benchmark. `flex_attention` works with 4D masks. Everything ran on an NVIDIA DGX Spark.

● Results: Batch=128, Seq=128, Realistic Masks

| Implementation | pythia-14m 2D... |
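The benchmark itself is truncated, but the underlying point (attention with a 4D `[batch, 1, seq, seq]` mask) can be illustrated with PyTorch's built-in `scaled_dot_product_attention`, which broadcasts such masks across heads. This is a generic sketch, not the `flex_attention` benchmark code, and the shapes are made up:

```py
import torch
import torch.nn.functional as F

# Assumed shapes: (batch, heads, seq_len, head_dim)
B, H, L, D = 2, 4, 8, 16
q = torch.randn(B, H, L, D)
k = torch.randn(B, H, L, D)
v = torch.randn(B, H, L, D)

# Boolean 4D mask of shape (B, 1, L, L); the size-1 dim broadcasts
# over heads. All-True here so no row is fully masked (which would
# produce NaNs in softmax).
mask = torch.ones(B, 1, L, L, dtype=torch.bool)

out = F.scaled_dot_product_attention(q, k, v, attn_mask=mask)
```

`flex_attention` expresses the same masking via a `mask_mod`/block-mask callback rather than a dense tensor, which is what makes it attractive at longer sequence lengths.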