Henry Mao comments

Results: 57 comments


Henry Mao

SelfAttention layer seems to have large error relative to nn.MultiheadAttention?

Also, a quick note: @jueseph Replicating your notebook, I got the following MSE: ![Screenshot from 2020-12-16 18-10-47](https://user-images.githubusercontent.com/1828968/102434662-11d44600-3fca-11eb-9fe7-cfdc93e3a6d8.png) https://pytorch.org/docs/stable/generated/torch.nn.MultiheadAttention.html But PyTorch's MHA requires the input tensor to have shape [sequence, batch, features],...
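For reference, a minimal sketch of the layout point above. It assumes `nn.MultiheadAttention` with its default settings, which expects [sequence, batch, features]; the tensor sizes and variable names are arbitrary choices for illustration and are not taken from the notebook. It only shows how passing a batch-first tensor without transposing changes the output. It does not reproduce the Performer comparison.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Illustrative sizes only (assumptions, not values from the notebook).
batch, seq_len, dim, heads = 2, 5, 16, 4

# Default layout for nn.MultiheadAttention is [sequence, batch, features].
mha = nn.MultiheadAttention(embed_dim=dim, num_heads=heads)
mha.eval()

# A batch-first tensor, as many attention layers produce it.
x = torch.randn(batch, seq_len, dim)

with torch.no_grad():
    # Correct usage: move the sequence axis to the front, then move it back.
    x_sbf = x.transpose(0, 1)               # [sequence, batch, features]
    out_sbf, _ = mha(x_sbf, x_sbf, x_sbf)
    out_correct = out_sbf.transpose(0, 1)   # back to [batch, sequence, features]

    # Incorrect usage: axis 0 (the batch) is silently treated as the sequence.
    out_wrong, _ = mha(x, x, x)

# Both outputs have the same shape, so the mistake raises no error;
# it only shows up as a nonzero difference.
mse = F.mse_loss(out_wrong, out_correct).item()
print(f"MSE between transposed and untransposed usage: {mse:.6f}")
```

Recent PyTorch releases also accept `batch_first=True` in the constructor, which avoids the manual transposes.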

SelfAttention layer seems to have large error relative to nn.MultiheadAttention?

@lucidrains Have you tested your FastAttention implementation against the JAX implementation (perhaps with a unit test, e.g. on the same input tensor), like the one I've written here for my incomplete implementation? https://github.com/calclavia/Performer-Pytorch/blob/main/test.py#L11

SelfAttention layer seems to have large error relative to nn.MultiheadAttention?

@jueseph I tried running your notebook test on a different implementation: https://github.com/r0mainK/outperformer/blob/main/src/performer.py ![Screenshot from 2020-12-17 00-47-49](https://user-images.githubusercontent.com/1828968/102464686-85457a00-4001-11eb-901c-34d770acd379.png) ![Screenshot from 2020-12-17 01-09-09](https://user-images.githubusercontent.com/1828968/102466904-7ca27300-4004-11eb-8305-94b919ead5da.png) The result is similar, which implies that this could be correct behavior. The...

Henry Mao

SelfAttention layer seems to have large error relative to nn.MultiheadAttention?

Combine resizing with other options like webp

StreamInterceptor type doesn't support Promise intercept

Rust WebAssembly module in an ES module wrapper from wasm-pack fails to load in Next.js

Firebase Emulator simply does not initialize .env || .env.local || .env.default