Henry Mao

Results 57 comments of Henry Mao

Also, a quick note: @jueseph Replicating your notebook, I got MSE error as such: ![Screenshot from 2020-12-16 18-10-47](https://user-images.githubusercontent.com/1828968/102434662-11d44600-3fca-11eb-9fe7-cfdc93e3a6d8.png) https://pytorch.org/docs/stable/generated/torch.nn.MultiheadAttention.html But pytorch's MHA requires tensor to be of [sequence, batch, features],...

@lucidrains Have you tested your FastAttention implementation against JAX implementation (perhaps a unit test e.g. same input tensor), like the one I've written here for my incomplete implementation: https://github.com/calclavia/Performer-Pytorch/blob/main/test.py#L11

@jueseph Tried running your notebook test from a different implementation https://github.com/r0mainK/outperformer/blob/main/src/performer.py ![Screenshot from 2020-12-17 00-47-49](https://user-images.githubusercontent.com/1828968/102464686-85457a00-4001-11eb-901c-34d770acd379.png) ![Screenshot from 2020-12-17 01-09-09](https://user-images.githubusercontent.com/1828968/102466904-7ca27300-4004-11eb-8305-94b919ead5da.png) Similar result. Which implies that this could be correct behavior. The...

Would be great if interceptors could be async

> I was able to get a WASM module compiled from Rust with wasm-pack working both locally and on Vercel, except in a non-edge API route on Vercel. > >...

Seems related to this issue: https://stackoverflow.com/questions/71254427/firebase-functions-dotenv-variable-undefined