Mitchell Wortsman

Results: 15 issues by Mitchell Wortsman

Used the implementation from https://github.com/lucidrains/lion-pytorch (thanks @lucidrains). Paper: https://arxiv.org/abs/2302.06675
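For context, a minimal usage sketch of the linked `lion-pytorch` package (installable via `pip install lion-pytorch`); the model and hyperparameters here are illustrative, not the ones used in the issue:

```python
import torch
from lion_pytorch import Lion

model = torch.nn.Linear(10, 10)

# The Lion paper suggests a roughly 3-10x smaller learning rate and a
# correspondingly larger weight decay than AdamW.
opt = Lion(model.parameters(), lr=1e-4, weight_decay=1e-2)

loss = model(torch.randn(8, 10)).sum()
loss.backward()
opt.step()
opt.zero_grad()
```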

int8 inference

Hello, I am using `torch.cuda.amp.autocast` with `bfloat16`. I noticed that the xformers `RotaryEmbedding` produces `float32` outputs, which then require casting before being passed to `memory_efficient_attention`. However, this raises the question --...

bug
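A hedged sketch of the dtype mismatch the snippet above describes, assuming xformers' `RotaryEmbedding` from `xformers.components.positional_embedding` (the exact module path may vary across xformers versions) together with `xformers.ops.memory_efficient_attention`:

```python
import torch
from xformers.components.positional_embedding import RotaryEmbedding
from xformers.ops import memory_efficient_attention

rotary = RotaryEmbedding(dim_model=64).cuda()

# (batch, heads, seq, head_dim): RotaryEmbedding rotates over dim -2.
q = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.bfloat16)
k = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.bfloat16)
v = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.bfloat16)

with torch.cuda.amp.autocast(dtype=torch.bfloat16):
    q, k = rotary(q, k)
    # The cached cos/sin tables are float32, so the pointwise rotation
    # promotes q/k to float32 while v stays bfloat16.
    assert q.dtype == torch.float32 and v.dtype == torch.bfloat16

    # memory_efficient_attention requires matching dtypes and a
    # (batch, seq, heads, head_dim) layout, hence the cast + transpose.
    out = memory_efficient_attention(
        q.to(v.dtype).transpose(1, 2),
        k.to(v.dtype).transpose(1, 2),
        v.transpose(1, 2),
    )
```

The explicit `.to(v.dtype)` cast is the workaround the issue alludes to; whether the rotary embedding should instead emit outputs in the autocast dtype is the open question the truncated text raises.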