Ali Ladjevardi comments

Results 5 comments of


                                            Ali Ladjevardi

[BOUNTY - $500] Add support for quantized models with tinygrad

Hi, I started working on this, my WIP PR is [#630 ](https://github.com/exo-explore/exo/pull/630). As @varshith15 pointed out, naively dequantizing weights before each forward is not performant, however this is not an...

FSDP in (New!) Tinygrad [WIP]

I've found the Issue with sharding both model and data. it's in Embedding layer, it boils down to: ```python from tinygrad import Tensor, nn B, T = 4, 64 vocab_size...

FSDP in (New!) Tinygrad [WIP]

@chenyuxyz yeah I was waiting for cast_before_view, now it's merged and I'm continuing work, will update soon. Update 1: - Embedding still has OOM issue, copy is still stuck after...

FSDP in (New!) Tinygrad [WIP]

COPY after expand in Multi-GPU only happens in 1 case: ALU on two MULTI with different axis, where the source that ends up being copied is a result of an...

Tinygrad Quantization Support [WIP]

Also, I'm doing math in float32 right now, which adds overhead. when I change it to float16, I think something overflows and model outputs nothing. I will fix this.