YassineYousfi issues

Repositories
Issues
Comments

Results 8 issues of


YassineYousfi

long_mpc: fix e2e source condition

bugfix

controls

use the ``enabled`` arg in GradScaler

GradScaler has an argument for enabling/disabling the scaler. When disabled, ``scaler.step()`` simply invokes ``optimizer.step()``, and the other methods are no-ops. I thought this made the code a bit cleaner by...

non flash attention: speedup by avoiding ddp broadcasts of causal mask

In the manual implementation of causal self-attention, the causal mask is registered as a buffer, which causes DDP to broadcast it at every step. Excluding it from being broadcasted gives...

output height from the model and track it in calibrationd

enhancement

research

When can we expect the code release?

great work @agrimgupta92! When can we expect the code release? Thanks!

fix input_pos shape in comment

Currently the code only supports bs=1 with input_pos being one dimensional. This fixes input_pos shape in the comments.

CLA Signed

YassineYousfi