Abhishek Dubey

Results 1 issues of Abhishek Dubey

Updating the forward function in Transformer block. The change is simple, but still trying my best to explain below: As per original paper: In 'Add & Norm' block of Transformer,...