Antoine Bergerault

2 issues by Antoine Bergerault

In lab 10, one function is a perfect opportunity to use `np.einsum`, which turns out to be very useful in practice. It makes it...
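The lab function itself is not shown in this preview, so the following is only an illustrative sketch of the kind of computation `np.einsum` expresses compactly. The arrays `A` and `B`, their shapes, and the batched matrix product are assumptions chosen for the example, not the lab's actual code.

```python
import numpy as np

# Hypothetical inputs: two batches of matrices (shapes chosen for illustration).
rng = np.random.default_rng(0)
A = rng.standard_normal((4, 3, 5))  # batch of 4 matrices, each 3x5
B = rng.standard_normal((4, 5, 2))  # batch of 4 matrices, each 5x2

# Batched matrix product: for each batch index b, sum over the shared index k.
C = np.einsum("bik,bkj->bij", A, B)

# The same computation written as an explicit loop, for comparison.
C_loop = np.stack([A[b] @ B[b] for b in range(A.shape[0])])

print(C.shape)
print(np.allclose(C, C_loop))
```

The subscript string names every axis explicitly, so the contraction over `k` is visible at a glance and no manual reshaping or transposing is needed.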

Hello, I followed the blog post https://zenn.dev/selllous/articles/retnet_tutorial shared in #52 in order to train RetNet, and it seems to work well for small models (< 3B). But I am unable...