DCC
epsilon / np.sqrt(embedding_dim)
I think this formula may be wrong. Should it be np.sqrt(epsilon / embedding_dim) instead?
Why do you say so?
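For anyone comparing the two expressions: they are not equivalent in general, so the choice does matter. Below is a quick numerical sketch. The function names and the values of epsilon and embedding_dim are illustrative assumptions, not taken from the DCC code, and the snippet does not settle which form is the intended one.

```python
import numpy as np

def scale_original(epsilon, embedding_dim):
    # Expression as currently written in the code.
    return epsilon / np.sqrt(embedding_dim)

def scale_proposed(epsilon, embedding_dim):
    # Expression suggested in the comment above.
    return np.sqrt(epsilon / embedding_dim)

# Illustrative values only (assumed for this comparison).
epsilon, embedding_dim = 1e-2, 10

a = scale_original(epsilon, embedding_dim)
b = scale_proposed(epsilon, embedding_dim)

# Algebraically, b / a == 1 / sqrt(epsilon), independent of embedding_dim.
ratio = b / a
print(a, b, ratio)
```

The two forms agree only when epsilon is 1 (or 0). For small epsilon the proposed form is larger by a factor of 1 / sqrt(epsilon), so it helps to say whether epsilon is meant as a distance or a squared distance.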