Gabriele Macchi

Results 2 comments of Gabriele Macchi

In ELECTRA embedding size and hidden size should be the same. For what I know only ALBERT has a matrix factorization of the embedding matrix, thus having a smaller embedding...

run `!pip install tensorflow_text` in a code cell