Gabriele Macchi
Results
2
comments of
Gabriele Macchi
In ELECTRA embedding size and hidden size should be the same. For what I know only ALBERT has a matrix factorization of the embedding matrix, thus having a smaller embedding...
run `!pip install tensorflow_text` in a code cell