hyperformer
hyperformer copied to clipboard

Published 20 hours ago •

Reame
Issues

What is the strategy for initializing the task_embedding, layer_id_embeddings, and adapters_block_type embeddings?

Open jianghaojun opened this issue 2 years ago • 0 comments

@rabeehk It seems all these embeddings are initialized from a pytorch default gassian normal distribution with N(0, 1).

Sep 26 '22 15:09 jianghaojun