
Control Extra Params (use Adapter 16x reduction size as control)

Open yongzx opened this issue 3 years ago • 0 comments

The following parameter counts are for Bloom-1.3B with embedding-and-MADX-adapters (replace strategy), using the default adapter bottleneck reduction size of 16.

Total frozen parameters: 1,208,602,624
Total trainable parameters: 24,979,456
Total emb parameters: 20,488,192
Total MAD-X adapter parameters: 4,491,264
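
As a quick sanity check on these numbers, here is a minimal sketch in plain Python. The four counts are copied from above; the variable names are only illustrative, and treating the frozen and trainable sets as disjoint and together covering the whole model is an assumption rather than something stated in this issue.

```python
# Counts copied from the issue (Bloom-1.3B, reduction size 16).
frozen = 1_208_602_624
trainable = 24_979_456
emb = 20_488_192
madx_adapter = 4_491_264

# The trainable parameters should be exactly the embedding
# parameters plus the MAD-X adapter parameters.
assert emb + madx_adapter == trainable

# Assumption: frozen and trainable parameters do not overlap,
# so their sum is the full parameter count of the adapted model.
total = frozen + trainable
trainable_fraction = trainable / total
adapter_share = madx_adapter / trainable

print(f"total parameters:           {total:,}")
print(f"trainable fraction:         {trainable_fraction:.2%}")
print(f"adapter share of trainable: {adapter_share:.2%}")
```

If the 16x setup is used as the control, the `trainable` value here would be the extra-parameter budget that other configurations are matched against.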

yongzx, Jun 26 '22 13:06