megablocks
Can we change self.blocking in dmoe.py from 128 to 64?
I use megablocks to implement a fine-grained MoE. My ffn_hidden_size is divisible by 64 but not by 128. Can we change self.blocking to 64? Thanks a lot.
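
To make the constraint concrete, here is a minimal sketch of the arithmetic in plain Python. It does not call megablocks. The helper names and the example value 704 are hypothetical, chosen only for illustration, and `BLOCKING` simply mirrors the value of 128 mentioned above. Rounding up is shown only as one possible fallback; whether padding the hidden size is acceptable depends on the model.

```python
# Illustration only: plain Python, not megablocks code.
# BLOCKING mirrors the 128 that this issue says self.blocking is set to.
BLOCKING = 128


def fits_blocking(ffn_hidden_size: int, blocking: int = BLOCKING) -> bool:
    """Return True if ffn_hidden_size is a whole number of blocks."""
    return ffn_hidden_size % blocking == 0


def round_up_to_blocking(ffn_hidden_size: int, blocking: int = BLOCKING) -> int:
    """Return the smallest multiple of `blocking` that is >= ffn_hidden_size."""
    return ((ffn_hidden_size + blocking - 1) // blocking) * blocking


# Hypothetical example size: 704 = 11 * 64, which is not a multiple of 128.
ffn_hidden_size = 704
print(fits_blocking(ffn_hidden_size, 64))     # True
print(fits_blocking(ffn_hidden_size, 128))    # False
print(round_up_to_blocking(ffn_hidden_size))  # 768
```

With a blocking of 64 this size would fit as-is; with 128 it would need to grow to 768 or shrink to 640.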