laekov


If you would like to communicate via an IM tool, you can join our Slack workspace to talk with everyone. The invitation link is in the README.

To run FastMoE with Megatron, you should use Megatron's own entry script, e.g. `pretrain_gpt.py`, with FastMoE's patch applied.

You should use the patch that matches your Megatron version. The key step to enable MoE is adding the `--fmoefy` argument when launching `pretrain_xxx.py`; see the sketch below for what this ends up doing.
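
For reference, when `--fmoefy` is set, the patched script converts the Megatron model with FastMoE's `fmoefy` helper. A minimal sketch of that call (it assumes your FastMoE version exposes `fmoe.megatron.fmoefy` with a `fmoe_num_experts` keyword, as in the README; `model_provider` is a hypothetical stand-in for Megatron's model construction):

```python
from fmoe.megatron import fmoefy

def build_moe_model(model_provider, num_experts_per_worker=4):
    # model_provider is a placeholder for however pretrain_gpt.py builds the
    # Megatron model; fmoefy then replaces its MLP blocks with MoE layers.
    model = model_provider()
    return fmoefy(model, fmoe_num_experts=num_experts_per_worker)
```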

The cause of this problem appears to be that `k` has become 5 by the time execution reaches `naive_gate.py:33`, which is odd. Could you check in Python where this `k` becomes 5? Thanks.

> It's not clear to me what input and output that need to be constrained means

The input and output features have to be of the same length for...
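
In other words, an expert has to map features of length `d_model` back to features of length `d_model`. A minimal, FastMoE-agnostic sketch of a module that satisfies this constraint (the class is illustrative, not part of the FastMoE API):

```python
import torch
import torch.nn as nn

class SameWidthExpert(nn.Module):
    """Illustrative expert: input and output feature lengths are both d_model."""

    def __init__(self, d_model: int, d_hidden: int):
        super().__init__()
        self.fc1 = nn.Linear(d_model, d_hidden)
        self.fc2 = nn.Linear(d_hidden, d_model)  # must project back to d_model

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (num_tokens, d_model) -> (num_tokens, d_model)
        return self.fc2(torch.relu(self.fc1(x)))
```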

> if the result that local_expert_count gets on each card (worldsize) is the same or different

`local_expert_count` differs on each GPU, because it includes the counters of samples in the...
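
To make that concrete, here is a minimal, FastMoE-agnostic sketch of how a per-GPU expert count can be derived from the gate's top-k indices; because each GPU only sees its own tokens, the resulting histogram differs across GPUs (function name and shapes are illustrative):

```python
import torch

def local_expert_count(topk_idx: torch.Tensor, tot_experts: int) -> torch.Tensor:
    """Count how many local samples are routed to each (global) expert.

    topk_idx: (num_local_tokens, top_k) expert indices produced by the gate
    on this GPU only, so the resulting counts differ from GPU to GPU.
    """
    return torch.bincount(topk_idx.flatten(), minlength=tot_experts)

# Example: 3 local tokens, top_k = 2, 4 experts in total.
idx = torch.tensor([[0, 2], [2, 3], [0, 0]])
print(local_expert_count(idx, 4))  # tensor([3, 0, 2, 1])
```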

For using a customized expert module, see #121 as a reference. For customized gates, you can refer to our gate implementations, e.g. [NaiveGate](https://github.com/laekov/fastmoe/blob/master/fmoe/gates/naive_gate.py). You can then feed the class into `FMoE`...
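
As a rough sketch of how the pieces fit together (the constructor arguments of `NaiveGate`, the `gate=` keyword, and the shape handling of `FMoETransformerMLP` below follow my reading of the current code, so please verify against your checkout; a GPU is needed because the scatter/gather kernels are CUDA-only):

```python
import torch
from fmoe import FMoETransformerMLP
from fmoe.gates.naive_gate import NaiveGate


class MyGate(NaiveGate):
    """Illustrative custom gate: currently identical to NaiveGate.
    Put your own routing logic in forward()."""
    pass


# The gate is passed as a class; the MoE layer instantiates it internally.
moe = FMoETransformerMLP(num_expert=4, d_model=512, d_hidden=1024,
                         gate=MyGate, top_k=2).cuda()
x = torch.randn(8, 16, 512, device="cuda")   # (batch, seq, d_model)
y = moe(x)                                    # output keeps the input shape
assert y.shape == x.shape
```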

> but I get an error when I execute stored_models_[i], there is no way to get its value, but it is possible to print its size

`stored_models_` is the output...

Well, thank you very much for reporting this issue and debugging it. I think we should explicitly specify the device of tensors when we allocate them in our library. We will...
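
For context, the fix described above amounts to pinning newly allocated tensors to the device of the incoming data rather than relying on the current default device. A generic PyTorch sketch (not the actual FastMoE code; the helper name is made up):

```python
import torch

def allocate_counters(inp: torch.Tensor, num_expert: int) -> torch.Tensor:
    # Bad: lands on the current default device, which may not match `inp`:
    #   counts = torch.zeros(num_expert, dtype=torch.long)
    # Good: explicitly follow the device of the tensor it will be used with.
    return torch.zeros(num_expert, dtype=torch.long, device=inp.device)
```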

The pruning function is implemented [here](https://github.com/laekov/fastmoe/blob/b477ab5edc5142f0e86f6d35bce9a4361c369a6b/cuda/balancing.cuh#L23).