LuGY

Results 28 comments of LuGY

> ### 🐛 Describe the bug > I met overflow using the official scripts for GPT2. Is that a normal case? > > ``` > cd XXX/ColossalAI/examples/language/gpt > export DATA=/data/scratch/gpt_data/small-gpt-dataset.json...

Hmm, It seems that keyword args in `_forward` of CheckpointModule is not supported.

Oh, I found that this model was defined with `torch.nn`. :x

Hi, have you installed FastFold before using?

Hi, sorry for being late. I am not sure about the environment you run the code and the one you install FastFold is the same one. The error message indicates...

Hi, training of FastFold with DAP is still an experimental feature. Drop `dap_size` and DDP is in use by default. 40GB Memory should be enough.

Hi, apologize for my misunderstanding. Multimer training is not supported yet.

The thing is that neither the multimer model training nor bf16 precision is proven to work as expected. Usually, not the process raises bugs, but some results are not right,...

Thanks for your contribution. Wish you have a nice trip.