Wentao Jiang comments

Results 21 comments of


                                            Wentao Jiang

CUDA out of memory

> Replace all `64` with `32` of `Generator.__init__` ( in net.py ), the max gpu memory usage is around 7500MiB, it goes well with my GTX 1070 That's a good...

Error occurred when i run demo.py

you can refer to #37

Pre-trained models

> BTW, how to choose the best G model from snapshots? Through qualitative evaluation.

python gradio/app.py运行报错，提示只能用A100 GPU

Change the dtype in inference/sample.py to fp16 could work `dtype = "fp16" # use fp16 instead of bf16`

After finetune the model, inference still get noise.

> @wtjiang98 genius Still don't know how to solve the problem. Should we modify the inference code?

After finetune the model, inference still get noise.

> > For fine-tuning, we offer the following suggestions: > > > > 1. Reduce the learning rate. We recommend a learning rate of 1e-5 to 1e-6 for fine-tuning. >...

After finetune the model, inference still get noise.

> For fine-tuning, we offer the following suggestions: > > 1. Reduce the learning rate. We recommend a learning rate of 1e-5 to 1e-6 for fine-tuning. > 2. If there...

After finetune the model, inference still get noise.

> > > 对于微调，我们提出以下建议： > > > > > > 1. 降低学习率。我们建议使用 1e-5 到 1e-6 的学习率进行微调。 > > > 2. 如果添加了其他模块，请加载预先训练的权重并使用零初始化进行推理。这将验证初始化或代码是否正确。 > > > 3. 时刻关注loss曲线，如果出现loss的尖峰，那么很有可能模型崩溃了，从最近的checkpoint恢复训练。如果训练过程频繁崩溃，那么可以考虑增加batch size或者继续降低学习率。 > > >...

After finetune the model, inference still get noise.

> > > > For training from scratch, this is normal. The point of confusion for me is that if you guys are zero-init from pre-training weights, then the results...

After finetune the model, inference still get noise.

> > > > > > For training from scratch, this is normal. The point of confusion for me is that if you guys are zero-init from pre-training weights, then...