2K2K icon indicating copy to clipboard operation
2K2K copied to clipboard

Training models at 512 or 1024 resolutions

Open xuling00 opened this issue 1 year ago • 2 comments

Hello Thank you for your research! I have some questions about training:

  1. I see your code is mainly for res2048 training, then for res512 and res1024 picture , do you need additional training? You mentioned in your article that training res2048 takes 3 days. Will Training res512 take less time?
  2. If I want to train res512, which part of the training code should I change? To the code of the human body parts cut parts, do I need to change because of the res512?

Image

xuling00 avatar Mar 21 '25 08:03 xuling00

  1. Yes. In the case of res512, the training time is greatly reduced. If I remember correctly, I remember that I got good results by using about 1 day for training in stage 2.
  2. You can reduce the resolution of the cut part of the picture by 1/4; You don't have to reduce it by exactly 1/4. But since there is a part that down-samples the network, you can set an appropriate value by adjusting it to a multiple of 16 or 32.

SangHunHan92 avatar Mar 21 '25 08:03 SangHunHan92

  1. Yes. In the case of res512, the training time is greatly reduced. If I remember correctly, I remember that I got good results by using about 1 day for training in stage 2.
  2. You can reduce the resolution of the cut part of the picture by 1/4; You don't have to reduce it by exactly 1/4. But since there is a part that down-samples the network, you can set an appropriate value by adjusting it to a multiple of 16 or 32.

Thank you very much for your answer. I will try it!

xuling00 avatar Mar 24 '25 05:03 xuling00