content-style-disentangled-ST icon indicating copy to clipboard operation
content-style-disentangled-ST copied to clipboard

Style not learned

Open ZilinIB opened this issue 5 years ago • 11 comments

Hello,

I've been trying to train the model with the default parameters that are provided in the script. But the results in the whole run are not quite as expected(up until 15k steps), the styles are not learned, as is showed below.

Do you have any suggestions or ideas about the potential cause of that? step_156600

ZilinIB avatar May 06 '20 21:05 ZilinIB

Hi,

Thanks for the interest! 15k steps is indeed to few. As stated in launch_multistep_training.sh:

...
declare -a IMAGE_SIZES=(128 256 384 512 640 768)
declare -a TOTAL_STEPS=(50000 75000 100000 125000 150000 1000000)
...

for ((i=0;i<${#IMAGE_SIZES[@]};++i))
do
    CUDA_VISIBLE_DEVICES=$DEV_NMBR python main.py --model_name=${MODEL_NAME} \
        --total_steps=${TOTAL_STEPS[i]} \
        ...

done

you should train several rounds for 50000, 75000 ,100000 ,125000, 150000 ,1000000 iterations. The last round could be finished earlier though.

Best, Artsiom

asanakoy avatar May 07 '20 00:05 asanakoy

Hi Artsiom!

Sorry, I've made a typo, the result I've got is from 150k steps instead of 15k step. I used exactly the script you attached and the last iteration(1m steps) has already started. Is that still normal though?

Best, Zilin

ZilinIB avatar May 07 '20 08:05 ZilinIB

@ZilinIB I also met this problem, seems not learn the style at all with enough steps. Have you solved this?

CrossLee1 avatar May 14 '20 11:05 CrossLee1

@ZilinIB any results after the last round (with 1M iteration) ended?

asanakoy avatar May 14 '20 11:05 asanakoy

@CrossLee1 No I'm still stuck there :(

@asanakoy I didn't continue to train the model after the round with 150k iteration since the losses are not getting optimized for a long time already.

ZilinIB avatar May 14 '20 12:05 ZilinIB

@asanakoy This is the result of step_150600. Is it in line with expectations?

image

CrossLee1 avatar May 15 '20 05:05 CrossLee1

@CrossLee1 you mean 150rk step from the very beginning? @dimakot55 could you pelase take a look?

asanakoy avatar May 15 '20 09:05 asanakoy

@CrossLee1 you mean 150rk step from the very beginning?

Yes, 150k step from the begining.

CrossLee1 avatar May 15 '20 09:05 CrossLee1

I had a similar turnout after 500,000 steps, showing very little to no improvement from the 150000 step example.

johnsutor avatar May 15 '20 19:05 johnsutor

Guys, try changing hyperparameters a bit if it doesn't work. @dimakot55 is busy with NeurIPS submission right now and will be able to answer you after the deadline.

asanakoy avatar May 19 '20 17:05 asanakoy

Hello, I also met this problem, seems not learn the style at all. Have you solved this?

int-fanle avatar Apr 24 '21 10:04 int-fanle