darts I can't reproduce the architecture search process following the configuration in paper.

I can't reproduce the architecture search process following the configuration in paper.

Open ghost opened this issue 4 years ago • 13 comments

Hi Hanxiao @quark0 ,

Recently, I try to reproduce your work. I use the code https://github.com/quark0/darts/ and reproduce the test process. It's great! But I cannot reproduce the architecture search process to obtain the same or similar as the DART_V2 architecture. I run "train_search.py" following your paper. There was an overfitting during the training and the final valid loss was 6.56.

Is this result normal? If not, can you give me some advice?

Aug 16 '19 01:08 ghost

Or can others reproduce the architecture search? We discuss it together.

Aug 16 '19 01:08 ghost

I cannot reproduce the architecture search process neither. I did not change any code, but I obtained a different architecture claimed in the paper. It is strange.

Aug 22 '19 14:08 brycexu

yes, the structure is different. The paper said we need to run many times.

Aug 23 '19 12:08 ray-lee-94

@VCBE123 So have you reproduced the results when you run many times?

Aug 27 '19 07:08 ShihuaHuang95

We need to change seed and run many times, but until now I can not get a similar result.

Aug 29 '19 01:08 ghost

Have any other reproduce the result successfully? I got the best valid acc 88% in seaching process with the random seed setted as 2(without cutout data precessing).

Jan 02 '20 08:01 pingguokiller

Have any other reproduce the result successfully? I got the best valid acc 88% in seaching process with the random seed setted as 2(without cutout data precessing).

May I ask you which version of Pytorch are you using? Thanks

Jan 10 '20 03:01 HaFred

Have any other reproduce the result successfully? I got the best valid acc 88% in seaching process with the random seed setted as 2(without cutout data precessing).

May I ask you which version of Pytorch are you using? Thanks

I use torch 1.2.

Jan 10 '20 04:01 pingguokiller

Have any other reproduce the result successfully? I got the best valid acc 88% in seaching process with the random seed setted as 2(without cutout data precessing).

Hi @pingguokiller , I am also trying to reproduce the results. Do you change the seed to search architectures? It's strange that once I changed the seed the searching time greatly increase.

Feb 21 '20 09:02 Debrove

Have any other reproduce the result successfully? I got the best valid acc 88% in seaching process with the random seed setted as 2(without cutout data precessing).

Hi @pingguokiller , I am also trying to reproduce the results. Do you change the seed to search architectures? It's strange that once I changed the seed the searching time greatly increase.

I have reproduced the results successfully(First order). I overlooked some important hyper-parameters. Other researchers told the results were very random with the change of seed.

Feb 21 '20 09:02 pingguokiller

@pingguokiller Do you reproduce the results(First order) with the process of searching four times, and then choosing the best valid accuracy of the arch. Finally training it three times and get the average accuracy?

Moreover, would you mind pointing out what important hyperparameters is you mentioned above.

Thank you very much.

Feb 21 '20 09:02 Debrove

@pingguokiller , same question here, could you help us getting the right shot?

Oct 06 '20 01:10 d12306

@pingguokiller , same question here, could you help us getting the right shot?

The search progress is very random. Maybe it's difficult to get the same architecture as presented in paper. You can reproduce the valid process with the code provided by the author on cifar10. However, I still can't reproduce the valid process on imagenet until now.

Oct 06 '20 03:10 pingguokiller

darts darts copied to clipboard

I can't reproduce the architecture search process following the configuration in paper.

darts
darts copied to clipboard