mentornet icon indicating copy to clipboard operation
mentornet copied to clipboard

Question About .csv file

Open zevyu opened this issue 6 years ago • 10 comments

I want to know how the csv file used to train mentor-dd is generated. My understanding is to train the baseline model with a clean tag dataset and use the Corrupted Labels dataset to calculate the loss to get the csv file. Can you tell me the details of generating a csv file?

zevyu avatar Dec 07 '18 08:12 zevyu

We first train our model for 18 epochs on the noisy dataset. Then we use the model to evaluate on another small dataset, where we have some clean labels. The model will outputs all the feature (on the small dataset) to generate the csv.

roadjiang avatar Dec 07 '18 18:12 roadjiang

I got it,thanks

zevyu avatar Dec 08 '18 01:12 zevyu

We first train our model for 18 epochs on the noisy dataset. Then we use the model to evaluate on another small dataset, where we have some clean labels. The model will outputs all the feature (on the small dataset) to generate the csv.

Hi,I want to make sure how the csv file is generated.You said that it pre-trained model on the nosiy dataset firstly, and then evaluate the model on the small dataset(whose size is 10 percents?). So ,the clean labels in the csv file is the true labels of clean data, while the noisy labels is the prediction of the model?Is right?

ruirui88 avatar Jan 11 '19 14:01 ruirui88

Details are in https://github.com/google/mentornet/blob/master/TRAINING.md

clean label column: ground-truth labels on small clean dataset noisy label column: given labels on the current noisy dataset loss column: loss computed using the noisy label

roadjiang avatar Jan 11 '19 19:01 roadjiang

Details are in https://github.com/google/mentornet/blob/master/TRAINING.md

clean label column: ground-truth labels on small clean dataset noisy label column: given labels on the current noisy dataset loss column: loss computed using the noisy label

Sorry ,i don't quite get it. Whether if evaluating the pre-trained model on the clean and noisy dataset together? The samples whose ground-truth label and noisy label is the same comes from clean dataset, while the others come from noisy dataset. What's more, how does calculate the value in the clean label column for this noisy dataset. Is it manually annotated or prediciton of pre-trained model?

ruirui88 avatar Jan 14 '19 14:01 ruirui88

Could you upload the files or code about the function 'provide_resnet_noisy_data' for extract resnet features in the cifa_eval.py(line 186)?

wffzxyl avatar May 13 '19 09:05 wffzxyl

Details are in https://github.com/google/mentornet/blob/master/TRAINING.md clean label column: ground-truth labels on small clean dataset noisy label column: given labels on the current noisy dataset loss column: loss computed using the noisy label

Sorry ,i don't quite get it. Whether if evaluating the pre-trained model on the clean and noisy dataset together? The samples whose ground-truth label and noisy label is the same comes from clean dataset, while the others come from noisy dataset. What's more, how does calculate the value in the clean label column for this noisy dataset. Is it manually annotated or prediciton of pre-trained model?

Have you finished the generation of the csv files? could you give me the csv file generation code. I can't found it in these files

wffzxyl avatar May 13 '19 09:05 wffzxyl

The authors are supposed to generate their own csv files from their models.

On Mon, May 13, 2019, 2:27 AM wffzxyl [email protected] wrote:

Details are in https://github.com/google/mentornet/blob/master/TRAINING.md clean label column: ground-truth labels on small clean dataset noisy label column: given labels on the current noisy dataset loss column: loss computed using the noisy label

Sorry ,i don't quite get it. Whether if evaluating the pre-trained model on the clean and noisy dataset together? The samples whose ground-truth label and noisy label is the same comes from clean dataset, while the others come from noisy dataset. What's more, how does calculate the value in the clean label column for this noisy dataset. Is it manually annotated or prediciton of pre-trained model?

Have you finished the generation of the csv files? could you give me the csv file generation code. I can't found it in these files

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/google/mentornet/issues/2#issuecomment-491747708, or mute the thread https://github.com/notifications/unsubscribe-auth/ADGNQMO5KZDE5UDQCTNUFT3PVEYBJANCNFSM4GJACEOA .

roadjiang avatar May 13 '19 14:05 roadjiang

We first train our model for 18 epochs on the noisy dataset. Then we use the model to evaluate on another small dataset, where we have some clean labels. The model will outputs all the feature (on the small dataset) to generate the csv.

Hi,I want to make sure how the csv file is generated.You said that it pre-trained model on the nosiy dataset firstly, and then evaluate the model on the small dataset(whose size is 10 percents?). So ,the clean labels in the csv file is the true labels of clean data, while the noisy labels is the prediction of the model?Is right?

Hi @ruirui88 , did you manage to create your csv file?

AnnPe avatar Nov 12 '19 12:11 AnnPe

Details are in https://github.com/google/mentornet/blob/master/TRAINING.md clean label column: ground-truth labels on small clean dataset noisy label column: given labels on the current noisy dataset loss column: loss computed using the noisy label

Sorry ,i don't quite get it. Whether if evaluating the pre-trained model on the clean and noisy dataset together? The samples whose ground-truth label and noisy label is the same comes from clean dataset, while the others come from noisy dataset. What's more, how does calculate the value in the clean label column for this noisy dataset. Is it manually annotated or prediciton of pre-trained model?

Have you finished the generation of the csv files? could you give me the csv file generation code. I can't found it in these files

Hi @wffzxyl , did you manage to generate the csv file? I can not reproduce the authors' results, so im afraid Im doing all the wrong way round

AnnPe avatar Nov 12 '19 13:11 AnnPe