plur icon indicating copy to clipboard operation
plur copied to clipboard

Query about evaluation script

Open smith-co opened this issue 3 years ago • 3 comments

Readme mentions to run plur_evaluator.py.

python3 plur_evaluator.py
   --dataset_name=manysstubs4j_dataset 
   --target_file_pattern=/tmp/manysstubs4j_dataset/targets.txt 
   --prediction_file_pattern=/tmp/manysstubs4j_dataset/predictions.txt

Here the ground truth is targets.txt (`-target_file_pattern=/tmp/manysstubs4j_dataset/targets.txt).

I have two queries:

  • What's the format of this file?
  • Should not be this targets.txt file created after running the Data loader script (plur_data_generation.py)?

But after running plur_data_generation.py, I don't find the targets.txt in the /tmp/manysstubs4j_dataset/ folder.

@smoitra87 @VHellendoorn @dan-zheng can you please help me with this query?

smith-co avatar Oct 06 '22 21:10 smith-co

Hi there! I haven't tried running PLUR personally, so I'm not sure how much I can help.

But after running plur_data_generation.py, I don't find the targets.txt in the /tmp/manysstubs4j_dataset/ folder.

Did you try specifying the --stage_1_dir and --stage_2_dir flags? They seem to control where data is generated.

https://github.com/google-research/plur/blob/6181909a0d09cc6492a5819f68ad0ca9cdc082e8/plur/plur_data_generation.py#L75-L78

dan-zheng avatar Oct 07 '22 21:10 dan-zheng

Hi @smith-co and thanks @dan-zheng for jumping in. The format of targets.txt should be here - https://github.com/google-research/plur/blob/main/plur/eval/README.md

Hope this helps. If you're still having trouble generating the dataset please post here again with a full set of commands you ran and the outputs you obtained.

smoitra87 avatar Oct 09 '22 02:10 smoitra87

@smoitra87 where is the equivalent file for /tmp/code2seq_dataset/targets.txt with the manysstubs4j_dataset? I dont see targets.txt file for the manysstubs4j_dataset.

@smith-co have you experienced the same?

nashid avatar Oct 09 '22 02:10 nashid