NYU-DLSP20 <Fix> evaluation dataset, printed samples

<Fix> evaluation dataset, printed samples

Open hypothe opened this issue 1 year ago • 1 comments

Bunch of minor "theoretical" changes in the evaluation function:

test_data_gen was used as the data generator in the evaluation, instead of data_generator, thereby evaluating the net on the test set used for training (not an actual issue here given the sequences are randomized and not sampled from existing datasets, but in principle would lead to a data leak in realistic scenarios);
the correct sequences printed were a sampling (with reinsertion) of the first 10 evaluated, instead of 10 sampled from the whole set of correct ones;
the condition for printing the incorrectly classified sequences would declare the absence of misclassifications if verbose==False, independently of their actual presence;

Jan 05 '23 17:01 hypothe