promptbench icon indicating copy to clipboard operation
promptbench copied to clipboard

Parsing the output of a model

Open jylong91 opened this issue 1 year ago • 4 comments

The output of the model largely depends on the prompt. When performing text classification tasks such as sst2, which requires the model to give negative or positive responses, the model may provide various forms of answers, and the form of answers from different models may vary. Is there a good way to parse these results?

jylong91 avatar Jan 05 '24 02:01 jylong91

Hi, our current approach involves directing the model to specifically output the desired labels ('positive' or 'negative'), which are then enclosed within a unique pattern (e.g., '<<<>>>'). This format allows for easy extraction using regular expressions.

Immortalise avatar Jan 05 '24 06:01 Immortalise

im getting issues regarding qqp dataset evaluation for diff attacks. it gives me dataset_name attribute error when I run attacks

Rizwanali324 avatar Mar 03 '24 19:03 Rizwanali324

Hi, I have tested qqp dataset with Flan-T5 model using StressTest attack, it works for me. Could you please test it and paste the detailed error messages here?

Immortalise avatar Mar 20 '24 03:03 Immortalise

Stale issue message

github-actions[bot] avatar May 19 '24 06:05 github-actions[bot]