promptbench Parsing the output of a model

Parsing the output of a model

Open jylong91 opened this issue 1 year ago • 4 comments

The output of the model largely depends on the prompt. When performing text classification tasks such as sst2, which requires the model to give negative or positive responses, the model may provide various forms of answers, and the form of answers from different models may vary. Is there a good way to parse these results?

Jan 05 '24 02:01 jylong91

Hi, our current approach involves directing the model to specifically output the desired labels ('positive' or 'negative'), which are then enclosed within a unique pattern (e.g., '<<<>>>'). This format allows for easy extraction using regular expressions.

Jan 05 '24 06:01 Immortalise

im getting issues regarding qqp dataset evaluation for diff attacks. it gives me dataset_name attribute error when I run attacks

Mar 03 '24 19:03 Rizwanali324

Hi, I have tested qqp dataset with Flan-T5 model using StressTest attack, it works for me. Could you please test it and paste the detailed error messages here?

Mar 20 '24 03:03 Immortalise

Stale issue message

May 19 '24 06:05 github-actions[bot]

promptbench promptbench copied to clipboard

Parsing the output of a model

promptbench
promptbench copied to clipboard