OpenPrompt
OpenPrompt copied to clipboard
Some questions about the verbalizer
Hi, thank you very much for your open source. I have a few small questions that I hope you can answer. What is the logic for writing the process_logits() function and the Project() function in manual_verbalizer.py, please? Is there a paper on why it is feasible to do the mapping this way? Why don't the logits here just use the logits of mlm and formalise the loss as the loss of the mlm task.
May I ask how the following scenario was arrived at? Are there any relevant papers available for reference? A whole framework to process the original logits over the vocabulary, which contains four steps: (1) Project the logits into logits of label words if self.post_log_softmax is True: (2) Normalize over all label words (3) Calibrate (optional) (4) Aggregate (for multiple label words)