gitkolento

Results 1 comments of gitkolento

> My scenario is like this, I have an initial prompt p, for example, a harmful jailbreak prompt, "how to steal", the model's response to this input is: "I'm sorry,...