Open-Assistant icon indicating copy to clipboard operation
Open-Assistant copied to clipboard

Feature suggestion: Given an assistant reply, let users write a prompt for it

Open erkinalp opened this issue 2 years ago • 6 comments

This training scenario might be useful to train latent diffusor (http://arxiv.org/abs/2212.09462) and combined diffusor+GAN architectures of generative models.

erkinalp avatar Feb 16 '23 19:02 erkinalp

interesting idea, like that one game show i'm not gonna name. I can see the potential benefits.

mm12 avatar Feb 17 '23 00:02 mm12

By prompt, do you mean the prompt that would generate the reply? I wonder what we could expect to see from that. I'm assuming the user would have to provide something that would work as an initial prompt, because "Now give me a list of these places." wouldn't be a very effective example. Then again, the assistant could be making a reference to an earlier statement, in which case the correct prompt would be a user reply - unless you limit the assistant messages to the first ones of each tree only.

horribleCodes avatar Feb 17 '23 12:02 horribleCodes

By prompt, do you mean the prompt that would generate the reply?

Yes

erkinalp avatar Feb 17 '23 12:02 erkinalp

This could be used as a way to train and tune doc2query as well - there a somewhat related issue in this project with #645. Could also be used to connect a given response to more prompts and thereby increasing the dataset for the reward model to train on.

michaelbogdan avatar Feb 18 '23 18:02 michaelbogdan

This is actually a brilliant idea @erkinalp. This will create diversity for the data. You would then be creating a new sister subtree tho. So this could complicated things. I will leave for the webteam to decide if this can be implemented. i adding appropriate labels to let that team know.

huu4ontocord avatar Feb 20 '23 13:02 huu4ontocord

Maybe the trees created from this should be in a different dataset altogether? That'd make things easier, but I don't know if there's a benefit in putting the predicted prompts in the same dataset.

horribleCodes avatar Feb 21 '23 08:02 horribleCodes