nanoGPT

Is there an easy way to add generation of multiple possible sequences?

Open vladimirlitvinyuk opened this issue 1 year ago • 3 comments

vladimirlitvinyuk avatar Mar 12 '23 12:03 vladimirlitvinyuk

Isn't that what `num_samples` in sample.py is for? It generates that many sequences.

Or do you mean stopping partway through and offering options for branching output?

Coriana avatar Mar 12 '23 23:03 Coriana

Hi. Sorry for the slow reply. As far as I can see, the samples are generated independently, so unique sequences are not guaranteed. Strictly unique sequences are my minimum requirement. However, stopping partway through and offering options for branching output could also be an interesting case.

vladimirlitvinyuk avatar Mar 27 '23 16:03 vladimirlitvinyuk

@vladimirlitvinyuk

You are talking about a model with anywhere between 124 million and 1.5 billion parameters. It doesn't produce duplicates very often unless you give it very narrow input.
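If strict uniqueness is required anyway, one option is to keep sampling and discard repeats. The following is a minimal sketch of that idea, not part of nanoGPT: `sample_unique` and `make_toy_generate` are hypothetical names, and the toy generator is only a stand-in for a real sampling call such as nanoGPT's `model.generate`, which works on tensors rather than Python lists.

```python
import random


def make_toy_generate(seed=0, vocab_size=4, max_new_tokens=3):
    """Hypothetical stand-in for a model: appends random token ids to the prompt."""
    rng = random.Random(seed)

    def generate(prompt):
        return list(prompt) + [rng.randrange(vocab_size) for _ in range(max_new_tokens)]

    return generate


def sample_unique(generate, prompt, num_unique, max_attempts=1000):
    """Draw samples until num_unique distinct sequences are collected.

    Stops after max_attempts draws, so it may return fewer sequences
    when the model keeps producing the same outputs.
    """
    seen = set()
    results = []
    for _ in range(max_attempts):
        if len(results) == num_unique:
            break
        seq = tuple(generate(prompt))  # tuples are hashable, lists are not
        if seq not in seen:
            seen.add(seq)
            results.append(seq)
    return results


generate = make_toy_generate(seed=0)
unique_samples = sample_unique(generate, prompt=[0], num_unique=5)
for seq in unique_samples:
    print(seq)
```

With a real model, the same check could be applied to the token lists (or the decoded strings) before printing. The `max_attempts` cap matters for very narrow prompts, where there may be fewer distinct outputs than requested.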

As for your other point about branching outputs: make a loop, call the generation function inside it with only a few tokens to generate, and repeat.
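A rough sketch of such a loop is shown here, under stated assumptions: `branching_generate`, `make_toy_generate`, and the `choose` callback are hypothetical names, and the toy generator stands in for a real call such as `model.generate(idx, max_new_tokens, ...)`. Each round samples a few short continuations, lets the caller pick one, and continues from the chosen branch.

```python
import random


def make_toy_generate(seed=0, vocab_size=50):
    """Hypothetical stand-in for a model: appends random token ids to the input."""
    rng = random.Random(seed)

    def generate(tokens, max_new_tokens):
        return list(tokens) + [rng.randrange(vocab_size) for _ in range(max_new_tokens)]

    return generate


def branching_generate(generate, prompt, num_rounds, num_branches, chunk_len, choose):
    """Grow a sequence in short chunks, offering several branches per round.

    choose receives the list of candidate sequences and returns the index
    of the branch to continue from (for example, from user input).
    """
    tokens = list(prompt)
    for _ in range(num_rounds):
        # Sample several short continuations of the current sequence.
        options = [generate(tokens, chunk_len) for _ in range(num_branches)]
        tokens = options[choose(options)]
    return tokens


generate = make_toy_generate(seed=0)
# Always take the first branch here; an interactive tool would ask the user.
result = branching_generate(
    generate,
    prompt=[1, 2],
    num_rounds=3,
    num_branches=4,
    chunk_len=2,
    choose=lambda options: 0,
)
print(result)
```

The `choose` callback keeps the selection policy separate from the sampling loop, so the same loop could serve an interactive prompt or an automatic scoring rule.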

VatsaDev avatar Aug 23 '23 20:08 VatsaDev