progen icon indicating copy to clipboard operation
progen copied to clipboard

How to specify a control tag in the sample script?

Open LiorZ opened this issue 3 years ago • 3 comments

In the paper it was mentioned that one can sample sequences based on a control tag. Could you provide an example for using a control tag with the sample.py script?

LiorZ avatar Jan 31 '23 14:01 LiorZ

I'm very curious about that too. Based on the paper I'm assuming they introduced the control tags during fine tuning on a sample protein family (lysozymes in the case of the paper).

For my use case unfortunately there isn't a well known family with enough sequence data to train on. I'm wondering if we could prompt the model with an existing sequence to generate new ones based on distance.

okaris avatar Feb 04 '23 19:02 okaris

@okaris Apparently this repo is for ProGen2. In ProGen2 the control tags were removed AFAIK. The Nature Biotechenology paper refers to ProGen1 and that's what made the confusion.

Will be happy for a confirmation from one of the authors

LiorZ avatar Feb 05 '23 09:02 LiorZ

That's really confusing. (And I do NOT understand why they'd remove the control tags :( )

ddofer avatar Jan 18 '24 09:01 ddofer