
Timeline for training data release

Open twidatalla opened this issue 2 months ago • 0 comments

Hello,

Thank you for open-sourcing this amazing work; AtomWorks will be great for the community. To that end, I was curious if/when the following datasets will be released:

We also develop two new nucleic acid distillation datasets, described within the supplementary methods: a protein-nucleic acid complex distillation set and an RNA distillation set (with 27K examples and 10K examples, respectively)

If not, could more details about how these datasets were created (ideally at least the raw sequences) be made available?

Best, Talal

twidatalla, Oct 16 '25 17:10