test-datasets icon indicating copy to clipboard operation
test-datasets copied to clipboard

source for oncoanalyser test data ?

Open aeozdemr opened this issue 2 months ago • 2 comments

Hi,

I just want to know where this data comes from, any info for that ?

aeozdemr avatar Oct 30 '25 16:10 aeozdemr

This would be a good question for someone like @rayanhassaine or @scwatts !

It should be documented, here: https://github.com/nf-core/test-datasets/tree/oncoanalyser

But I don't see anything currently. Maybe it can be updated?

jfy133 avatar Nov 03 '25 09:11 jfy133

I generated this test data by manually simulating a minimal set of reads needed to produce somewhat sensible results quickly when analysed by oncoanalyser. This was done with a combination of Python / SAMtools / wgsim, and includes germline predisposition small variants, somatic driver small variants, somatic structural variants + gene fusions, somatic viral integration

scwatts avatar Nov 05 '25 03:11 scwatts