blobtools icon indicating copy to clipboard operation
blobtools copied to clipboard

Why use assembled genomes vs raw reads in example workflow 1

Open Rob-murphys opened this issue 4 years ago • 4 comments

I would leave this question on the google group but the link´is broken and I can't find it myself.

Why use an assembled genome vs raw reads in the example workflow on the readme.io as how can we be sure if some of the sequence in your assembly should not be there in first place?

Rob-murphys avatar Oct 20 '20 11:10 Rob-murphys

Hi Lamm-a,

sorry, i don't understand the question.

please elaborate.

cheers,

dom

DRL avatar Oct 22 '20 15:10 DRL

Hi Dom,

I mean that why does Blobtools operate via detecting comtamination in an already assembled genome vs before assembly?

Would the down side of using an assembled genome not be that we can't be sure if some of the sequence in your assembly should not be there in first place as it might come from a contaminant?

Rob-murphys avatar Oct 22 '20 15:10 Rob-murphys

blobtools just takes information about sequences (length, GC, sequence similarity, coverage, ...) and aggregates it for each sequence.

I mean that why does Blobtools operate via detecting comtamination in an already assembled genome vs before assembly?

before assembly there are only reads ... are you asking why are we not doing something with the reads?

Would the down side of using an assembled genome not be that we can't be sure if some of the sequence in your assembly should not be there in first place as it might come from a contaminant?

I don't understand what you think the alternative is.

DRL avatar Oct 22 '20 15:10 DRL

before assembly there are only reads ... are you asking why are we not doing something with the reads?

Yes. Would it be possible to use mapped raw reads in a similar fashion?

Sorry if this is a ignorant question.

Rob-murphys avatar Oct 26 '20 10:10 Rob-murphys