chipseq icon indicating copy to clipboard operation
chipseq copied to clipboard

Add IDR analysis

Open drpatelh opened this issue 5 years ago • 5 comments

Would be nice to add the IDR analysis that is currently carried out by the ENCODE pipelines: https://github.com/kundajelab/atac-seq-pipeline https://docs.google.com/document/d/1f0Cm4vRyDQDu0bMehHD7P7KOMxTOP-HiNoIvL1VcBt8/edit

Another implementation in NF: https://github.com/DoaneAS/atacflow

ENCODE ATAC-seq Guidelines: https://www.encodeproject.org/atac-seq/

Amongst other things, this will probably also involve creating pseudo-replicates by merging and equally sampling alignments across replicates.

See: https://github.com/nf-core/atacseq/issues/36

drpatelh avatar May 30 '19 16:05 drpatelh

This one could be also relevant for the implementation of the IDR analysis specifically for ChIP-seq analysis, implemented in NF: https://github.com/bioinfo-pf-curie/chip-seq

JoseEspinosa avatar Oct 26 '21 21:10 JoseEspinosa

We would need to think about how we define "replicates" here to use that right? Now that we have removed the replicate column from the samplesheet. Maybe we can have an optional column for replicates in the samplesheet for this kind of thing? 🤔

drpatelh avatar Oct 26 '21 21:10 drpatelh

Yes, that could be an option, another one could be to have a separate file with the information about replicates... In the case we want to keep the samplesheet standard across pipelines

JoseEspinosa avatar Oct 26 '21 21:10 JoseEspinosa

I did think about having a separate metadata file but why?! Keep it simple and have one would be my preference.

drpatelh avatar Oct 26 '21 21:10 drpatelh

Yes, I agree two metadata files means increasing at least 2x the possibility of making mistakes :stuck_out_tongue_closed_eyes:

JoseEspinosa avatar Oct 26 '21 21:10 JoseEspinosa