deeptools cluster by logfc in peak regions

Open das2000sidd opened this issue 5 years ago • 2 comments

Hi, I have called some H3K4me3 peaks in HCT116 cells. The cells were treated with either DMSO or the experimental compound. I want to cluster these peaks by log fold change after compound treatment, using deepTools. My approach is to run computeMatrix in reference-point mode with the center of each peak as the reference point. Below are my two commands: the first generates the matrix, and the second clusters the regions while plotting the heatmap. Can someone please comment on whether my approach is correct?

computeMatrix reference-point --referencePoint center --beforeRegionStartLength 5000 --afterRegionStartLength 5000 --binSize 10 \
  --scoreFileName HCT116_10cm4M_2926_60uM_DMSO_075per_H3K4me3_518_121919_logfc.bw \
  --regionsFileName HCT116_10cm4M_DMSO_075per_H3K4me3_518_121919.macsPeaks_peaks.bed \
  --outFileName HCT116_10cm4M_2926_60uM_DMSO_075per_H3K4me3_518_121919_logfc.gz \
  --missingDataAsZero -p 8

plotHeatmap -m HCT116_10cm4M_2926_60uM_DMSO_075per_H3K4me3_518_121919_logfc.gz \
  --kmeans 3 \
  -out HCT116_10cm4M_2926_60uM_DMSO_075per_H3K4me3_518_121919_logfc_3C.pdf \
  --colorList blue,white,red \
  --samplesLabel "2926/DMSO" \
  --regionsLabel "C1" "C2" "C3" \
  --outFileSortedRegions HCT116_10cm4M_2926_60uM_DMSO_075per_H3K4me3_518_121919_logfc_3C.bed \
  --heatmapWidth 10 --heatmapHeight 30 \
  --missingDataColor "white" \
  --yMin -1 --yMax 1 --zMin -1 --zMax 1 \
  --xAxisLabel "" \
  --sortRegions descend --sortUsing sum

das2000sidd, Jun 22 '20 18:06

That looks reasonable, though you're then clustering on only a single sample (I imagine you'd want all of the samples together when running computeMatrix). In that case you might take the peaks from all of the samples and merge them with bedtools (its merge command combines overlapping peaks).

dpryan79, Jun 24 '20 11:06
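
The peak-merging step suggested above can be sketched as follows. This is a minimal illustration, not the thread's own code: it uses sort and awk as a stand-in so that it runs without bedtools installed, whereas in practice running `bedtools merge` on a sorted BED file is the usual route. The function name `merge_peaks` and the file names in the comments are hypothetical.

```shell
#!/usr/bin/env bash
set -euo pipefail

# merge_peaks (hypothetical helper): concatenate any number of BED files,
# keep the first three columns (chrom, start, end), sort by chromosome and
# start, then collapse overlapping or directly adjacent intervals into one.
merge_peaks() {
  cat "$@" \
    | cut -f1-3 \
    | LC_ALL=C sort -k1,1 -k2,2n \
    | awk 'BEGIN { OFS = "\t" }
        # First interval: start a new running region.
        NR == 1 { cur_chrom = $1; cur_start = $2; cur_end = $3; next }
        # Same chromosome and starts at or before the current end: extend.
        $1 == cur_chrom && ($2 + 0) <= (cur_end + 0) {
          if (($3 + 0) > (cur_end + 0)) cur_end = $3
          next
        }
        # Otherwise emit the finished region and start a new one.
        { print cur_chrom, cur_start, cur_end
          cur_chrom = $1; cur_start = $2; cur_end = $3 }
        END { if (NR > 0) print cur_chrom, cur_start, cur_end }'
}

# Example call with hypothetical file names:
#   merge_peaks dmso_peaks.bed treated_peaks.bed > merged_peaks.bed
if [ "$#" -gt 0 ]; then
  merge_peaks "$@"
fi
```

The merged file could then be passed to computeMatrix through --regionsFileName, with the bigWig of every sample listed after --scoreFileName, so that the k-means clustering in plotHeatmap sees all samples over the same set of regions.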