ExPecto icon indicating copy to clipboard operation
ExPecto copied to clipboard

averaging forward and reverse strand from output of chromatin.py

Open cmlakhan opened this issue 4 years ago • 2 comments

For other purposes I wanted to get a single chromatin effect score for each SNP from what I can tell if

X = num_variants

then for each SNP i then the forward and backward strands are at index i and i+X, is that correct? So if I want a single score then I should average row i and row i+X? That is the impression I get from the code in predict.py below:

snp_temp = (np.asarray(h5f[index_start:index_end,:])+ np.asarray(h5f[index_start+int(h5f.shape[0]/2):index_end+int(h5f.shape[0]/2),:]))/2.0

Just wanted to clarify, thanks!

cmlakhan avatar Jan 28 '21 22:01 cmlakhan

Thanks for the question! Yes that is right. The first half of the chromatin predictions are computed from the forward strand sequences and the second half is for the same number of reverse complement sequences.

jzthree avatar Jan 29 '21 15:01 jzthree

Thanks for clarifying! You typically just average the two values, correct?

cmlakhan avatar Jan 29 '21 16:01 cmlakhan