TADbit icon indicating copy to clipboard operation
TADbit copied to clipboard

abnormal eigenvectors

Open shiyi-pan opened this issue 3 years ago • 4 comments

Hi, I used find_compartments function to get eigenvectors and compartments, but I think the distribution of eigenvectors is abnormal,here is my eigenvectors: EigVect1.pdf The eigenvalues are negative at both ends of the chromosome and positive at the middle of the chromosome. Generally speaking, when the eigenvalue is negative, the corresponding compartment should be B compartment, and B compartment is heterochromatin with lower interaction and fewer genes. However, it is generally believed that both ends of the chromosome are usually euchromatin, and there are more genes at both ends of the chromosome.Therefore, I think there is a contradiction here. Could you help me explain the problem reasonably. Thank you very much.

shiyi-pan avatar Jan 16 '21 09:01 shiyi-pan

Hi, As you said the eigenvectors doesn't seem to show you the compartments. First of all, the sign of the eigenvalue does not tell you which type of compartment is. The eigenvectors are used to identify the transitions between compartments. Then to decide if a compartment is A or B you have to use some biological measure. For example in the examples (http://3dgenomes.github.io/TADbit/tutorial/tutorial_8-Compartments_and_TADs_detection.html#compartments) we use the amount of GC content because it's known that active regions have a higher ratio, but you can use whatever measure you like if you have a track of activity.

As for your example, I can't tell without seeing the matrix but it could be that either your matrix is not dense enough (you need more reads) or that the first eigenvector is telling you another kind of division, for example between arms. In the latter you will have to look at the second eigenvector. David

david-castillo avatar Jan 19 '21 09:01 david-castillo

Thank you for your reply, David. first, there are 47,696,249 valid reads after filter and my genome size is 960Mb. So do you think the matrix is dense enough or not? Second,is TADbit support other track like gene density ? Thank you again , David.

shiyi-pan avatar Jan 19 '21 15:01 shiyi-pan

Seems a good amount of reads to me. Have you checked the second eigenvector? You can use any track you wish as a list of values and use the rich_in_A parameter in find_compartments. Just be sure the size of the list matches the size of the column/rows of the matrix.

david-castillo avatar Jan 21 '21 09:01 david-castillo

Thank you for your reply, David. The second eigenvector seems normal. The A compartments distribute at the end of the chromosome and the B compartments distribute near the centromere . I will try to use gene density as the track.Thank you for your help.

shiyi-pan avatar Jan 21 '21 12:01 shiyi-pan