NOVOPlasty icon indicating copy to clipboard operation
NOVOPlasty copied to clipboard

help for detecting heteroplasmy

Open Mirror1211 opened this issue 5 years ago • 43 comments

Hi everybody, I'm Eda. Recently, I am going to detect the heteroplasmy within chloroplast genome to investigate wether there is biparental heteroplasmy within my samples. I am really glad to find the NOVOPlasty since it can detect heteroplasmy for chloroplast genome. However, there is no detailed introductions to this analysis and I always obtained blank result files for heteroplasmy analyses. Could anybody help me for the settings of the configure file??? Thanks a lot!

Mirror1211 avatar Jun 08 '19 07:06 Mirror1211

I have already obtained the complete chloroplast genomes assembled by my colleague, and took them as their own reference genome and seed genome, while the results files also were blank.

Mirror1211 avatar Jun 08 '19 07:06 Mirror1211

Hi, could you send me the log, so I can check if all the parameters are correct. And do you have enough coverage for heteroplasmy detection? I haven't test it much on chloroplast genome but it is more complicated because of the many duplicated regions in the mitochondrial genome

ndierckx avatar Jun 08 '19 15:06 ndierckx

The mean coverage of cp genomes ranged from 200 to 7900 X. So, I think the coverage of cp genomes is enough to detece heteroplasmy. I have deleted the log file. I will conduct this analysis this afternoon and upload the log files here. Thanks a lot

Mirror1211 avatar Jun 09 '19 02:06 Mirror1211

Reading Input......OK

Scan reference sequence......OK

Building Hash Table......OK

Subsampled fraction: 100.00 %

Retrieve Seed...

However, there was no futher infomation for the next step. I guess there may be some errors in my configure files.

Mirror1211 avatar Jun 09 '19 03:06 Mirror1211

heter.txt log_test_chloro.txt Here are my configure and log files. Thanks

Mirror1211 avatar Jun 09 '19 03:06 Mirror1211

Your MAF is way too low, you are looking for heteroplasmy of 0,01%, that is impossible. If you want 1%, put 0,01. I will put an automatic allert in the next version. Try what it gives with 0,01. But i see you have reads of 90bp, that means they are old i guess, so less accurate. So definitely don't go below 0.01

ndierckx avatar Jun 09 '19 04:06 ndierckx

Thanks. Actually, I used 0.01 for MAF first. The result files were still blank.

Mirror1211 avatar Jun 09 '19 05:06 Mirror1211

Could I know when the next version will be uploaded in gitub????

Mirror1211 avatar Jun 09 '19 05:06 Mirror1211

Is there any errors in my configure files except for MAF????

Mirror1211 avatar Jun 09 '19 05:06 Mirror1211

Send me the log of the 0,01 then

ndierckx avatar Jun 09 '19 06:06 ndierckx

OK. I will upload the log of the 0.01 this afternoon.

Mirror1211 avatar Jun 10 '19 02:06 Mirror1211

ok, and when you did it before, it also got stuck at Seed retrieval? With a MAF lower than 0.01 that can happen

ndierckx avatar Jun 10 '19 06:06 ndierckx

yes, it also got stuck at Seed retrieval.

Mirror361025 avatar Jun 11 '19 02:06 Mirror361025

Sorry, our serve is maintaining, so I can not download the logfile.

Mirror361025 avatar Jun 11 '19 02:06 Mirror361025

Exactly, I have arond 50 accessions that need to conduct the heteroplasmy detection.

Mirror361025 avatar Jun 11 '19 02:06 Mirror361025

Hi,

Could you send me the seed file?

ndierckx avatar Jun 11 '19 02:06 ndierckx

ok. The serve maintaince will be finished tommorrow. After that, I will upload my seed file.

Mirror361025 avatar Jun 11 '19 02:06 Mirror361025

results.zip hi, I have analyzed another sample with two different MAF. I obtained the results of heteroplasmy. Here are the all results.

Mirror361025 avatar Jun 12 '19 01:06 Mirror361025

I don't know whether the analysis is correct. In addition, the results confused me.

Mirror361025 avatar Jun 12 '19 01:06 Mirror361025

Did you use the chloroplast assembly as a reference and seed or some online reference?

ndierckx avatar Jun 12 '19 01:06 ndierckx

Because you have a lot of homoplasmies, so your reference is probably not from that dataset? if you run again, please use the extended log (set to 1), then I can see when something went wrong. And you should try a high coverage dataset first

ndierckx avatar Jun 12 '19 02:06 ndierckx

Ah sorry coverage seems enough.., but results are bit weird indeed, if you send me one dataset, I can also try myself if you want

ndierckx avatar Jun 12 '19 02:06 ndierckx

But since all heteroplasmy is found in one short region, I would guess there is no heteroplasmy, something surely went wrong in that area

ndierckx avatar Jun 12 '19 02:06 ndierckx

yes, I used the assembled chloroplast genome as reference genome. I am sure the chloroplast genome is consistent with the reads I used. I wonder whether I should use the raw reads from the whole-genome resequencing or use the filtered data that only contained chloroplast reads??

Mirror361025 avatar Jun 12 '19 03:06 Mirror361025

I wonder whether I can have your email for the upload of raw data???

Mirror361025 avatar Jun 12 '19 03:06 Mirror361025

nicolasdierckxsens at hotmail dot com

ndierckx avatar Jun 12 '19 03:06 ndierckx

Thanks

Mirror361025 avatar Jun 12 '19 03:06 Mirror361025

hello, the reads should be the raw whole-genome sequences or the filter data that only contained chloroplast reads??

Mirror361025 avatar Jun 12 '19 03:06 Mirror361025

How did you filter them? And are the raw files very large?

ndierckx avatar Jun 12 '19 07:06 ndierckx

We used the complete cp genomes of related species as reference genomes to screen out the cp genomes reads. The raw reads are ~40 G

Mirror1211 avatar Jun 12 '19 23:06 Mirror1211