Z.-L. Deng comments


memory steadily increases as run continues

Had the same issue with both 0.23.2 and 0.23.4 when processing Nanopore read data. It consumed more than 128 GB of memory for 100 Mb of reads. `--dont_eval_duplication` did not help, but `--disable_adapter_trimming` did.

ExtractionError when running taxopy on several samples in parallel with specified dmp files and keep_files=True

Thank you so much for your response! The dmp files were not deleted. I inspected them manually, and they are always there. I ran parallel jobs with Snakemake submitting jobs to...

ExtractionError when running taxopy on several samples in parallel with specified dmp files and keep_files=True

Thank you so much for your help. I just tested it with multiple jobs on the same node, and it ran without any issue.

ExtractionError when running taxopy on several samples in parallel with specified dmp files and keep_files=True

Found another issue that might be related to this one. I was trying to use taxopy with Python multiprocessing, where multiple processes used the taxdb created by ```python taxdb = taxopy.TaxDb(nodes_dmp=nodes_dmp_file,...

ExtractionError when running taxopy on several samples in parallel with specified dmp files and keep_files=True

I haven't tried joblib on it. You are right, I should give it a try. Thank you for your prompt response.

ExtractionError when running taxopy on several samples in parallel with specified dmp files and keep_files=True

Thank you for testing the example. You are right. It is fine if I define the `job` and create the `taxdb` outside of `class`. The issue I encountered seems to...

ExtractionError when running taxopy on several samples in parallel with specified dmp files and keep_files=True

That would be great. Do you also expect a speedup with sqlite? I know ETE3 uses sqlite, and it seems to be very fast.

ExtractionError when running taxopy on several samples in parallel with specified dmp files and keep_files=True

I tested ETE3 for taxonomy queries this afternoon. Multiprocessing is an issue, as each process must create a new sqlite database connection. Otherwise, it gives `sqlite3.DatabaseError: database disk image...
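
For reference, here is a minimal sketch of the one-connection-per-process pattern, using only the standard-library `sqlite3` and `multiprocessing` modules. It is not ETE3's or taxopy's API: the `taxa` table, the file name, and the helper names (`build_demo_db`, `init_worker`, `lookup`, `parallel_lookup`) are all hypothetical and only illustrate the idea. Each worker opens its own connection in a pool initializer instead of inheriting one from the parent process.

```python
import multiprocessing as mp
import os
import sqlite3
import tempfile

# One connection per worker process; set by init_worker(), never shared.
_conn = None


def build_demo_db(path):
    """Create a tiny, hypothetical taxonomy table for the demo."""
    conn = sqlite3.connect(path)
    with conn:  # commits on success
        conn.execute(
            "CREATE TABLE IF NOT EXISTS taxa (taxid INTEGER PRIMARY KEY, name TEXT)"
        )
        conn.executemany(
            "INSERT OR REPLACE INTO taxa VALUES (?, ?)",
            [(1, "root"), (2, "Bacteria"), (2157, "Archaea")],
        )
    conn.close()


def init_worker(db_path):
    """Pool initializer: open a fresh connection inside this process."""
    global _conn
    _conn = sqlite3.connect(db_path)


def lookup(taxid):
    """Return the name for taxid using this process's own connection."""
    row = _conn.execute(
        "SELECT name FROM taxa WHERE taxid = ?", (taxid,)
    ).fetchone()
    return row[0] if row else None


def parallel_lookup(db_path, taxids, processes=2):
    """Run lookups in a pool where every worker has its own connection."""
    with mp.Pool(processes, initializer=init_worker, initargs=(db_path,)) as pool:
        return pool.map(lookup, taxids)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        db_path = os.path.join(tmp, "demo_taxa.sqlite")
        build_demo_db(db_path)
        print(parallel_lookup(db_path, [1, 2, 2157, 999]))
```

Only the database path is passed to the workers, which is a plain string and therefore safe to pickle. The connection objects themselves never cross a process boundary.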

Add separate Nanopore input option

Agreed. It is important to support long-read-only input data, as long-read sequencing is becoming more and more popular.

Codon whl is not a supported wheel on this platform

On macOS, I could install it with Python 3.9. But on Linux, it did not work even after I switched to Python 3.6.