Z.-L. Deng
I had the same issue with both 0.23.2 and 0.23.4 when processing Nanopore read data. It consumed more than 128 GB of memory for 100 Mb of reads. `--dont_eval_duplication` did not help, but `--disable_adapter_trimming` did.
Thank you so much for your response! The dmp files were not deleted. I inspected them manually, and they are always there. I ran parallel jobs with Snakemake submitting jobs to...
Thank you so much for your help. I just tested it with multiple jobs on the same node, and it ran without any issues.
I found another issue that might be related to this one. I was trying to use taxopy with Python multiprocessing, where multiple processes used the taxdb created by `taxdb = taxopy.TaxDb(nodes_dmp=nodes_dmp_file,`...
I haven't tried joblib on it. You are right, I should give it a try. Thank you for your prompt response.
Thank you for testing the example. You are right. It is fine if I define the `job` and create the `taxdb` outside of `class`. The issue I encountered seems to...
That would be great. Do you also expect a speedup with SQLite? I know ETE3 uses SQLite, and it seems to be very fast.
I tested ETE3 for taxonomy queries this afternoon. Multiprocessing is an issue because each process must create a new SQLite database connection. Otherwise, it gives `sqlite3.DatabaseError: database disk image`...
Agreed. It is important to support long-read-only input data, as long-read sequencing is becoming more and more popular.
On macOS, I could install it with Python 3.9. But on Linux, it did not work even after I switched to Python 3.6.