LongQC
LongQC copied to clipboard
incomprehensible issue
Goodmorning,
I run longqc with ONT data with the following command : srun longQC.py sampleqc -x ont-rapid -s ${sample} -p 30 -o ${folder_sampleqc}/${sample}_500X_rapid ${long_read} and some of my samples succeeded but some other seemed to crash and I don't understand why (the issue is uncomprehensible for me, I'm biologist...-> see below for the issue) I've questioned the cluster manager @lecorguille and he thinks it's rather a tool dependant issue than an installation one. Could you help us to resolve it ?
thans a lot regards
Chloé
lq_coverage:2021-03-16 18:57:46,714:374:INFO:Coordinates of coverage analysis were parsed.
Traceback (most recent call last):
File "/opt/LongQC/longQC.py", line 933, in
Hello @ChloeAmbroset,
Thank you for your interest in LongQC. As you can run the tool for some datasets, some unexpected output could be generated by some of your datasets.
If it is possible, could you please share 2 files (coverage_err.txt and coverage_out.txt) for a dataset faced this issue?
They should be located in a folder ${folder_sampleqc}/${sample}_500X_rapid/analysis/minimap2/
.
I would be able to spot the issue from these files.
For now, I think lengths of reads may have caused this issue (either much shorter/longer than our expectations).
Technically these files should not contain sensitive information, but you can edit (sample name, path, etc.) when you need.
Yoshinori
Hello Yoshinori,
Thank's a lot for your reply. As request, I share you the following files : coverage_err_Mbv-L15762C1.txt coverage_out_Mbv-L15762C1.txt
hope you will be abble to read it. otherwize I'll copy-paste them again.
Thank's a lot in advance
Best regards
Chloé
Hello @ChloeAmbroset,
Thank you for sharing the files. They're very helpful. Let me check at my side. In the meantime, upgrading the version by cluster manager (@lecorguille) might resolve the issue. Your output was generated by a bit older version (<version 1.2), and may have hit some old code issue.
Will update once I pinpoint the issue. thanks.
Best, Yoshinori
Hello @ChloeAmbroset,
Thank you for your patience and sharing. I pushed revised codes, and could you please ask upgrading LongQC?
Version 1.2.0b should be able to handle your datasets faced some issues.
$python longQC.py --version
The command should return "LongQC 1.2.0b".
Yoshinori
Hi @yfukasawa , can you tag the 1.2.0b version. We provide longqc as a Singularity image that pull this github repository
Thanks!
Hi @lecorguille,
Sure, I just added 1.2.0b in release. Could you please check it?
We provide longqc as a Singularity image that pull this github repository
that's cool. never tested on Singularity, but our cluster also uses it. it's good to know, thanks!
Y.
Hi @yfukasawa ,
@lecorguille installed and updated the new version of longqc/1.2 and longqc crashed again ... I had this error message after this following command: srun longQC.py sampleqc -x ont-rapid -s ${sample} -p 30 -o ${folder_sampleqc}/${sample}_rawreads-rapid ${long_read}
Traceback (most recent call last): File "/opt/LongQC/longQC.py", line 957, in main(args) File "/opt/LongQC/longQC.py", line 63, in main args.handler(args) File "/opt/LongQC/longQC.py", line 370, in command_sample df_mask = pd.read_table(lm.get_outfile_path(), sep='\t', header=None) File "/opt/conda/lib/python3.8/site-packages/pandas/io/parsers.py", line 689, in read_table return _read(filepath_or_buffer, kwds) File "/opt/conda/lib/python3.8/site-packages/pandas/io/parsers.py", line 462, in _read parser = TextFileReader(filepath_or_buffer, **kwds) File "/opt/conda/lib/python3.8/site-packages/pandas/io/parsers.py", line 819, in init self._engine = self._make_engine(self.engine) File "/opt/conda/lib/python3.8/site-packages/pandas/io/parsers.py", line 1050, in _make_engine return mapping[engine](self.f, **self.options) # type: ignore[call-arg] File "/opt/conda/lib/python3.8/site-packages/pandas/io/parsers.py", line 1898, in init self._reader = parsers.TextReader(self.handles.handle, **kwds) File "pandas/_libs/parsers.pyx", line 521, in pandas._libs.parsers.TextReader.cinit pandas.errors.EmptyDataError: No columns to parse from file srun: error: cpu-node-39: task 0: Exited with exit code 1
thanks again for your help
Chloé
Hi @ChloeAmbroset,
Thank you for letting me know. The error looks unrelated to the issue revised recently, unfortunately, so let me ask a below question to trouble shoot.
- do you get some errors when you execute below? (from the command, I understood LongQC was installed under /opt)
/opt/LongQC/minimap2-coverage/sdust
the expected message is below:
Usage: sdust [-w 64] [-t 20] <in.fa>.
If you don't get the above message, a program 'sdust' may haven't run in your environment due to some reasons. This may not be the case, but let me check this point first.
Yoshinori
Indeed, my bad! I had a misspelling in the minimap2-coverage path (minimap2_coverage) and so sdust wasn't compiled