rnaseq
rnaseq copied to clipboard
Add NGSCheckMate
This PR adds the NGSCheckMate tool to the pipeline. I find this an essential tool and part of my initial QC for all sequencing we perform. This tool takes a bed file with a set of SNPs and tries to determine if samples come from the same individuals. Developed and generally used in humans (there are SNP bed files for hg19/hg38/GRCh37/GRCh38 available), but could be used in other species if a suitable set of common SNPs was provided. This should be configured to run automatically on GRCh37/38, hg37/hg38 I think needs the files adding to igenomes - need to check.
This ensures that no sample swaps have occurred by checking:
- Replicates come from the same individual
- Paired samples (e.g. timepoints or tissue samples) come from the same individual
- Non-paired samples didn't come from the same individual! It works on human derived cell lines, but won't separate different treatments applied to the same cell line.
Testing. I have used a bed file on the test datasets to get it to run on yeast, haven't tried the full tests yet.
- [X] This comment contains a description of changes (with reason).
- [ ] If you've fixed a bug or added code that should be tested, add tests!
- [X] If you've added a new tool - have you followed the pipeline conventions in the contribution docs- [X] If necessary, also make a PR on the nf-core/rnaseq branch on the nf-core/test-datasets repository.
- [ ] Make sure your code lints (
nf-core lint
). - [x] Ensure the test suite passes (
nextflow run . -profile test,docker --outdir <OUTDIR>
). - [x] Usage Documentation in
docs/usage.md
is updated. - [x] Output Documentation in
docs/output.md
is updated. - [x]
CHANGELOG.md
is updated. - [ ]
README.md
is updated (including new tool citations and authors/contributors).