taxpasta
TAXonomic Profile Aggregation and STAndardisation
### Checklist - [X] There are [no similar issues or pull requests](https://github.com/taxprofiler/taxpasta/issues) for this yet. ### Problem A similar issue of different file formats exists for amplicon analysis tools, such...
### Is there an existing issue for this? - [X] I have searched the existing issues ### Problem description See [discussion on Slack](https://nfcore.slack.com/archives/C031QH57DSS/p1701903800138819) reported by @AmaliT. The following config causes...
### Checklist - [X] There are [no similar issues or pull requests](https://github.com/taxprofiler/taxpasta/issues) for this yet. ### Problem Currently taxpasta can add taxonomic names when standardizing profiler output. Sometimes a table...
### Is there an existing issue for this? - [X] I have searched the existing issues ### Problem description As in title, this report is forwarded from https://github.com/nf-core/taxprofiler/issues/396. The MetaPhlAn...
In order to use the [OPAL tool](https://github.com/CAMI-challenge/OPAL) for analysis and visualization, it might be useful to convert the output of any supported profiler to [that format](https://github.com/bioboxes/rfc/tree/master/data-format).
### Checklist - [X] There are [no similar issues or pull requests](https://github.com/taxprofiler/taxpasta/issues) for this yet. ### Problem I would like to merge multiple Bracken output tables. Prior to merging them,...
I think sourmash is an interesting tool, as it is so fast in scanning vast libraries of genomes. We should add support for its output.
We are in full control of the creation of domain models, hence, we should not allow domain models to coerce their input types. This increases overall type strictness.
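A minimal sketch of what "no coercion" could look like, using only the standard library. The `TaxonCount` class and its fields are hypothetical and are not taken from the taxpasta code base; the point is only that a string such as `"15"` is rejected rather than silently converted to an integer.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TaxonCount:
    """Hypothetical domain model, for illustration only."""

    taxonomy_id: str
    count: int

    def __post_init__(self):
        # Reject wrong types outright instead of converting them.
        if not isinstance(self.taxonomy_id, str):
            raise TypeError("taxonomy_id must be a str")
        # bool is a subclass of int, so exclude it explicitly.
        if isinstance(self.count, bool) or not isinstance(self.count, int):
            raise TypeError("count must be an int")


# Correctly typed input is accepted.
ok = TaxonCount(taxonomy_id="562", count=15)

# A string count is refused rather than coerced to 15.
try:
    TaxonCount(taxonomy_id="562", count="15")
except TypeError as error:
    rejected = str(error)
else:
    rejected = None
```

If the project's models are built on a validation library, that library's own strict mode would presumably be the more natural place to enforce this.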
From a comment in the mOTUs PR, so we do not forget: Apparently, mOTUs profiles can contain duplicate tax IDs. Clarify with Sofia and Maxime. For now, sum up read counts.
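The interim approach of summing read counts for duplicate tax IDs could be sketched as follows. This is an illustration under stated assumptions, not the code from the PR: the function name `sum_duplicate_tax_ids` and the `(taxonomy_id, read_count)` row layout are hypothetical, and the example values are made up.

```python
from collections import defaultdict


def sum_duplicate_tax_ids(rows):
    """Collapse rows that share a taxonomy ID by summing their read counts."""
    totals = defaultdict(int)
    for tax_id, count in rows:
        totals[tax_id] += count
    return dict(totals)


# Hypothetical rows: (taxonomy_id, read_count); ID "562" appears twice.
profile = [("562", 10), ("1280", 4), ("562", 5)]
collapsed = sum_duplicate_tax_ids(profile)
```

Summing loses no reads, but it does hide the fact that the profile contained duplicates, which is why the question to the mOTUs authors remains open.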