taxondna
taxondna copied to clipboard
Exclude loci or taxa from data export if they did not sequence/capture above a threshold
Export option to exclude loci not present in X% of taxa. Export option to exclude taxa with fewer than X % of loci. It should be possible to use both of the above selective criteria at once for any data export format. When data are excluded there should be a report of what was left out.
I'm trying to figure out what the best user interface for this would be:
- Have each of these be a menu option, but I think it'd be nice for users to be able to see which loci and taxa they're exporting.
- An "Export dataset by criteria ..." menu item, which opens a dialog box that provides the following options:
- [ ] Exclude loci not present in [100%] of taxa.
- This will exclude the following loci: [List]
- [ ] Exclude taxa not present in [0%] of loci.
- This will exclude the following taxa: [List]
- [A table, similar to the main table, showing what will be output]
- Buttons that provide exports in the formats listed in the "Export ..." menu, as well as an additional button for "Open filtered dataset in a new window".
- [ ] Exclude loci not present in [100%] of taxa.
- A "Filter by criteria ..." menu item, which would open a dialog box identical to that above, but which would only have an "Open filtered dataset in a new window" button. Once you do that, you can then export the new dataset in whichever formats you like.
What do you think?
As per https://twitter.com/RobLanfear/status/1519534783050952704, it might also be useful to filter taza/loci that have more or less than a certain threshold of gaps/unknowns.