emp icon indicating copy to clipboard operation
emp copied to clipboard

taxons with empo habitats

Open marctormo opened this issue 5 years ago • 2 comments

Hello!

Do you know if there is any place with a list with information of taxon and habitat? I can't find this information, and I would like to assign it to some taxons (genus or species) like this: taxon1 habitat1 taxon2 habitat2 ... If not, is there any method to extract this info?

Thank you!

marctormo avatar Apr 12 '19 10:04 marctormo

Hi! The closest thing to what you're requesting is an "OTU summary" which lists for each unique tag sequence (variously called "ASVs" or "sOTUs") the samples in which it is found, along with some summary statistics. Combined with the mapping file, which lists the habitat of each sample, you can generate the file you're interested in. Of course, many sequences are found in more than one habitat. There are also different definitions of habitat, e.g., ENVO, EMPO, etc.

The OTU summary file is here (I suggest the version with chloroplast sequences filtered out): ftp://ftp.microbio.me/emp/release1/otu_distributions/otu_summary_no_chl.emp_deblur_90bp.subset_2k.rare_5000.tsv

The associated mapping file is here: ftp://ftp.microbio.me/emp/release1/mapping_files/emp_qiime_mapping_subset_2k.tsv

cuttlefishh avatar May 02 '19 21:05 cuttlefishh

Thank you! I think this is a great solution for me.

marctormo avatar May 03 '19 08:05 marctormo