Which other databases should be included in biomartr?

Open HajkD opened this issue 8 years ago • 9 comments

I would like to start a brainstorming discussion on which other databases would be useful to include in biomartr. I would be grateful for input from the different life science communities working with genomics data.

Many thanks!

HajkD, Jan 19 '17 15:01

UniProt?

ARamesh123, Mar 11 '17 01:03

Hi @ARamesh123

Sounds like a very good idea!

I just checked: http://www.uniprot.org/help/programmatic_access

This should be feasible :) I will familiarize myself with the API and implement UniProt retrieval functions.

I am happy to receive further suggestions, including pointers to more specific functionality that is needed.

Many thanks!

Hajk

HajkD, Mar 11 '17 10:03

FunGene: http://fungene.cme.msu.edu/ It's terribly painful to extract reference genes for functional genes from FunGene with appropriate taxonomic strings. Their data seems to be protected, though.

jvineis, Nov 03 '17 13:11

JGI IMG is critical. Thanks Hajk! https://img.jgi.doe.gov/

jvineis, Nov 17 '17 14:11

I just finished implementing the proteome retrieval interface to UniProt.

Please see the example here for details.

In short, users can now specify:

biomartr::getProteome(db = "uniprot", organism = "Mus musculus")

I hope this is useful.

Best, Hajk

HajkD, Jan 18 '18 18:01

Ensembl Plants

Norbittner, Feb 13 '20 14:02

Not sure if this has been covered but:

Homo sapiens (human) genome assembly GRCh38 (hg38)

sofsta, Apr 14 '20 13:04

This should work with:

biomartr::getGenome(db = "refseq", organism = "GCF_000001405.39")

The RefSeq assembly accession was taken from: https://www.ncbi.nlm.nih.gov/assembly/GCF_000001405.39

Does this solve the human genome issue?

HajkD, Apr 14 '20 15:04