Andrew Su

Results 29 comments of Andrew Su

Looking at the thread above, looks like this data plugin is ready for deployment as a pending API... Assigning to @erikyao to evaluate...

I also think this data would be super useful, so I looked into it a bit. Just recording what I found... Most of our NCBI data comes from https://ftp.ncbi.nlm.nih.gov/gene/DATA/. It...

Just to add a tiny bit more info. I _suspect_ the difference in behavior between `P63044` and `P23819` is due to the lack of an Entrez Gene mapping in the...

example record showing association between `Alacrimia-choreoathetosis-liver dysfunction syndrome` (`OrphaCode:404454`) and `N-glycanase 1` (`ENSG00000151092`). On the scale of things, this is a well-structured resource that should be relatively simple to parse......

Also note that some people (eg our cognoma friends) may be relying on the existing scoring scheme for production applications. So we should consult with them before pushing major changes...

This file has the gene-to-family links: http://ftp.ebi.ac.uk/pub/databases/genenames/hgnc/csv/genefamily_db_tables/gene_has_family.csv hgnc_id | family_id -- | -- 11148 | 3 3960 | 3 3961 | 3 3477 | 1963 4621 | 1963 4622 |...

re: @mcourtot's comment, it's true that Wikidata probably generally satisfies the attribution requirement for CC-BY. But Wikidata itself is CC0, so if you grant Wikidata permission to use your data,...

> IMO, the stronger argument (which I have also made) is that CC-BY as a legal instrument is the wrong tool to bring to bear for declaring an attribution requirement...

A few quick notes... - instead of looping over `mg.query`, you can also perform batch queries via `mg.querymany`, as described in https://pypi.org/project/mygene/ - to perform the query *and* get specific...

Just a bit more info on mygene.info in case it's useful: * We pull GO annotations from [NCBI's gene2go file](https://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2go.gz), so should have the same data as the Entrez API...