Rik Smith-Unna issues

Results: 73 issues of Rik Smith-Unna

Package idea: export citations

Take a (sub)set of search results and export them in a variety of citation formats, ready to go into a reference manager. Could also take the references of all the...

enhancement

Package idea: premined terms packages

EuropePMC runs various text mining pipelines on their papers. We can get the datasets and make a sciencefair package for each one. For example, adding `eupmc-premined-genes` would get the genes...

enhancement

Interest in other languages?

At the moment the study group resources cover R and Python. Is there any interest in having resources for other languages? I specifically have Ruby in mind.

Use cachedown or another memory-first leveldb adapter

By default, it would be great to use memdown for indexing and then flush to disk. https://github.com/mvayngrib/cachedown does this for you, so we could use it as default instead...

optimise (search-index) memory usage

See https://github.com/fergiemcdowall/search-index/issues/261 and https://github.com/fergiemcdowall/search-index-adder/issues/2. Some of the issues are addressed by the NLP pipeline, which greatly reduces redundancy in the index. However, memory usage is still high. A test case...

streaming interface

We should expose a streaming interface. This requires search-index to support streaming addition.

delete records

My initial use-case doesn't require record deletion. However, most people do want to do that.

export database to a compressed archive

Expose archiving in the API, preferably as `tar.xz`.
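As a rough illustration of the idea only (the issue does not specify an API, and the project itself is not written in Python), packing a database directory into a `tar.xz` archive might look like the sketch below. The function name `archive_database` and the directory layout are hypothetical.

```python
import tarfile
from pathlib import Path


def archive_database(db_dir, archive_path):
    """Pack the directory db_dir into an xz-compressed tar archive.

    Hypothetical helper: the name and signature are assumptions,
    not part of any existing API.
    """
    db_dir = Path(db_dir)
    archive_path = Path(archive_path)
    # Mode "w:xz" writes a tar archive with LZMA (xz) compression.
    # arcname stores entries relative to the database folder name
    # instead of the full path on the local disk.
    with tarfile.open(archive_path, "w:xz") as tar:
        tar.add(db_dir, arcname=db_dir.name)
    return archive_path
```

Calling `archive_database("index-db", "index-db.tar.xz")` would produce an archive whose entries all sit under `index-db/`. The archive should be written outside the directory being packed.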

Consider contributing to BioJulia

Hi, OpenGene! OpenGene is heading towards covering much of the same territory as [BioJulia](https://github.com/BioJulia/Bio.jl). Rather than create a new package, would you consider contributing to BioJulia? We'd be very glad...

Windows end-of-line characters break eXpress

Windows line endings in the FASTA reference cause eXpress to die with error: `Target 'some_target' exists in MultiFASTA but not alignment (SAM/BAM) file`
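One possible workaround, not confirmed in the issue, is to convert the FASTA reference to Unix line endings before running eXpress. The sketch below assumes the file fits in memory; the function name `normalise_line_endings` is hypothetical.

```python
def normalise_line_endings(src_path, dst_path):
    """Copy a file, converting CRLF (and lone CR) line endings to LF.

    Hypothetical helper for illustration; reads the whole file into memory.
    """
    with open(src_path, "rb") as src:
        data = src.read()
    # Replace CRLF first so the lone-CR pass does not double the newlines.
    data = data.replace(b"\r\n", b"\n").replace(b"\r", b"\n")
    with open(dst_path, "wb") as dst:
        dst.write(data)
```

Working in binary mode avoids any platform-dependent newline translation, so the output contains only `\n` line endings regardless of where the script runs.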

bug