Rik Smith-Unna

Results 73 issues of Rik Smith-Unna

Take a (sub)set of search results and export them in a variety of citation formats, ready to go into a reference manager. Could also take the references of all the...

enhancement

EuropePMC runs various text mining pipelines on their papers. We can get the datasets and make a sciencefair package for each one. For example, adding `eupmc-premined-genes` would get the genes...

enhancement

At the moment the study group resources cover R and Python. Is there any interest in having resources for other languages? I specifically have Ruby in mind.

by default it would be great to use memdown for indexing and then flush to disk https://github.com/mvayngrib/cachedown does this for you - so we could use it as default instead...

See https://github.com/fergiemcdowall/search-index/issues/261 and https://github.com/fergiemcdowall/search-index-adder/issues/2 Some of the issues are addressed by the NLP pipeline which greatly reduces redundancy in the index. However, memory usage is still high. A test case...

we should expose a streaming interface this requires search-index to support streaming addition

My initial use-case doesn't require record deletion. However, most people do want to do that.

expose archiving in the API preferably `tar.xz`

Hi, OpenGene! OpenGene is heading towards covering much of the same territory as [BioJulia](https://github.com/BioJulia/Bio.jl). Rather than create a new package, would you consider contributing to BioJulia? We'd be very glad...

Windows line endings in the FASTA reference cause eXpress to die with error: `Target 'some_target' exists in MultiFASTA but not alignment (SAM/BAM) file`

bug