Cornelius Roemer comments

Results: 325 comments of Cornelius Roemer

Vim navigation doesn't work in Jupyter Notebooks

Vim should definitely be disabled in Jupyter so that one can at least use Emacs keybindings for things like `ctrl+e`, etc.

Zoom to node by node name in the label URL param

As discussed here, this sounds like it requires an Augur solution; see nextstrain/augur#1068 for a recent duplicate.

Usher server down: "Very early error"

Oh nice, I didn't know about the Euro mirror, that might be a better option for me anyways! https://genome-euro.ucsc.edu/cgi-bin/hgPhyloPlace This seems to work!

Usher server down: "Very early error"

Hmm ok it fails soon after submitting sequences:

Add `Pango lineage assigned by Usher` etc as colorings/metadata also for uploaded samples

Very cool! I always use dev.usher.bio anyways so don't mind if it's released or not ;)

ENH(parse): Allow specification of which field to retain as sequence id, e.g. accession instead of strain name

Workaround using seqkit for the GISAID case (keeping second field as id):

```
seqkit seq \
  --id-regexp "^.*\|([^\|]+)\|" \
  -i \
  {input.fasta}
```
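To see what that `--id-regexp` pattern captures, here is a minimal Python sketch that applies the same regular expression to a header. It uses Python's `re` rather than seqkit's own regex engine, so treat it as an approximation. The function name `second_field_id` and the sample header are illustrative assumptions, not part of seqkit or augur.

```python
import re
from typing import Optional

# Same pattern as passed to `seqkit seq --id-regexp` above.
ID_PATTERN = re.compile(r"^.*\|([^\|]+)\|")


def second_field_id(header: str) -> Optional[str]:
    """Return the captured id field, or None if the header does not match.

    `header` is the FASTA header without the leading '>'.
    (Hypothetical helper, for illustration only.)
    """
    match = ID_PATTERN.search(header)
    return match.group(1) if match else None


# Illustrative three-field header: strain name | accession | date
example = "hCoV-19/Wuhan/WIV04/2019|EPI_ISL_402124|2019-12-30"
print(second_field_id(example))
```

Because the leading `.*` is greedy, the pattern captures the field just before the last `|`. That is the second field only when the header has exactly three `|`-separated fields, as in the example.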

Feature: allow reading from stdin (ideally with schema inference)

Here we go! https://github.com/apache/arrow-rs/issues/1059 OK, duckdb just lost because they also require insane amounts of memory. Your implementation is super lean, on track to be the winner if we can...

Feature: allow reading from stdin (ideally with schema inference)

Oh duckdb just seems to read everything into memory, then write out rather than stream. It's kind of a hack how I'm using it for ETL - so not surprising...

Feature: allow reading from stdin (ideally with schema inference)

Ok that seekable thing doesn't work. So back to inferring schema when reading from stdin. One could multipeek on the stdin reader for a few lines, collect into a vec,...
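The "peek a few lines, then keep streaming" idea can be sketched roughly as follows. This is a Python illustration of the general approach under stated assumptions, not the tool's actual implementation (which the thread suggests is in Rust). The names `peek_lines`, `infer_type` and `infer_schema`, the tab delimiter, and the three type labels are all hypothetical.

```python
import csv
import itertools
from typing import Dict, Iterable, Iterator, List, Sequence, Tuple


def peek_lines(stream: Iterable[str], n: int) -> Tuple[List[str], Iterator[str]]:
    """Buffer the first n lines and return them together with an iterator
    that replays the buffered lines before continuing with the stream."""
    it = iter(stream)
    head = list(itertools.islice(it, n))
    return head, itertools.chain(head, it)


def infer_type(values: Sequence[str]) -> str:
    """Pick the narrowest of int64 / float64 / string that fits all values."""
    if not values:
        return "string"
    for caster, name in ((int, "int64"), (float, "float64")):
        try:
            for value in values:
                caster(value)
            return name
        except ValueError:
            continue
    return "string"


def infer_schema(sample_lines: List[str], delimiter: str = "\t") -> Dict[str, str]:
    """Infer column types from a header line plus a few sample rows."""
    rows = list(csv.reader(sample_lines, delimiter=delimiter))
    header, body = rows[0], rows[1:]
    columns = list(zip(*body)) if body else [() for _ in header]
    return {name: infer_type(col) for name, col in zip(header, columns)}
```

With a non-seekable input such as `sys.stdin`, one would call `head, lines = peek_lines(sys.stdin, 100)`, infer the schema from `head`, and then consume `lines` for the conversion itself, so nothing is read twice. Only the sampled lines are held in memory. A schema inferred from a small sample can of course be wrong for later rows.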

Feature: allow reading from stdin (ideally with schema inference)

As a workaround (for now) I can just do the parquet conversion on a cluster where I have TB of disk space. Should be possible to feed the 100GB file...