Nikhil Thorat

Results: 53 issues by Nikhil Thorat

Currently it's a mess: Let's clean this up a bit more so it's more opinionated.

See https://lilacai-lilac.hf.space/datasets#lilac/opus100-en-es-validation

When searching by a concept, it would be great if we could show the concept metrics for the given embedding alongside the pill.

Doing this will allow us to avoid hosting the_movies_dataset.csv dataset.

We currently don't show a message indicating whether a dataset has been sampled.

We should link to the source code for each signal in the signal preview page.

This will let us load popular datasets like blimp without having to run many datasets: https://huggingface.co/datasets/blimp

Turning on parallel reading with multi-line CSV rows (e.g. a quoted string spanning multiple lines) breaks the parallel reader. We should file an issue against duckdb with a minimal...
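
As a rough illustration of why multi-line rows are awkward for a reader that splits work on newlines, here is a minimal sketch using only Python's standard `csv` module. It is not a duckdb repro and makes no claim about how duckdb's reader is implemented; the sample rows are made up for the example.

```python
import csv
import io

# Hypothetical sample data: the row with id "2" has a quoted field
# that spans two physical lines.
rows = [
    ["id", "text"],
    ["1", "single line"],
    ["2", "first line\nsecond line"],
    ["3", "another single line"],
]

# Serialize to CSV text; the writer quotes the field containing a newline.
buffer = io.StringIO()
csv.writer(buffer, lineterminator="\n").writerows(rows)
csv_text = buffer.getvalue()

# Naive approach: treat every newline as a record boundary. This is the
# kind of assumption that lets a file be chopped into independent chunks.
naive_records = csv_text.splitlines()

# Quote-aware approach: the csv module keeps the quoted newline inside
# the field, so the record count matches the data that was written.
parsed_records = list(csv.reader(io.StringIO(csv_text, newline="")))

print(len(naive_records), "physical lines")
print(len(parsed_records), "logical records")
```

The two counts differ because one logical record occupies two physical lines, so any chunk boundary that lands inside the quoted field would start parsing mid-record.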

Currently, the schema has "leafs", which returns all the nodes with values. This is technically incorrect, since a node with a value is not necessarily a leaf, so let's introduce "petal" terminology for nodes with values. That way, leaves are truly leaves.
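
The difference between the two terms can be sketched as follows. This is a hypothetical illustration, not the project's actual schema API: the `Field` class and the `petals` and `leaves` helpers are invented for the example, and it assumes a node "has a value" when it carries a dtype.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterator, List, Optional, Tuple

Path = Tuple[str, ...]


@dataclass
class Field:
    """Hypothetical schema node: an optional value type plus child fields."""
    dtype: Optional[str] = None
    fields: Dict[str, "Field"] = field(default_factory=dict)


def walk(node: Field, path: Path = ()) -> Iterator[Tuple[Path, Field]]:
    """Yield every node in the tree with its path, parents before children."""
    yield path, node
    for name, child in node.fields.items():
        yield from walk(child, path + (name,))


def petals(root: Field) -> List[Path]:
    """Paths of nodes that hold a value, whether or not they have children."""
    return [path for path, node in walk(root) if node.dtype is not None]


def leaves(root: Field) -> List[Path]:
    """Paths of nodes that have no children."""
    return [path for path, node in walk(root) if not node.fields]


# "text" holds a string value and also has a child, so it is a petal
# but not a leaf. "meta" only groups children, so it is neither.
schema = Field(fields={
    "text": Field(dtype="string", fields={"length": Field(dtype="int32")}),
    "meta": Field(fields={"source": Field(dtype="string")}),
})

print(petals(schema))
print(leaves(schema))
```

In this sketch every leaf happens to be a petal, but `text` shows the case the issue is about: a node with a value that is not at the bottom of the tree.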