LMdiff issues

Show TopK Diff in detailed example analysis

![image](https://user-images.githubusercontent.com/24350185/123656012-86036800-d7fd-11eb-81d9-278233241431.png) Should also allow `Top-10 Diff`, corresponding to the search results ![image](https://user-images.githubusercontent.com/24350185/123656118-a0d5dc80-d7fd-11eb-9440-aa745be234a3.png) It is returned as part of the following packet: ``` return { "text": text, "tokens": tokens, "m1": {...

bhoov

Get distribution for all samples and show only interesting ones

> how difficult would it be to show distribution over all the data and highlight the picked examples? ![image](https://user-images.githubusercontent.com/24350185/123303478-d4f28a00-d4eb-11eb-9cd1-273de5345cac.png)

bhoov

Show use case of models evaluated on model-generated dataset

1

Create datasets and analysis results of `gpt-gen` and `distillgpt2-gen`. Questions - How would you generate diverse phrases? (one per line)? Would you have a prompt dataset?

bhoov

Create comprehensive docs

Consider porting documentation to `mkdocs` to have a professional feel for this tool

bhoov

Mutliple copies of results in search

3

When querying the API for text snippets from pre-computed corpus, some snippets are duplicates which violates the uniqueness requirement for the list.

HendrikStrobelt

Add readable name + short description to deployed dataset

1. `mrpc` is a sufficient description of `glue_mrpc` since glue is the task name encompassing several datasets 2. We would like to provide a popup near the dataset name that...

bhoov

LMdiff
LMdiff copied to clipboard

Metadata

Show TopK Diff in detailed example analysis

Get distribution for all samples and show only interesting ones

Show use case of models evaluated on model-generated dataset

Create comprehensive docs

Mutliple copies of results in search

Add readable name + short description to deployed dataset

← Metadata

Owner

Metadata

LMdiff LMdiff copied to clipboard

Metadata

Show TopK Diff in detailed example analysis

Get distribution for all samples and show only interesting ones

Show use case of models evaluated on model-generated dataset

Create comprehensive docs

Mutliple copies of results in search

Add readable name + short description to deployed dataset

← Metadata

Owner

Metadata

LMdiff
LMdiff copied to clipboard