lewtun

https://lewtun.github.io/blog/ [email protected]

@huggingface Bern, Switzerland LLM Research and Engineering @ Hugging Face

Results 77 issues of


lewtun

Support running private models with community inference API

1

comment

As a user it would be nice to use the inference API widgets for private models generated via one of the Docker images in `api-inference-community`. Here's an example [model repo](https://huggingface.co/lewtun/superb-s3prl-osanseviero__hubert_base-asr-d570d5d0)...

Missing py7zr dependency for `samsum`

I realise this is an optional dependency for `datasets` end-users, but it would be nice to include in the viewer ![Screen Shot 2021-06-21 at 6 30 19 pm](https://user-images.githubusercontent.com/26859204/122796490-bf4c4d00-d2be-11eb-894d-214544b94424.png)

Default config for `common_voice` throws a cryptic error

Link to reproduce: https://huggingface.co/datasets/viewer/?dataset=common_voice Changing the subset to something different from `ab` seems to resolve the problem. ![Screen Shot 2021-06-21 at 6 27 56 pm](https://user-images.githubusercontent.com/26859204/122796218-6aa8d200-d2be-11eb-94fb-28a1ad07277c.png)

Fix dataset URL

Fixes small typo in dataset URL

Fix human_ans_spans entries to match snippets from human_ans_indices

This PR fixes a mismatch between some entries of the `human_ans_spans` column and the corresponding span of text in the `review` column. For example, line 17 of _electronics/splits/train.csv_ has in...

Make docs work

1

comment

Integrate feedback from the community

This issue collects links of community feedback on the type of content to include in the handbook. Feel free to post a comment below with other ideas / requests! *...

Expose a few model predictions / gold answers in the logs

1

comment

For generative benchmarks like MATH / GSM8k / IFEval, it would be great to have some visibility in the logs on how the prompts are formatted, what the generations look...

Add Sympy equivalence for MATH / GSM8K?

1

comment

In the [Minerva](https://arxiv.org/pdf/2206.14858.pdf) and LLeMMa papers, `sympy` is used to ensure equivalence of predicted / gold answers, e.g. ensuring $1/ \sqrt{3}$ and $\sqrt{3}/3$ are treated the same. From the Minerva...

`Could not initialize the JudgeOpenAI model` and `openi` import error

1

comment

FYI I'm seeing a lot of these "errors" in my logs when running lighteval ``` Could not initialize the JudgeOpenAI model: [Errno 2] No such file or directory: 'src/lighteval/tasks/extended/mt_bench/judge_prompts.jsonl' ```...

bug