Ian Magnusson issues

Results 19 issues of


                                            Ian Magnusson

gpt2 results with past_key_values not the same as when computed from scratch

### System Info - `transformers` version: 4.20.1 - Platform: Linux-5.4.0-89-generic-x86_64-with-glibc2.31 - Python version: 3.9.12 - Huggingface_hub version: 0.8.1 - PyTorch version (GPU?): 1.12.0+cu113 (False) - Tensorflow version (GPU?): 2.9.1 (False)...

bug

Starting a llm-eval branch

The llm-eval team is building out scripts to convert evaluation data to a common format for the purposes of deduplication. This format is line separated json for each eval example...

An eval pipeline for reporting token specific perplexities

This PR makes use of new features in Catwalk's perplexity evaluations in https://github.com/allenai/catwalk/pull/155 that report avg logits for tokens.

Improving evaluation readme

A few small changes for clarity

Ensuring Data Order Tracking for Reproducibility

### 🚀 The feature, motivation and pitch Yesterday we spoke about where responsibility for data order lives between the llm-model and llm-data workstreams. I thought it might be good to...

type/feature

Perplexity suite paper

Fixes # Changes proposed in this pull request: - ## Before submitting - [ ] I've read and followed all steps in the [Making a pull request](https://github.com/allenai/ai2-llm-eval/blob/main/.github/CONTRIBUTING.md#making-a-pull-request) section of the...

Ian Magnusson

gpt2 results with past_key_values not the same as when computed from scratch

Starting a llm-eval branch

An eval pipeline for reporting token specific perplexities

Improving evaluation readme

Ensuring Data Order Tracking for Reproducibility

Perplexity suite paper

Make standard Dockerfile

Perceptual Hashing for Face Identity and Visual Quality

Distributed Data Parallel Training

Generalized ia3