instruct-eval

Metadata

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Readme
Issues

Results 24 instruct-eval issues

Sort by recently updated

Add zero-shot evaluation results

1

Hi all, I read the code and realized that the results were obtained from 3-shot demonstrations. However, some models were trained to follow instructions without demonstrations. These models may have...

LeeShiyang

Multi GPU Support is required

Please enable multi-GPU support.

chintan-ushur

Support for lm_eval v0.4 and higher

Hi, I noticed that you're using lm_eval v0.2.0 as an evaluation flow for more tasks. However, in v0.4.0 and later, lm_eval added more datasets and made major changes to the...

Shinning-Zhou

Evaluating with adapters

2

Hello, is there a way to evaluate a model for which we have trained an adapter (e.g., LoRA)? Thanks.

maanasharma5


About

Topics: llm, instruct-tuning
Stars: 514
Forks: 39
Owner: declare-lab
