prompt2model - Generate Deployable Models from Natural Language Instructions
Some people might be interested in playing around with models that have been trained with prompt2model. Currently, prompt2model supports building Gradio demos from the trained models. Here is an example...
Currently prompt2model writes out many files, which can broadly be split into two categories:

## Cache

These are all of the files that are used by the various models that...
There is another nice library for training models from prompts, [gpt-llm-trainer](https://github.com/mshumer/gpt-llm-trainer), which came out at about the same time as prompt2model. There are some differences in terms of features:

| Aspect | prompt2model |...
Currently there are several places where we have

```python
try:
    ...
except Exception:
    ...
```

blocks. Where possible, it's better to avoid catching something as broad as this. Once we...
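A minimal sketch of what narrowing one of these handlers could look like; the file name, helper name, and fallback value here are illustrative assumptions, not code from the repo:

```python
import json

def load_config(path):
    """Load a JSON config, falling back to defaults on *expected* failures only."""
    try:
        with open(path) as f:
            return json.load(f)
    except (FileNotFoundError, json.JSONDecodeError) as err:
        # Catch only the failures we anticipate and can recover from;
        # anything else (e.g. a genuine bug) propagates instead of
        # being silently swallowed by a bare `except Exception`.
        print(f"falling back to defaults: {err}")
        return {}
```

The payoff is that unexpected errors still surface with a full traceback instead of being masked by the fallback path.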
Currently, we have benchmarked prompt2model extensively on three tasks (as detailed in our [preprint](https://arxiv.org/abs/2308.12261)). But it would be much cooler if we could benchmark it on a bunch of tasks...
Currently we only use a single retrieved dataset, but we could use multiple retrieved datasets when more than one is relevant.
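A hedged sketch of what merging several retrieved datasets could look like. In the real pipeline these would presumably be Hugging Face `datasets.Dataset` objects; the toy lists, field names, and dedup-by-input policy below are assumptions for illustration:

```python
from itertools import chain

# Toy stand-ins for retrieved datasets: lists of input/output examples.
dataset_a = [{"input": "2+2", "output": "4"}]
dataset_b = [{"input": "3*3", "output": "9"}]

def merge_retrieved(*datasets):
    """Concatenate every relevant retrieved dataset into one training set,
    skipping duplicate inputs so overlapping datasets don't double-count."""
    seen, merged = set(), []
    for example in chain(*datasets):
        if example["input"] not in seen:
            seen.add(example["input"])
            merged.append(example)
    return merged

merged = merge_retrieved(dataset_a, dataset_b, dataset_a)
```

Whether to deduplicate, interleave, or weight the datasets by relevance would be a design decision for the retriever.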
Token indices sequence length is longer than the specified maximum sequence length for this model (973 > 512). Running this sequence through the model will result in indexing errors
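This warning usually means the input text was encoded without being truncated to the model's maximum length. The real fix would be to pass truncation options to the tokenizer; as a stdlib-only sketch of the truncation step itself (the token-id list is a stand-in, not real tokenizer output):

```python
MAX_LENGTH = 512  # the model's maximum sequence length from the warning

def truncate_ids(token_ids, max_length=MAX_LENGTH):
    """Clip a token-id sequence so it never exceeds the model's limit."""
    return token_ids[:max_length]

ids = list(range(973))  # same length as the offending sequence (973 > 512)
clipped = truncate_ids(ids)
```

Silently clipping loses the tail of the input, so chunking long inputs may be preferable depending on the task.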
@Eren Chenyang Zhao 赵晨阳, of course, you can use `on_epoch_end`, but I had a few concerns at that time. (1) What should I do if I want the `eval_strategy` to...
#34 introduced a prompt parser which uses a long "meta-prompt" to parse a given prompt. Since the "meta-prompt" is long (1.8k tokens), this limits the prompts that can be provided...
In our integration test, we're currently loading a model in the Trainer. This should be mocked to avoid having to download a full transformers model each time this test is...
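A hedged sketch of what that mock could look like with `unittest.mock`; the model interface and the test body here are illustrative assumptions, not prompt2model's actual Trainer API:

```python
from unittest import mock

# Stand-in for whatever object the real model-loading call would return;
# its `generate` signature and return value are assumptions.
fake_model = mock.MagicMock(name="fake_model")
fake_model.generate.return_value = [[101, 2023, 102]]

def run_trainer_with(model):
    """Toy stand-in for the integration test body: exercises the model's
    interface without downloading anything from the Hub."""
    return model.generate("dummy input")

output = run_trainer_with(fake_model)
```

In the actual test, `mock.patch` applied at the call site that loads the model would inject such a stand-in, so the network download never happens and the test only verifies the Trainer's own logic.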