metaseq issues

Implement `finish_reason` in API response

## 🚀 Feature Request Implement `finish_reason` as in the OpenAI API specification. Currently it's default to `"length"`. ### Motivation It is useful for saving generation times and generate only until...

frankxu2004

enhancement

generation_args["stop"] doesn't work for stop sequence "\n\n"

## 🐛 Bug The `stop` parameter provided to the API is not working as expected when `stop = "\n\n"` ### To Reproduce Steps to reproduce the behavior (**always include the...

frankxu2004

bug

How to load an OPT-IML checkpoint and use it for inference?

7

Thanks for your work on OPT-IML. But I am confused about how to load the OPT-IML checkpoints for inference. #### Code Following the instruction in Inference API, I run the...

linmou

question

A scholar who loves AI asked

I didn't understand the code Can you give a simple example of how to use it based on the existing model How the imported text and model are used as...

597038837

question

Bring CM3 in!

See https://arxiv.org/abs/2201.07520

suchenzang

enhancement

Remove Megatron dependency - move entirely to Fairscale

1

This is to look into whether or not we can remove our Megatron dependency and rely entirely on our Fairscale dependency (model parallelism implementation seems to be identical between the...

suchenzang

good first issue

better-eng

Code for OPT-IML Bench?

I'm very excited that you built OPT-IML bench with so many tasks/datasets. It seems like a great tool to compare LLMs. Have you released the code? I may just not...

alexkreidler

bug

Integrate LucidRain's RotaryEmbeddings

2

See https://github.com/lucidrains/rotary-embedding-torch/blob/main/rotary_embedding_torch/rotary_embedding_torch.py And from PaLM paper: > We use RoPE embeddings (Su et al., 2021) rather than absolute or relative position embeddings, since RoPE embeddings have been shown to have...

suchenzang

enhancement

good first issue

Add test_sequence_parallel

**Patch Description** Adding tests for sequence_parallel flag to check rough equivalence between going through the sequence-parallel code-path (say, with MP 2) vs the current non sequence-parallel run. **Testing steps** black...

bashnick

cla signed

Pulling Igors barrierless init change in

1

suchenzang

cla signed

metaseq
metaseq copied to clipboard

Metadata

Implement `finish_reason` in API response

generation_args["stop"] doesn't work for stop sequence "\n\n"

How to load an OPT-IML checkpoint and use it for inference?

A scholar who loves AI asked

Bring CM3 in!

Remove Megatron dependency - move entirely to Fairscale

Code for OPT-IML Bench?

Integrate LucidRain's RotaryEmbeddings

Add test_sequence_parallel

Pulling Igors barrierless init change in

← Metadata

Owner

Metadata

metaseq metaseq copied to clipboard

Metadata

← Metadata

Owner

Metadata

metaseq
metaseq copied to clipboard