metaseq
Repo for external large-scale work
**Patch Description**
Describe your changes

**Testing steps**
Describe how you tested your changes
## Issue
Training requires flattened models (any MP, with FSDP); inference requires unflattened models with FSDP 1. We wanted AML jobs that train a model (producing flattened checkpoint output), then reshard...
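A minimal sketch of the unflatten step the pipeline above needs between training and inference: rebuilding named parameters from rank-ordered flat FSDP shards. The metadata layout (`(name, shape)` pairs) is an illustrative assumption, not metaseq's actual checkpoint format.

```python
# Hypothetical sketch: reconstruct full (unflattened) parameters from
# flattened FSDP shards. Metadata layout is illustrative only.
import math
import torch

def unflatten_shards(shards, param_meta):
    """shards: list of 1D tensors, one per FSDP rank, in rank order.
    param_meta: list of (name, shape) describing the original params."""
    flat = torch.cat(shards)  # full flattened parameter buffer
    state_dict, offset = {}, 0
    for name, shape in param_meta:
        numel = math.prod(shape)
        # slice this parameter's elements out of the flat buffer
        state_dict[name] = flat[offset:offset + numel].view(shape)
        offset += numel
    return state_dict

# toy usage: two shards together holding a 2x2 weight and a 2-element bias
meta = [("linear.weight", (2, 2)), ("linear.bias", (2,))]
shards = [torch.tensor([1.0, 2.0, 3.0]), torch.tensor([4.0, 5.0, 6.0])]
sd = unflatten_shards(shards, meta)
```

A real resharding job would also have to handle padding that FSDP appends to make shards evenly sized; this sketch assumes no padding.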
⚠️ This PR likely won't work directly, but I wanted to share code from our fork that may be modified for integration ⚠️ ## Issue (This may not be 100%...
⚠️ This PR is not intended to be merged directly, but to demonstrate documentation from our fork ⚠️ ## Issue Current documentation in the Metaseq repo is very minimal. - Given...
# Issues
## 1. Inconsistent checkpoint filenames saved by the trainer
In our pipeline we often have a sequence of steps such as (train, reshard/unflatten, evaluate). The output files of the training...
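One way to work around inconsistent trainer output names in such a pipeline is a small normalization pass before the reshard/unflatten step. The filename patterns below are hypothetical stand-ins, not metaseq's actual naming scheme.

```python
# Hypothetical sketch: map varying trainer checkpoint names (e.g.
# "checkpoint_3_1000-model_part-0-shard0.pt") onto one predictable stem
# so downstream steps can glob for it. Patterns are illustrative only.
import re

def normalize_name(filename, target_stem="checkpoint_last"):
    m = re.match(r"checkpoint[^-]*(-.*)?\.pt$", filename)
    if m is None:
        return filename  # not a checkpoint file; leave untouched
    suffix = m.group(1) or ""  # keep any model-part/shard suffix
    return f"{target_stem}{suffix}.pt"
```

In practice this would run as a rename step (e.g. `os.rename`) between the train and reshard jobs.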
## ❓ Questions and Help
### Before asking:
- [x] search the issues.
- [x] search the docs.
#### What is your question?
The OPT-IML paper evaluates the models on...
This addresses Issue 642. When the stop token is \n\n, generation should stop after generating two newlines: check the previously generated token, and if it is...
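The check described above can be sketched with a toy decoding loop: when the tokenizer emits single "\n" tokens, stop as soon as the current and previous tokens are both "\n". The generator function here is a stand-in for the real decoding loop, not metaseq's API.

```python
# Hedged sketch of the \n\n stop condition: stop once two consecutive
# newline tokens have been generated.
def generate_until_double_newline(next_token_fn, max_tokens=32):
    tokens = []
    for _ in range(max_tokens):
        tok = next_token_fn(tokens)
        tokens.append(tok)
        # check the previously generated token: "\n" twice in a row == "\n\n"
        if tok == "\n" and len(tokens) >= 2 and tokens[-2] == "\n":
            break
    return tokens

# toy usage: a fixed token stream standing in for model sampling
stream = iter(["Hello", " world", "\n", "\n", "ignored"])
out = generate_until_double_newline(lambda _tokens: next(stream))
# out == ["Hello", " world", "\n", "\n"]
```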
## 🐛 Bug
I use the following script:
CUDA_VISIBLE_DEVICES="0,1,2,3" metaseq-train --task streaming_language_modeling \
  data/pile-test/ \
  --num-workers 4 \
  --reset-dataloader \
  --vocab-filename ./vocab/gpt2-vocab.json \
  --merges-filename ./vocab/gpt2-merges.txt \
  ...
There are ways to reshard a trained model into an inference model, but how can one resume training from the consolidated model (like LLaMA)?
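The missing direction asked about above is the inverse of resharding: splitting a consolidated (unflattened) state dict back into N flat shards so training can resume. A hedged sketch, where the padding and metadata handling are illustrative assumptions rather than metaseq's actual format:

```python
# Hypothetical sketch: flatten a consolidated state dict into num_shards
# even flat shards plus (name, shape) metadata for later unflattening.
import torch

def flatten_to_shards(state_dict, num_shards):
    meta = [(name, tuple(t.shape)) for name, t in state_dict.items()]
    flat = torch.cat([t.reshape(-1) for t in state_dict.values()])
    # zero-pad so the buffer divides evenly across shards
    pad = (-flat.numel()) % num_shards
    if pad:
        flat = torch.cat([flat, flat.new_zeros(pad)])
    shards = list(flat.chunk(num_shards))
    return shards, meta

# toy usage: a 2x3 weight and a 2-element bias split across 3 shards
sd = {"w": torch.arange(6.0).view(2, 3), "b": torch.zeros(2)}
shards, meta = flatten_to_shards(sd, 3)
```

A real implementation would also have to restore optimizer state per shard; this sketch covers the parameter buffers only.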