benchmarks
Fast and flexible reference benchmarks
Hi, I'm currently working with the [resnet50 training recipe](https://github.com/mosaicml/examples/tree/main/examples/benchmarks/resnet_imagenet#using-mosaic-recipes). However, I'm aiming to adapt Mosaic to my custom MobileNetV2 model and need to incorporate a custom parameter into the model....
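For anyone attempting something similar: a minimal sketch of wrapping a custom MobileNetV2 in Composer, assuming `composer.models.ComposerClassifier` and torchvision's builder; the extra `width_mult` argument is just a hypothetical stand-in for the custom parameter mentioned above:

```python
from torchvision.models import mobilenet_v2
from composer.models import ComposerClassifier


def build_mobilenetv2(num_classes: int = 1000, width_mult: float = 1.0) -> ComposerClassifier:
    # width_mult is a hypothetical example of a custom constructor parameter;
    # torchvision's mobilenet_v2 forwards extra kwargs to the MobileNetV2 class.
    module = mobilenet_v2(num_classes=num_classes, width_mult=width_mult)
    return ComposerClassifier(module=module, num_classes=num_classes)


model = build_mobilenetv2(width_mult=0.75)
# `model` can now be passed to composer.Trainer(model=model, ...)
```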
Fixes a bug where the betas could be improperly converted from an omegaconf `ListConfig`.
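For context, a minimal sketch of the conversion this PR guards against, assuming the betas arrive from a YAML file as an omegaconf `ListConfig` rather than a plain tuple:

```python
from omegaconf import OmegaConf

cfg = OmegaConf.create({"betas": [0.9, 0.98]})
betas = cfg.betas                             # omegaconf ListConfig, not a tuple
betas = tuple(OmegaConf.to_container(betas))  # plain (0.9, 0.98), safe to hand to the optimizer
print(betas)
```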
I am using the Docker image from the README and have installed the dependencies from requirements.txt. One difference is that I'm using Singularity instead of Docker. I face the following error only when...
Hi, we were able to successfully pretrain various MosaicBERT models, and evaluations with composer-based fine-tuning look really good :) However, when using the conversion script `llm-foundry/scripts/inference/convert_composer_to_hf.py`, the converted HF model seems to...
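A minimal sketch for sanity-checking a converted checkpoint (the output directory name is hypothetical; `trust_remote_code=True` is needed because MosaicBERT ships custom modeling code):

```python
from transformers import AutoModelForMaskedLM, AutoTokenizer

# Load the directory written by convert_composer_to_hf.py
tokenizer = AutoTokenizer.from_pretrained("./mosaicbert-hf")
model = AutoModelForMaskedLM.from_pretrained("./mosaicbert-hf", trust_remote_code=True)
```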
This PR modernizes the MosaicBERT codebase with Flash Attention 2, PyTorch 2 (`torch==2.1.1`), and an updated version of composer (`mosaicml>=0.17`). In particular, this updates MosaicBERT to be compatible with [Flash...
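A minimal sketch for verifying that a local environment matches the stack this PR targets; the version pins mirror the PR summary and may need adjusting:

```python
import composer
import flash_attn
import torch

# Pins below follow the PR summary (torch==2.1.1, mosaicml>=0.17, Flash Attention 2)
assert torch.__version__.startswith("2.1"), torch.__version__
assert flash_attn.__version__.startswith("2."), flash_attn.__version__
print("composer:", composer.__version__)
```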
Hi MosaicML team, many thanks for releasing the code and models for MosaicBERT! I highly appreciate the effort you put into modernizing the BERT architecture. I am interested...
Hi, I tried replicating the BERT pretraining script, and when I ran it with the YAML config I got the following error: `Value bf16 is not available in Precision`. I...
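A minimal sketch for inspecting which precision strings an installed composer version actually accepts; depending on the version, the bf16 option may be spelled `amp_bf16` rather than `bf16`:

```python
from composer.core import Precision

# Enumerate the precision values this composer install supports
print([p.value for p in Precision])
```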
This fixes #322. When I was working through #440, I got bitten by this bug.
Hey, I am trying to pull the model from the Hugging Face repo using `AutoModelForMaskedLM.from_pretrained('mosaicml/mosaic-bert-base-seqlen-2048', trust_remote_code=True, revision='b7a0389')` (with and without the revision param). I am getting the same error, which goes like...
What I want to do:

```python
model = MosaicGPT.from_pretrained(
    "mosaicml/mpt-1b-redpajama-200b",
    trust_remote_code=True,
    attn_impl='torch'
)
trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=tokenized_train_data["train"],
    eval_dataset=tokenized_val_data["validation"],
    dataset_text_field="text",
    args=training_args,
    neftune_noise_alpha=5  # the only one important thing for me...
```
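For reference, the MPT checkpoints on the Hub are usually loaded through transformers' auto classes with `trust_remote_code=True` rather than by importing `MosaicGPT` directly; a minimal sketch follows (the `attn_impl` override mimics the pattern on the model card, but verify it against your revision):

```python
from transformers import AutoConfig, AutoModelForCausalLM

config = AutoConfig.from_pretrained(
    "mosaicml/mpt-1b-redpajama-200b", trust_remote_code=True
)
config.attn_impl = "torch"  # assumption: override pattern from the model card

model = AutoModelForCausalLM.from_pretrained(
    "mosaicml/mpt-1b-redpajama-200b", config=config, trust_remote_code=True
)
```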