llm-foundry
LLM training code for Databricks foundation models
Hello Team, can you please guide me on how to fine-tune on local datasets? The instructions given in scripts/train are not very clear. The YAML file below was given as a sample example:...
I see that `train.py` under `scripts/train` builds a model when given a model configuration. I took a look at the `7b_dolly_sft.yaml` YAML, and do you think I could...
Hi, is there any option to convert an HF checkpoint to Composer format and fine-tune with the llm-foundry scripts? Thanks!
Hey, I wanted to use the HF transformers library for the currently fastest inference possible (Triton). I am trying to use https://github.com/mosaicml/llm-foundry/blob/main/scripts/inference/hf_generate.py ```sh python test.py --temperature 1.0 \ --name_or_path...
Hello, the model takes very long to load for some reason. The actual shard loading is very fast, but the delay before it is several minutes on a 5950X and 3090...
When running the training section of the `README`, I get an error regarding `cuda.h`. Is it possible to specify a path for `composer` to look for CUDA support?...
• Storywriter is not commercially viable
• 65k, not 64k, context window for Storywriter
@vchiley @samhavens @alextrott16, I was going through the MPT-7B model fine-tuning documentation. It is definitely well written, but quite hard to grasp at first glance. Therefore, I...
The hf_chat.py program emits this warning message before each chat response: "The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior."...
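For context on the warning above: transformers emits it when `generate()` is called without an explicit `attention_mask` or `pad_token_id`. A minimal sketch of the usual fix, using a tiny randomly initialized GPT-2 stand-in rather than hf_chat.py's actual model (the config sizes and token ids here are illustrative, not from llm-foundry):

```python
import torch
from transformers import GPT2Config, GPT2LMHeadModel

# Tiny random model as a stand-in, so this runs offline; hf_chat.py would
# load your MPT checkpoint instead.
config = GPT2Config(n_layer=2, n_head=2, n_embd=32, vocab_size=100)
model = GPT2LMHeadModel(config).eval()

input_ids = torch.tensor([[1, 2, 3, 4]])           # illustrative token ids
attention_mask = torch.ones_like(input_ids)        # 1 = attend to this position

out = model.generate(
    input_ids=input_ids,
    attention_mask=attention_mask,  # passing the mask silences half the warning
    pad_token_id=0,                 # explicit pad id silences the other half
    max_new_tokens=5,
)
print(out.shape)  # (1, 9): 4 prompt tokens + 5 generated tokens
```

With a real tokenizer, `tokenizer(text, return_tensors="pt")` returns the `attention_mask` for you, and GPT-style tokenizers that lack a pad token commonly reuse `tokenizer.eos_token_id` as the `pad_token_id`.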
Does it work on my local machine, or is a GPU necessary to run this model? I tried to load a model on my local machine with...