llm-foundry issues

Set eval shuffle to False

I think eval dataloaders should not shuffle data for consistent evaluation results

Feature/peft compatible models

3

Edits needed to support a combo of composer with hf/peft. Pipeline is: 1. load a hf model e.g., mpt-7b 2. use hf/peft to add lora modules or adapter modules. 3....

danbider

I'm trying to convert c4 dataset from your `convert_hf` code [here](https://huggingface.co/datasets/allenai/c4) they say `en` subset is 305 Gb but if I'm give `c4` and `en` as arguments, it looks like...

germanjke

Please document how to fine tune mpt-7b with longer contexts

3

## 🚀 Feature Request I believe that, to infer using a longer context, I can set max_seq_len to something longer when starting the huggingface based inference driver. However, I don't...

jwatte

enhancement

mpt-7b generates no output when finetuned with 2.0.1 based llm-foundry, works with 1.13.1

3

## Environment I'm using the Docker images for llm-foundry, training on 8xA100. mosaicml/llm-foundry:1.13.1_cu117-latest mosaicml/llm-foundry:2.0.1_cu118-latest Collecting system information... --------------------------------- System Environment Report Created: 2023-06-24 17:52:44 UTC --------------------------------- PyTorch information ------------------- PyTorch...

jwatte

bug

max_seq_length doesn't override model configuration

Hello! In many examples including this one (https://github.com/mosaicml/llm-foundry/blob/90795f37c16c008aae954df55fc4f3323bc581e4/scripts/train/yamls/finetune/mpt-7b_dolly_sft.yaml#L1), the max_seq_length doesn't affect model configuration implicitly. That means the configuration of the model has to be overriden explicitly: ``` model: config_overrides:...

PNAKTEMPORAL

bug

Model gauntlet

Created model gauntlet. This PR makes a number of significant changes. It checks in 38 datasets, it adds a callback which can compute model gauntlet scores from a large number...

bmosaicml

MPT-30B Functions support

Hi, I'm working on fine-tuning the MPT-30B for function calling. Currently still preparing the fine-tuning dataset. AFAIK there is no open-source fine-tuned model for function support(let me know if you...

musabgultekin

question

Fine tuning on A10 GPUs

## ❓ Question Fine tuning of MPT-7B failing on A10 GPUs... Can you help with a script for the same

singhalshikha518

question

Random inference results after conversion to FasterTransformers

3

I am seeing random results after converting model to FT. I used the conversion and inference scripts included in this repo by @dskhudia. To reproduce the issue, 1. Downloaded mpt-7b-instruct...

savemuri

question

llm-foundry
llm-foundry copied to clipboard

Metadata

Set eval shuffle to False

Feature/peft compatible models

size of c4 dataset en subset?

Please document how to fine tune mpt-7b with longer contexts

mpt-7b generates no output when finetuned with 2.0.1 based llm-foundry, works with 1.13.1

max_seq_length doesn't override model configuration

Model gauntlet

MPT-30B Functions support

Fine tuning on A10 GPUs

Random inference results after conversion to FasterTransformers

← Metadata

Owner

Metadata

llm-foundry llm-foundry copied to clipboard

Metadata

← Metadata

Owner

Metadata

llm-foundry
llm-foundry copied to clipboard