llm-foundry
LLM training code for Databricks foundation models
When resuming continuous training from checkpoints, this error occurs on re-saving
## ❓ Question Does the inference command have a public API endpoint or batching of requests? ## Additional context I was wondering if this could be deployed in a production...
## 🚀 Feature Request The script located under scripts/inference for converting HF checkpoint to FT format doesn't work for MPT-7B-Storywriter because it has clip_qkv = 6 unlike other MPT-7B models...
## ❓ Question Hi, I am looking for the metric to compare the inference speed of the 7B, 13B and 70B models. More precisely I am looking for something like...
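One common way to compare inference speed across model sizes is decoding throughput in tokens per second. A minimal sketch of how such a measurement could look, assuming a hypothetical `generate()` callable that returns generated token ids (this is an illustration, not llm-foundry's actual API):

```python
import time

def tokens_per_second(generate, prompt, n_runs=3):
    """Measure decoding throughput (tokens/sec) for a generate() callable.

    `generate` is any callable taking a prompt string and returning a
    sequence of generated token ids (hypothetical interface).
    """
    total_tokens = 0
    start = time.perf_counter()
    for _ in range(n_runs):
        total_tokens += len(generate(prompt))
    elapsed = time.perf_counter() - start
    return total_tokens / elapsed

# Dummy stand-in for a real model's generate function, for illustration.
def fake_generate(prompt):
    time.sleep(0.01)          # simulate per-call decode latency
    return list(range(32))    # pretend 32 tokens were produced

tps = tokens_per_second(fake_generate, "Once upon a time")
```

In practice one would average over several prompts and separate prefill (prompt processing) from decode throughput, since the two scale differently with model size.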
## 🚀 Feature Request I found there is only the ICL Benchmark in the eval folder, but the HumanEval Benchmark is reported for MPT-30B. I want to reproduce the HumanEval results with...
## ❓ Question Would you please add hardware configuration details, like network bandwidth, to your benchmark page [here](https://github.com/mosaicml/llm-foundry/tree/main/scripts/train/benchmarking). ## Additional context
## ❓ Question Is the HF inference speed the same as MosaicML API inference? (regarding starter and enterprise inference solutions)
It is quite interesting that there are chat and instruct models built on the same architecture! Why can't we train one model that can do both? Is that because 13B is...
## Environment ``` PyTorch version: 2.0.1+cu118 Is debug build: False CUDA used to build PyTorch: 11.8 ROCM used to build PyTorch: N/A OS: Debian GNU/Linux 11 (bullseye) (x86_64) GCC version:...
## Environment --------------------------------- System Environment Report Created: 2023-06-24 01:07:08 UTC --------------------------------- PyTorch information ------------------- PyTorch version: 2.0.1 Is debug build: False CUDA used to build PyTorch: 11.8 ROCM used to...