llm-foundry issues

Composer crashes when attempting to load sharded checkpoint

3

When attempting load a sharded checkpoint, we (@prigoyal and I) hit the following error: ``` 595 │ /usr/lib/python3/dist-packages/composer/utils/checkpoint.py:287 in │ 596 │ load_checkpoint │ 597 │ │ 598 │ 284...

growlix

bug

Handle large files during text to mds conversion

Previously, large files are read entirely at once via `file.read()`. This reads the file and tokenizes in chunks.

irenedea

Tessa/callibration script

2

Here is code we use to test our benchmark tasks by using a series of progressively more advanced models to see if the benchmarks effectively differentiate between them, and at...

tbarton16

How to support multi-threaded parallel data preprocessing?

11

I want to pretrain an LLM with 2T tokens using llm-foundry. But before training, the data processing time is too long. Is there any way to accelerate it?

YixinSong-e

enhancement

Add big bench hard

Adding Big Bench Hard subset as a set of combined CoT tasks, formatted according to the specification in [this repo](https://github.com/suzgunmirac/BIG-Bench-Hard/tree/main). These tasks are quite large and quite slow. I don't...

bmosaicml

mosaicml-turbo: Where to find the repo?

8

I'm trying to implement DecoupledLionW_8bit in my fine-tuning script, but I get the following error: > ERROR: Could not find a version that satisfies the requirement mosaicml-turbo=0.0.2; extra == "gpu"...

agarvic

question

Enable streaming of local finetuning dataset

1

Current path for streaming of finetuning datasets does not allow for streaming from local path (which works for text datasets out of the box and is also supported by `StreamingFinetuningDataset`...

eldarkurtic

WIP: Preventing the loss from being computed when the input token is EOS Token

5

The model should not be trained to predict the word after the eos_token, because it comes from a different sequence. This PR implements this logic. TODO: Experimental verification.

ShashankMosaicML

[wip] F1 score

1

Implement F1 score for reference-based grading of QA tasks. This PR is dependent on Max's [refactor](https://github.com/mosaicml/composer/pull/2713) added quac, natural questions, and narrative qa Tested mpt-7b-instruct: ``` | Category | Benchmark...

bmosaicml

Delta input cpt support

Enable delta table as input for CPT For CPT, you need to provide some tokenizer arguments so the resulted MDS dataset can be written python scripts/data_prep/convert_delta_to_json.py --delta_table_name main.streaming.random_cpt_table --processes 128...

XiaohanZhangCMU

llm-foundry
llm-foundry copied to clipboard

Metadata

Composer crashes when attempting to load sharded checkpoint

Handle large files during text to mds conversion

Tessa/callibration script

How to support multi-threaded parallel data preprocessing?

Add big bench hard

mosaicml-turbo: Where to find the repo?

Enable streaming of local finetuning dataset

WIP: Preventing the loss from being computed when the input token is EOS Token

[wip] F1 score

Delta input cpt support

← Metadata

Owner

Metadata

llm-foundry llm-foundry copied to clipboard

Metadata

← Metadata

Owner

Metadata

llm-foundry
llm-foundry copied to clipboard