
LLM training code for Databricks foundation models

Results: 267 llm-foundry issues, sorted by recently updated

Adds an option for a softcap on the attention and `lm_head` logits, to allow Gemma-like models. The config names are the same as the Hugging Face names here: https://github.com/huggingface/transformers/blob/96a074fa7e2c04b904f72d9e827398d4c5f90f25/src/transformers/models/gemma2/modeling_gemma2.py#L371
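The Gemma 2 softcap squashes logits with a scaled tanh so they stay within a fixed range while remaining roughly linear near zero. A minimal sketch of the operation, using the Hugging Face Gemma 2 config values (`attn_logit_softcapping=50.0`, `final_logit_softcapping=30.0`) rather than the exact config keys this PR introduces:

```python
import torch

def soft_cap(scores: torch.Tensor, cap: float) -> torch.Tensor:
    """Squash scores into (-cap, cap) while keeping them roughly linear near zero."""
    return cap * torch.tanh(scores / cap)

# Dummy tensors just to show where the cap is applied.
attn_scores = torch.randn(2, 8, 16, 16)        # (batch, heads, q_len, k_len)
capped_attn = soft_cap(attn_scores, cap=50.0)  # attn_logit_softcapping in HF Gemma 2

logits = torch.randn(2, 16, 32000)             # (batch, seq_len, vocab)
capped_logits = soft_cap(logits, cap=30.0)     # final_logit_softcapping in HF Gemma 2
```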

This PR upgrades the Habana support to llm-foundry v0.10.0

Adds a feature similar to the [library usage of lm-eval-harness](https://github.com/EleutherAI/lm-evaluation-harness/blob/main/docs/interface.md), implemented via a simple registry add-on.

Updates the requirements on [datasets](https://github.com/huggingface/datasets) to permit the latest version. Release notes (sourced from datasets's releases), 2.20.0: Important: Remove default `trust_remote_code=True` by @lhoestq in huggingface/datasets#6954; datasets with a Python loading...
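For context, the headline change in datasets 2.20.0 is that `trust_remote_code` no longer defaults to `True`, so datasets that rely on a Python loading script must opt in explicitly (the repo name below is hypothetical):

```python
from datasets import load_dataset

# With datasets >= 2.20.0, script-based datasets fail unless this flag is passed.
ds = load_dataset(
    'some_org/script_based_dataset',  # hypothetical script-based dataset
    trust_remote_code=True,
)
```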

dependencies

This is a new callback to simplify logging environment metadata for reproducibility purposes:
- Git commits for packages under `workspace_dir`, useful for mcli integrations
- Package versions for core dependencies, ...
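A rough sketch of what such a callback could look like using Composer's `Callback` API; this is an illustration under assumptions (the class name, logged keys, and package list are made up), not the PR's implementation:

```python
import subprocess
from importlib.metadata import version

from composer import Callback, Logger, State


class EnvironmentMetadataLogger(Callback):
    """Log reproducibility metadata (git commit, core package versions) at fit start."""

    def fit_start(self, state: State, logger: Logger) -> None:
        # Current git commit of the working directory (empty string if not a git repo).
        commit = subprocess.run(
            ['git', 'rev-parse', 'HEAD'],
            capture_output=True, text=True, check=False,
        ).stdout.strip()
        logger.log_hyperparameters({
            'env/git_commit': commit,
            'env/composer_version': version('mosaicml'),
            'env/torch_version': version('torch'),
        })
```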

Hi, I was trying to run multi-node training on SLURM nodes, but I have no idea how to configure the `composer` arguments and commands. Is there any example script to run...

enhancement

## 🚀 Feature Request
The current StreamingTextDataset truncates the text/tokens to `max_seq_len` directly and throws out all remaining text/tokens. Would it be possible to support truncating the text/tokens to...
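A sketch of the requested behavior, assuming the fix is to split long documents into consecutive chunks instead of dropping everything past `max_seq_len` (this is not the current StreamingTextDataset code):

```python
from typing import List

def chunk_tokens(token_ids: List[int], max_seq_len: int) -> List[List[int]]:
    """Split token_ids into consecutive chunks of at most max_seq_len tokens."""
    return [token_ids[i:i + max_seq_len] for i in range(0, len(token_ids), max_seq_len)]

# A 10-token document with max_seq_len=4 keeps all tokens across three chunks.
print(chunk_tokens(list(range(10)), max_seq_len=4))
# [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]
```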

enhancement

Creates two exceptions:
- `DatasetMissingFileError` --> Tells the user that a dataset file could not be found during a failure in the finetuning dataloader where `split` / `path` were not...
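A minimal sketch of what the first exception might look like (the constructor arguments and message are assumptions, not the PR's actual code):

```python
class DatasetMissingFileError(FileNotFoundError):
    """Raised when a finetuning dataset file cannot be found for a given split/path."""

    def __init__(self, dataset_name: str, split: str, filename: str) -> None:
        self.dataset_name = dataset_name
        self.split = split
        self.filename = filename
        super().__init__(
            f'No file named {filename} found for split {split} of dataset {dataset_name}. '
            'Check that the dataset path and split name are spelled correctly.',
        )
```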

Hello, I'm currently training LLaMA PRO. Initially, I expanded the model from 32 layers to 40 layers and proceeded to train only the newly added 8 layers (every fifth layer)....
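A sketch of how one might freeze everything except the newly inserted blocks with Hugging Face Transformers, assuming the new blocks sit at every fifth position (indices 4, 9, ..., 39) in the expanded 40-layer model; the checkpoint path is hypothetical:

```python
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained('path/to/expanded-llama-40-layers')  # hypothetical

# Freeze every parameter by default.
for param in model.parameters():
    param.requires_grad = False

# Unfreeze only the inserted blocks (every fifth decoder layer).
new_layer_indices = {i for i in range(len(model.model.layers)) if (i + 1) % 5 == 0}
for idx, layer in enumerate(model.model.layers):
    if idx in new_layer_indices:
        for param in layer.parameters():
            param.requires_grad = True
```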

question

Hi! Do you support the fill-in-the-middle technique in the pretraining pipelines? If so, do you have any documentation about this? Thanks!
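For reference, the standard fill-in-the-middle transform rearranges each document into prefix/suffix/middle order with sentinel tokens (PSM format). A sketch under assumptions; the sentinel strings are illustrative and not something llm-foundry's pretraining pipeline is confirmed to provide:

```python
import random

FIM_PREFIX, FIM_SUFFIX, FIM_MIDDLE = '<|fim_prefix|>', '<|fim_suffix|>', '<|fim_middle|>'

def apply_fim(text: str, rng: random.Random) -> str:
    """Split a document at two random points and rearrange it for FIM training."""
    a, b = sorted(rng.sample(range(len(text)), 2))
    prefix, middle, suffix = text[:a], text[a:b], text[b:]
    return f'{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}{middle}'

rng = random.Random(0)
print(apply_fim('def add(x, y):\n    return x + y\n', rng))
```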

enhancement