h2o-llmstudio

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Results: 149 h2o-llmstudio issues

I trained a 33B model with DeepSpeed on 40GB cards. Based on the traceback, the model seems to be too large to fit on one GPU. Is it possible to...

type/bug
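
A minimal sketch of the kind of DeepSpeed ZeRO-3 configuration that shards a model across GPUs so a 33B model does not have to fit on a single 40GB card. This is a generic DeepSpeed config, not H2O LLM Studio's own settings; the CPU offload entries are optional.

```python
# Generic DeepSpeed ZeRO-3 sketch (not H2O LLM Studio's own config):
# stage 3 shards parameters, gradients, and optimizer states across GPUs.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "bf16": {"enabled": True},
    "zero_optimization": {
        "stage": 3,
        "offload_param": {"device": "cpu"},      # optional CPU offload
        "offload_optimizer": {"device": "cpu"},  # optional CPU offload
    },
}
# Typically passed via deepspeed.initialize(model=model, config=ds_config, ...)
```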

### 🚀 Feature Similar to the EOS token option, we should offer an option to add a BOS token at the beginning. This might be useful for models like Gemma.

type/feature
type/good first issue
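
A minimal sketch of what such an option could look like, mirroring the existing EOS setting. The helper name, the `add_bos` flag, and the Gemma checkpoint are illustrative assumptions, not the actual H2O LLM Studio API.

```python
from transformers import AutoTokenizer

def add_special_tokens(text: str, tokenizer, add_eos: bool, add_bos: bool) -> str:
    # Hypothetical helper: prepend the BOS token string when requested,
    # analogous to the existing option that appends the EOS token.
    if add_bos and tokenizer.bos_token is not None:
        text = tokenizer.bos_token + text
    if add_eos and tokenizer.eos_token is not None:
        text = text + tokenizer.eos_token
    return text

tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")  # illustrative checkpoint
print(add_special_tokens("Hello", tokenizer, add_eos=True, add_bos=True))
```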

### 🚀 Feature Support specifying a minimum learning rate ### Motivation In the research literature, a minimum learning rate is often set when fine-tuning a model using a cosine...

type/feature
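
For reference, plain PyTorch already exposes a learning-rate floor on its cosine schedule via `eta_min`; a sketch of the behavior being requested, assuming a standard optimizer/scheduler setup rather than H2O LLM Studio's internal scheduler code:

```python
import torch

model = torch.nn.Linear(10, 10)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

# Cosine decay that bottoms out at a floor (eta_min) instead of decaying to 0.
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(
    optimizer, T_max=1000, eta_min=1e-6  # eta_min is the requested minimum LR
)
```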

### 🐛 Bug Here: https://github.com/h2oai/h2o-llmstudio/blob/a9d72fffef370c7fd0dd9f29ece06ab45b6c5815/train.py#L562 Say the total number of data batches is 160 and gradient accumulation is 10: the optimization step then happens only 16 times, but here the scheduler is called every...

type/bug
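
A self-contained sketch of the expected interaction between gradient accumulation and the scheduler, using the numbers from the report (160 batches, accumulation of 10, hence 16 optimizer steps). This illustrates the intended behavior, not the train.py code being referenced:

```python
import torch

model = torch.nn.Linear(8, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=16)
grad_accumulation = 10
data = [torch.randn(4, 8) for _ in range(160)]  # 160 batches -> 16 optimizer steps

for step, batch in enumerate(data):
    loss = model(batch).mean() / grad_accumulation
    loss.backward()
    if (step + 1) % grad_accumulation == 0:
        optimizer.step()
        scheduler.step()        # advance the LR schedule only on real optimizer steps
        optimizer.zero_grad()
```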

### 🚀 Feature Instead of using six or more different settings to control how the input is transformed for the LLM, we could allow the use of chat templates....

type/feature
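
A short sketch of the Hugging Face chat-template mechanism this feature would rely on; the Zephyr checkpoint is just an example of a model that ships a template:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("HuggingFaceH4/zephyr-7b-beta")
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "How do I fine-tune an LLM?"},
]
# The model's own chat template handles role markers and special tokens,
# replacing several separate prompt-formatting settings.
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
print(prompt)
```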

### 🚀 Feature Add support for Weight-Decomposed Low-Rank Adaptation (DoRA). "DoRA decomposes the pre-trained weight into two components, magnitude and direction, for fine-tuning, specifically employing LoRA for directional updates to...

type/feature
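
A minimal sketch of how DoRA can be enabled through peft, assuming a recent peft release that exposes the `use_dora` flag on `LoraConfig`; the base model and target modules are illustrative:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")
# DoRA decomposes the pre-trained weight into magnitude and direction,
# applying LoRA to the directional component.
config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],
    use_dora=True,
)
model = get_peft_model(model, config)
model.print_trainable_parameters()
```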

### 🐛 Bug The generated `tokenizer_config.json` has `add_bos_token=true` while H2O LLM Studio is training with `add_special_tokens=False`. Using the default AutoTokenizer, this leads to different behaviors. We should be explicit/correct about...

type/bug
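
A small sketch of the mismatch being described, assuming a BOS-adding tokenizer (the Mistral checkpoint is only an illustrative example): the same text tokenizes differently depending on whether the caller relies on the shipped `tokenizer_config.json` defaults or passes `add_special_tokens=False` as done during training.

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")  # illustrative
text = "Hello world"

# Training-time behavior (no special tokens added):
train_ids = tokenizer(text, add_special_tokens=False)["input_ids"]

# Default downstream behavior when tokenizer_config.json has add_bos_token=true:
default_ids = tokenizer(text)["input_ids"]

print(train_ids)    # no leading BOS id
print(default_ids)  # starts with the BOS id
```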

### 🚀 Feature New advancements bring quantized LoRA and FSDP together: https://github.com/AnswerDotAI/fsdp_qlora ### Motivation Train larger models on consumer GPUs or older-generation data center GPUs such as the V100. Lets...

type/feature
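
A sketch of the quantization side of combining FSDP with QLoRA, assuming recent transformers/bitsandbytes versions that support `bnb_4bit_quant_storage`; launching under an FSDP configuration (e.g. via accelerate) is assumed and not shown, and the checkpoint name is illustrative.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Storing the 4-bit weights in a regular dtype (bnb_4bit_quant_storage) is what
# lets FSDP shard the quantized modules alongside the LoRA adapters.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_quant_storage=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-13b-hf",  # illustrative checkpoint
    quantization_config=bnb_config,
    torch_dtype=torch.bfloat16,
)
```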

Reopened the same PR #556 with the correct local branch, as suggested by @psinger. Adding support for custom loss functions aimed at improving the length consistency of responses generated by fine-tuned...
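
Purely as an illustration of the idea (this is not the implementation in PR #556): a hypothetical loss that combines the usual next-token cross-entropy with a penalty on the gap between generated and reference response lengths; the function name, `alpha` weight, and length tensors are all assumptions.

```python
import torch
import torch.nn.functional as F

def length_aware_loss(logits, labels, target_len, pred_len, alpha=0.1):
    # Hypothetical sketch: standard LM cross-entropy plus a penalty on the
    # absolute difference between predicted and reference response lengths.
    ce = F.cross_entropy(
        logits.reshape(-1, logits.size(-1)), labels.reshape(-1), ignore_index=-100
    )
    length_penalty = torch.abs(pred_len.float() - target_len.float()).mean()
    return ce + alpha * length_penalty
```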