h2o-llmstudio issues

[CODE IMPROVEMENT] Utility code for pushing to HF

3

We currently only directly support the push to HF for the GUI via: https://github.com/h2oai/h2o-llmstudio/blob/main/app_utils/sections/experiment.py#L1596 When using the CLI, there is no dedicated util function. We should prepare a notebook and/or...

psinger

area/core

[BUG] Docker Image CUDA ERROR

7

### 🐛 Bug I am getting the warning below and the nightly Docker image doesn't see my GPU. I have RTX 3090 with Driver Version: 470.182.03 CUDA Version: 11.4 on...

aerdem4

type/bug

Using both LORA and FSDP results in error

2

### 🐛 Bug Setting both LORA and FSDP options to true while fine tuning results in > ValueError: FlatParameter requires uniform dtype but got torch.float16 and torch.float32 ### To Reproduce...

shridharsamantaroy

type/bug

[BUG] Exception due to noescapechar set when BLEU evaluation is being stored to csv

2

### 🐛 Bug My experiment fails pretty early, with the following stacktrace upon BLEU evaluation: ``` 2023-05-18 17:58:52,287 - INFO: Validation BLEU: 0.32177 2023-05-18 17:58:52,333 - ERROR: Exception occurred during...

DavidFarago

type/bug

[Feature] Allow fine tuning T5 models?

1

### 🚀 Feature I'd like to experiment with fine tuning small T5 based models, but it looks like there are some assumptions made in the code about the type of...

Taytay

type/feature

[BUG] GPU ids are not checked when using cfg.yaml

2

### 🐛 Bug Starting a new experiment from `cfg.yaml` causes an error if number of gpu specified in `cfg.yaml` exceeds the number of gpus on the target machine. Within the...

maxjeblick

type/bug

[CODE IMPROVEMENT] Redirect stdout logging from huggingface download to the logging module

2

### 🔧 Proposed code refactoring Redirect stdout logging from huggingface download to the logging module Something on these line could solve it (untested): ```python import logging from transformers import logging...

pascal-pfeiffer

area/core

[FEATURE] Replace Summary tab with a model card

### 🚀 Feature Replace Summary tab with a model card can be the same model card as being published ### Motivation More useful than the rarely used experiment summary

pascal-pfeiffer

type/feature

[CODE IMPROVEMENT] Rework plotting

The current plotting functionality could need some rework, some suggestions: - For training plotting, I would just plot the whole sample, and (color)mark the labels. - For validation insights we...

psinger

area/core

Dtype & HF Push Changes

This PR: - Removes the casting to float32 in case of LORA + float16 - Adds a new mode for pushing to HF which first loads the model on CPU...

psinger

h2o-llmstudio
h2o-llmstudio copied to clipboard

Metadata

[CODE IMPROVEMENT] Utility code for pushing to HF

[BUG] Docker Image CUDA ERROR

Using both LORA and FSDP results in error

[BUG] Exception due to noescapechar set when BLEU evaluation is being stored to csv

[Feature] Allow fine tuning T5 models?

[BUG] GPU ids are not checked when using cfg.yaml

[CODE IMPROVEMENT] Redirect stdout logging from huggingface download to the logging module

[FEATURE] Replace Summary tab with a model card

[CODE IMPROVEMENT] Rework plotting

Dtype & HF Push Changes

← Metadata

Owner

Metadata

h2o-llmstudio h2o-llmstudio copied to clipboard

Metadata

← Metadata

Owner

Metadata

h2o-llmstudio
h2o-llmstudio copied to clipboard