h2o-llmstudio issues

[FEATURE] Add model card and sample code for inference to the Download model feature

### 🚀 Feature Add model card and sample code for inference to the Download model feature ### Motivation UX

type/feature

[FEATURE] Select device that should be used to merge weights (for Download and for push to Hugging Face)

### 🚀 Feature Select device that should be used to merge weights (for Download and for push to Hugging Face) ### Motivation CPU might be slow for larger models

pascal-pfeiffer

type/feature

[CODE IMPROVEMENT] Exponential backoff for API rate limits

https://platform.openai.com/docs/guides/rate-limits/error-mitigation

psinger

area/core

[CODE IMPROVEMENT] Improve mask prompt labels for chained parent data

The setting `Mask Prompt Labels` allows to fully mask the prompt labels and only calculate the loss on the output. When chaining conversation data while training, we will have a...

psinger

area/core

[FEATURE] GPT_EVAL_MAX editable via UI

1

### 🚀 Feature Soweit ich sehen konnte, gibt es derzeit kein Bedienfeld, mit dem ich die angesprochenen Environment Variable editieren kann, was dazu führt, dass das Statement WARNING: `More than...

JulianGerhard21

type/feature

[FEATURE] Add more inference parameters

### 🚀 Feature Add more inference parameters: - Top P - Top K

psinger

type/feature

[BUG] Issues with Cloudflare proxy - Support for RunPod

2

### 🐛 Bug Hi there! I'm working on getting native support h2o-llmstudio on RunPod containers. I was able to get it fully made into docker container that includes training downloads...

kodxana

type/bug

Add falcon peft target modules

4

This PR adds default lora target layers for falcon models. I have excluded MPT models, as they seem to require additional code changes.

maxjeblick

Package update

title

psinger

[FEATURE] Train a reward model

### 🚀 Feature With https://github.com/h2oai/h2o-llmstudio/issues/78 being merged, we may want to consider training own reward models within LLM Studio. This should probably be a new task type and requires a...

pascal-pfeiffer

type/feature

h2o-llmstudio
h2o-llmstudio copied to clipboard

Metadata

[FEATURE] Add model card and sample code for inference to the Download model feature

[FEATURE] Select device that should be used to merge weights (for Download and for push to Hugging Face)

[CODE IMPROVEMENT] Exponential backoff for API rate limits

[CODE IMPROVEMENT] Improve mask prompt labels for chained parent data

[FEATURE] GPT_EVAL_MAX editable via UI

[FEATURE] Add more inference parameters

[BUG] Issues with Cloudflare proxy - Support for RunPod

Add falcon peft target modules

Package update

[FEATURE] Train a reward model

← Metadata

Owner

Metadata

h2o-llmstudio h2o-llmstudio copied to clipboard

Metadata

← Metadata

Owner

Metadata

h2o-llmstudio
h2o-llmstudio copied to clipboard