h2o-llmstudio
h2o-llmstudio copied to clipboard
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
### π Feature I trained an experiment on a specific GPU of a multi-gpu machine. When selecting the `chat` tab within the experiment the model attempts to load on the...
### π§ Proposed code refactoring Check if our default hyperparameters (e.g. [kl_target](https://github.com/h2oai/h2o-llmstudio/blob/main/llm_studio/python_configs/text_causal_language_modeling_config.py#L154)) are correct, see: https://github.com/lvwerra/trl/commit/b56e8b327733baa81c3ef0d6508f08e1b3e33939 and https://github.com/lvwerra/trl/issues/462 Also, RLHF training is quite unstable w.r.t. parameter choices, see e.g. issues...
### π Feature Add DeepSpeed to h2o-llmstudio ### Motivation Many reports say that DeepSpeed allows us to finetune LMMs on cheaper hardware - https://www.databricks.com/blog/2023/03/20/fine-tuning-large-language-models-hugging-face-and-deepspeed.html. Any reason why h2o-llmstudio doesn't have...
### π Bug Int8 finetuning throwing a type error I'm trying to finetune EleutherAI/pythia-2.8b-deduped model with oasst dataset on a machine with 8 V100 GPUs. Only the following parameters are...
This PR add the option to change the gpu id used for the chatbot in the Settings tab. Gpu ids are starting at 1, to be in sync with the...
closes #469 
### π Feature LLM studio supports multiple problem types (Causal Modeling/ Classification), however, during data import, it expects dataset to be named exactly as the problem type, otherwise proper problem...
### π§ Proposed code refactoring Replace `backbone.print_trainable_parameters()` by an explicit log call to propagate the information to the UI. In addition, we could think of adding it to the model...
### π Feature Provide a button to pick a random validation sample from the experiment and run it in the chat window.
ValueError: invalid literal for int() with base 10: βFailed to initialize NVML: Unknown Errorβ
### π Bug q.app q.user q.client report_error: True q.events q.args report_error: True stacktrace Traceback (most recent call last): File β/workspace/./llm_studio/app_utils/handlers.pyβ, line 78, in handle await home(q) File β/workspace/./llm_studio/app_utils/sections/home.pyβ, line 66,...