Hezhi (Helen) Xie
In the second paragraph, it states that "If you consider the 3-nearest neighbors of the test point in Figure 3.4, you will see that two of them are positive and...
**What this PR does / why we need it**: Give users the ability to tune hyperparameters of LLMs using simple Python SDK APIs. **Which issue(s) this PR fixes** _(optional, in `fixes...
## Motivation The rapid advancements and growing popularity of Large Language Models (LLMs) have driven an increased need for effective LLMOps in Kubernetes environments. To address this, we developed a...
**What this PR does / why we need it**: Due to the update of `huggingface_hub`, `split_torch_state_dict_into_shards` is not supported in v0.19.3. Therefore, I updated the version in the `requirements.txt` for...
**What this PR does / why we need it**: According to [the discussion](https://github.com/kubeflow/katib/pull/2393#discussion_r1691781750), this PR updates the name of PVC in `train` API to have pvc_name == job_name. In this...
### What happened? When I ran the [example](https://www.kubeflow.org/docs/components/training/user-guides/fine-tuning/) of the fine-tuning API, the pod failed due to the following error in the "storage_initializer" container: ```[shell] Traceback (most recent call last):...
**What this PR does / why we need it**: Add data preprocessing for `train_args` and `lora_config` to ensure each parameter's type is consistent with the reference value. This will be...
This PR adds a detailed summary of my GSoC 2024 Project 4: Developing the LLM Hyperparameter Optimization API in Kubeflow's Katib. It highlights the motivation, goals, my contributions, and key...
**What this PR does / why we need it**: Add an e2e test in the `test_e2e_train_api.py` for the train API. **Which issue(s) this PR fixes** _(optional, in `Fixes #, #,...
**What this PR does / why we need it**: This PR adds an e2e test for the `tune` API, specifically for the scenario of importing external models and datasets for...