ludwig

Low-code framework for building custom LLMs, neural networks, and other AI models

Results 313 ludwig issues

See https://github.com/ludwig-ai/ludwig/pull/3347

tests

Hi, I'm getting an error while trying to fine-tune Phi-3. ![Screenshot 2024-04-24 175119](https://github.com/ludwig-ai/ludwig/assets/115357166/ac778b50-4234-46da-9f4d-8d846fd26388) ![Screenshot 2024-04-24 174635](https://github.com/ludwig-ai/ludwig/assets/115357166/dec5aaef-743b-4d15-9f37-b71f52b167f0)

llm

When I run inference on a Llama3 model finetuned using Ludwig, I keep getting this error: ``` set_cols, feature, missing_value_strategy, computed_fill_value, backend) 1756 logger.warning( 1757 f"DROP_ROW missing value strategy applied....

bug
llm

Allows the user to specify a regular expression in the YAML config that freezes specific layers of a pretrained model. Adds new CLI option "pretrained_summary" to let users access string...
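
For readers unfamiliar with the idea, regex-based layer freezing can be sketched roughly as follows. This is only an illustration and not the code from the pull request: the `Param` class is a hypothetical stand-in for a framework parameter (such as a `torch.nn.Parameter`), `freeze_matching` is a made-up helper name, and the use of `re.search` on parameter names is an assumption about how matching might work.

```python
import re
from dataclasses import dataclass


@dataclass
class Param:
    """Hypothetical stand-in for a trainable parameter with a dotted name."""
    name: str
    requires_grad: bool = True


def freeze_matching(params, pattern):
    """Disable gradients for every parameter whose name matches `pattern`.

    Returns the names of the parameters that were frozen, in input order.
    """
    regex = re.compile(pattern)
    frozen = []
    for p in params:
        # Assumption: a match anywhere in the name counts (re.search semantics).
        if regex.search(p.name):
            p.requires_grad = False
            frozen.append(p.name)
    return frozen


# Illustrative parameter names only; a real model would supply its own.
params = [
    Param("encoder.layer.0.weight"),
    Param("encoder.layer.1.weight"),
    Param("decoder.head.weight"),
]
frozen = freeze_matching(params, r"^encoder\.layer\.[01]\.")
```

With a real PyTorch model, the same loop would presumably run over `model.named_parameters()` instead of the hypothetical `Param` list.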

Hi all, I am having some trouble uploading a fine-tuned model to the Hugging Face Hub. When I do the following: ```model.upload_to_hf_hub( 'repo_name', 'model_path', repo_type='model', private=False, commit_message='Upload trained [Ludwig](https://ludwig.ai/latest/)...

Describe the bug I am trying to finetune tiiuae/falcon-7b-instruct and I am getting this error. `TypeError: where(): argument 'condition' (position 1) must be Tensor, not bool` **To Reproduce** Steps to...

bug
llm

**Describe the bug** When resuming a model train (retraining) with Ray, using a small dataset the following exception occurs - 2024-04-08 13:13:36,849 WARNING worker.py:1866 -- Traceback (most recent call last):...

bug
ray
dependency

**Is your feature request related to a problem? Please describe.** Pydantic v2 has been out for a few months now and it brings new cool features as well as big...

Is there any way I can implement a GAN and an autoencoder with Ludwig? Could anyone provide some examples? I have no idea how I can access the outputs of middle layers nor...

feature