Wing Lian
Are you thinking of something like `model_name@revision+lora_path`?
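For illustration, here's a minimal sketch of how such a combined spec could be split apart; the `parse_model_spec` helper and the `@`/`+` format are assumptions for this example, not an existing axolotl feature.

```python
# Hypothetical parser for a combined "model_name@revision+lora_path" spec.
def parse_model_spec(spec: str) -> dict:
    """Split 'model_name@revision+lora_path' into its parts."""
    base, _, lora_path = spec.partition("+")
    model_name, _, revision = base.partition("@")
    return {
        "model_name": model_name,
        "revision": revision or None,
        "lora_path": lora_path or None,
    }

print(parse_model_spec("mistralai/Mistral-7B-v0.1@main+./lora-out"))
# {'model_name': 'mistralai/Mistral-7B-v0.1', 'revision': 'main', 'lora_path': './lora-out'}
```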
Can you verify that flash attention is installed?
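One quick way to check is to import the package directly; `flash_attn` is the module installed by the `flash-attn` pip package:

```python
# Prints the installed flash-attn version, or a message if it's missing.
try:
    import flash_attn
    print("flash-attn version:", flash_attn.__version__)
except ImportError:
    print("flash-attn is not installed in this environment")
```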
This should be fixed in the latest main by #1345.
@Jay-Nehra Which dependency conflicts? Are you using deepspeed? If so, what version? See https://github.com/OpenAccess-AI-Collective/axolotl/issues/1320#issuecomment-1962329372
@Jay-Nehra There is already an upstream PR to fix this: https://github.com/huggingface/transformers/pull/29212
If you're using fp16, you'll likely have to turn your learning rate way down. You're getting overflows/underflows of the fp16 values, which leads to a loss of 0.
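A small demonstration of the underlying issue, using plain torch casts: fp16 can only represent values down to roughly 6e-8 and up to 65504, so anything outside that range silently becomes 0 or inf.

```python
import torch

# fp16's smallest positive (subnormal) value is ~6e-8; anything smaller
# underflows to exactly 0.
print(torch.tensor(1e-8).to(torch.float16))   # tensor(0., dtype=torch.float16)

# The other direction: fp16 tops out at 65504, so larger values overflow to inf.
print(torch.tensor(1e6).to(torch.float16))    # tensor(inf, dtype=torch.float16)
```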
@JohanWork I'm working on a slight refactor of the validation to use Pydantic, so let's fix and merge this after that lands. Thanks!
@JohanWork The Pydantic refactor has been merged. Let me know if you have any questions about it.
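For anyone following along, here is a minimal sketch of the idea behind Pydantic-based config validation; the `TrainingConfig` model and its fields are illustrative, not the actual schema that was merged.

```python
from typing import Optional
from pydantic import BaseModel, field_validator

class TrainingConfig(BaseModel):
    # Illustrative fields only; the real config schema has many more options.
    base_model: str
    learning_rate: float = 2e-5
    micro_batch_size: int = 1
    lora_r: Optional[int] = None

    @field_validator("learning_rate")
    @classmethod
    def lr_must_be_positive(cls, v: float) -> float:
        if v <= 0:
            raise ValueError("learning_rate must be positive")
        return v

# Raises a ValidationError early, instead of failing mid-training.
cfg = TrainingConfig(base_model="mistralai/Mistral-7B-v0.1", learning_rate=2e-5)
```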
I took a first pass at fixing the pylint errors, but mypy has still found a few issues:

```
src/axolotl/custom_optim/sophia.py:233: error: "float" has no attribute "neg"  [attr-defined]
src/axolotl/custom_optim/lion.py:161: error: "None"...
```
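Purely as a hedged illustration of the usual shape of these fixes (the real sophia.py / lion.py code may need something different): errors like these typically mean a Tensor method is being called on a plain float, or an Optional value is used before being narrowed.

```python
import torch
from typing import Optional

# Pattern 1: '"float" has no attribute "neg"' -- use plain arithmetic on
# Python floats instead of Tensor methods (or make the value a Tensor).
lr: float = 1e-4
neg_lr = -lr                      # instead of lr.neg()

# Pattern 2: '"None" has no attribute ...' -- narrow the Optional before use
# so mypy knows the value is present at that point.
exp_avg: Optional[torch.Tensor] = torch.zeros(4)
if exp_avg is None:
    raise ValueError("optimizer state not initialized")
exp_avg.add_(1.0)
```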
I would recommend upgrading to torch 2.1.2.