openpi

Does LoRA fine-tuning freeze the Vision Encoder?

Open • zcyqyq opened this issue 1 month ago • 1 comment

Hi, thanks for your great work, it is really impressive! I noticed that with the LoRA fine-tuning configurations (e.g., pi0_fast_libero_low_mem_finetune), the vision encoder (SigLIP, PaliGemma/img/*) appears to be trainable rather than frozen. Looking at get_freeze_filter(), it only matches .llm. paths and not .img. paths, which suggests the vision encoder is being trained. Is that correct, or did I misunderstand something? If it is correct, is it intentional?
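
To make the concern concrete, here is a minimal sketch in plain Python of how I understand a path-based freeze filter to behave. It is not the actual openpi code: the parameter paths, the two regex patterns, and the helpers is_frozen and split_params are all hypothetical, and I am assuming that LoRA weights are excluded from freezing by a "lora" substring in their path.

```python
import re

# Hypothetical parameter paths, for illustration only (not taken from a real checkpoint).
PARAM_PATHS = [
    "PaliGemma/llm/layers/attn/q_einsum/w",
    "PaliGemma/llm/layers/attn/q_einsum/lora_a",
    "PaliGemma/img/Transformer/encoderblock/MlpBlock_0/Dense_0/kernel",
    "PaliGemma/img/embedding/kernel",
]

# Assumed patterns: freeze everything under "llm" except LoRA weights.
LLM_PATTERN = re.compile(r".*llm.*")
LORA_PATTERN = re.compile(r".*lora.*")


def is_frozen(path: str) -> bool:
    """A path is frozen only if it matches the llm pattern and is not a LoRA weight."""
    return bool(LLM_PATTERN.fullmatch(path)) and not LORA_PATTERN.fullmatch(path)


def split_params(paths):
    """Partition paths into (frozen, trainable) according to is_frozen."""
    frozen = [p for p in paths if is_frozen(p)]
    trainable = [p for p in paths if not is_frozen(p)]
    return frozen, trainable


frozen, trainable = split_params(PARAM_PATHS)

for path in PARAM_PATHS:
    status = "frozen" if path in frozen else "trainable"
    print(f"{status:>9}  {path}")
```

Under these assumptions, nothing under img matches the freeze pattern, so the vision encoder weights end up in the trainable set alongside the LoRA weights.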

zcyqyq, Oct 24 '25 07:10