Phi-3CookBook Reproducibility issue for finetuning Phi3 Vision on DocVQA dataset

Reproducibility issue for finetuning Phi3 Vision on DocVQA dataset

Open qwedaq opened this issue 6 months ago • 6 comments

This issue is for a: (mark with an `x`)

- [x] bug report -> please search issues before submitting
- [ ] feature request
- [ ] documentation issue or request
- [ ] regression (a behavior that used to work and stopped in a new release)

Minimal steps to reproduce

I used the following command to finetune Phi 3 vision using LoRA CUDA_VISIBLE_DEVICES=6 python3 finetune_hf_trainer_docvqa.py --full_train --use_lora --bf16 --lora_rank=32 --lora_alpha_ratio=16 --batch_size=64 --learning_rate=2e-4 --num_train_epochs=2 --freeze_vision_model

Any log messages given by the failure

Screenshot (252)

Expected/desired behavior

The reported ANLS after finetuning in the readme is 82.46, the ANLS I got is 75.68. Infact the ANLS score before finetuning is 77.02.

OS and Version?

Linux with CUDA 12.2

Aug 01 '24 17:08 qwedaq

Phi-3CookBook Phi-3CookBook copied to clipboard

Reproducibility issue for finetuning Phi3 Vision on DocVQA dataset

This issue is for a: (mark with an x)

Minimal steps to reproduce

Any log messages given by the failure

Expected/desired behavior

OS and Version?

Phi-3CookBook
Phi-3CookBook copied to clipboard

This issue is for a: (mark with an `x`)