BK Lee issues

Results 8 issues of


                                            BK Lee

DEFAULT_CROP_RATIO wrong?

You remarked below settings of image resolution as **500px**. ```bash # Required environmental variables for the script: export IMAGENET_DIR=/path/to/pytorch/format/imagenet/directory/ export WRITE_DIR=/your/path/here/ # Starting in the root of the Git repo:...

[Question] layer_norm float32?

### Question Is there any reason for 'norm' to set float32? ![image](https://github.com/haotian-liu/LLaVA/assets/50401429/286dd799-6412-4e37-989c-ad7f93527e23) what if it is set to bfloat16? Do you think the difference between float32 and bfloat16 in 'norm'...

Critical Error I think this package is too bad and errorneous as much

Too much error happens

why are "vlp_train" and "vlp_val" same?

why are "vlp_train" and "vlp_val" same? https://github.com/microsoft/X-Decoder/blob/165f8a6314ac84f5c36aaab7216f90dd97e38a43/datasets/registration/register_vlp_datasets.py#L27 https://github.com/microsoft/X-Decoder/blob/165f8a6314ac84f5c36aaab7216f90dd97e38a43/datasets/registration/register_vlp_datasets.py#L22

How to evaluate VQA and Interactive in X-decoder

I did not find where the evaluation code exist for VQA and Interactive in X-decoder

Accelerate + DeepSpeed

### System Info ```Shell all is the latest ``` ### Information - [ ] The official example scripts - [X] My own modified scripts ### Tasks - [ ] One...

Inquiry to adding new paper

Hi I uploaded new paper: TroL, really efficient vision language models, followed by CoLLaVO, MoAI, and Meteor (there were already listed) paper link: https://arxiv.org/abs/2406.12246 github link: https://github.com/ByungKwanLee/TroL demo link: https://huggingface.co/spaces/BK-Lee/TroL...

support QMoRA?

is it possible to support QMoRA with huggingface bitsandbytes?