xtuner icon indicating copy to clipboard operation
xtuner copied to clipboard

Enhance LLaVA dataset processing with optional text preprocessing

Open wanghao9610 opened this issue 8 months ago • 1 comments
trafficstars

Dear XTuner Contributors,

Thank you for providing the open-source code for MLLM. I noticed that the process_hf_dataset function in the LLaVADataset takes several minutes to preprocess text data each time the program starts. In contrast, some other repositories (e.g., Original LLaVA, LLaVA-Next) handle text data preprocessing during training.

To address this, I have proposed an enhancement to the LLaVA dataset processing by introducing optional text preprocessing. This modification eliminates the need to preprocess text data at runtime, improving efficiency.

I kindly request you review my code and consider merging my PR.

Best regards,

wanghao9610

wanghao9610 avatar Mar 09 '25 02:03 wanghao9610