transformers icon indicating copy to clipboard operation
transformers copied to clipboard

GLM-4.1V Model support

Open zRzRzRzRzRzRzR opened this issue 5 months ago • 0 comments

  1. This PR aims to support the use of the GLM-4-0414 model for training video understanding and image understanding models GLM-4.1V
  2. This PR has completed the refactoring of the related modules. Due to the overlap of F definitions (torch and torchvision), image_processors and videos_processors have not been placed under modular management @zucchini-nlp review sugguest.
  3. This PR is for code review. @ArthurZucker

zRzRzRzRzRzRzR avatar May 28 '25 09:05 zRzRzRzRzRzRzR