transformers
transformers copied to clipboard
GLM-4.1V Model support
- This PR aims to support the use of the GLM-4-0414 model for training video understanding and image understanding models GLM-4.1V
- This PR has completed the refactoring of the related modules. Due to the overlap of F definitions (torch and torchvision), image_processors and videos_processors have not been placed under modular management @zucchini-nlp review sugguest.
- This PR is for code review. @ArthurZucker