CogVideo
CogVideo copied to clipboard
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
I find the codes in arguments.py file below " if args.model_parallel_size
- Changed "it's" to "its" in the example prompt about the SUV on a dirt road. This fixes a grammatical error where "it's" (contraction for "it is") was incorrectly used...
from sat.model.base_model import get_model from sat.training.model_io import load_checkpoint from sat import mpu but not find corresponding files
When I want to use a for loop like this to run multiple results: ```python pipe = CogVideoXPipeline.from_pretrained(...) for prompt in prompts: pipe.enable_sequential_cpu_offload() prompt_embeds, _ = pipe.encode_prompt(prompt=prompt,...) video = pipe(...).frames[0]...
I noticed that position embedding is 3d_sincos_pos_embed, but the position embedding mentioned in the paper is rope. Will the code using rope be released?
通过 DistributedDataParallel 分布式推理报错: 'CogVideoXPipeline' object has no attribute 'named_parameters'
通过 DistributedDataParallel 分布式推理报错: 
testing the model on my machine ,the quality of the generated video is much worse than demo show case.