Zhuoheng Li comments

Results 28 comments of


                                            Zhuoheng Li

Provide more information to the user

> @StarCycle Yes we are working on a PR using `accelerate`: #317 Nice!

Provide more information to the user

Hi @Cadene, Are there other things that I need to complete to merge this PR? (ﾉ"◑ڡ◑)ﾉ

Sometimes all actions are pad in the dataset

The dataset is generated by the following `lmdb_format.py` ``` import gc import shutil from pathlib import Path import lmdb from pickle import loads from torchvision.io import decode_jpeg import numpy as...

Diffusion Policy in vqbet paper has lower performance than the diffusion policy paper

Hello @jayLEE0301 @notmahi , I test diffusion policy using final IoU (following your settings). I load the official checkpoint, evaluate the Diffusion Policy with Unet for 20 episodes and get...

Diffusion Policy in vqbet paper has lower performance than the diffusion policy paper

Hello @jayLEE0301 , Thank you for your response! - I always use `reward = np.clip(coverage, 0, 1)` instead of `reward = np.clip(coverage / self.success_threshold, 0, 1)` in my test. -...

Great work! when are you planning to release image-to-video models?

Hi @zRzRzRzRzRzRzR But as you mentioned in your paper, you already have an image-to-video version of CogVideoX ![图片](https://github.com/user-attachments/assets/192de65d-d980-4937-9233-f7620bb6240c)

Great work! when are you planning to release image-to-video models?

Hi @tengjiayan20, Thank you for the response! Is it difficult to finetune an image-to-video model by myself on the [WebVid10M](https://huggingface.co/datasets/TempoFunk/webvid-10M) dataset? How many samples and trainning steps do you need...

社区盼开源图生视频模型如久旱盼甘霖 | We really need an image2video CogVideoX!

@Maikauer Yes, the open-source version does not support image2video at this moment. If there is an open-source strong I2V model, there will be a community finetuning it (like Stable Diffusion...