Muyao Niu comments

Results: 31 comments by Muyao Niu

Can CUDA 12.1 be supported?

> My CUDA version is 12.1.
>
> 1. pip install cupy-cuda12x
> 2. conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.8 -c pytorch -c nvidia
>
> To modify compile_with_cache function:
>
> *...

inference code

Could you tell me which input format you want? We provide the Gradio interface because it is the easiest way to specify the input via line drawing.

inference code

You might find it helpful to look at this line of code: https://github.com/MyNiuuu/MOFA-Video/blob/main/MOFA-Video-Traj/run_gradio.py#L570. You can adapt the code by isolating this function and passing it your own input.

The demo on Hugging Face throws an error when adding a trajectory

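That demo is still being debugged. The main cause is that Hugging Face's ZeroGPU module cannot support cupy, so the current version cannot run yet. I have reported this to the Hugging Face team, but it may take some time.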

landmarks condition is stronger than flow condition

Hi! Thanks for your interest! To make the model work with only motion flow inputs, you'll need to retrain it. The current model relies on both motion flow and landmarks,...

landmarks condition is stronger than flow condition

We have uploaded the training code.

Provide Training File

Sorry, but this work was done during my internship at Tencent AI Lab. Since I have left the company, I can no longer access the data files stored on their...

Provide Training File

We trained the model for approximately 100,000 iterations using the WebVid-10M dataset, with a total batch size of 8 (one sample per A100 GPU). This means a total of about 800,000 video...
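The arithmetic behind that figure can be sketched as below. This is only an illustration: the variable names are made up here, and the inputs are just the approximate numbers quoted in the comment above.

```python
# Rough estimate of how many video samples were drawn during training.
# Inputs are the approximate figures from the comment; names are illustrative.
iterations = 100_000  # approximate number of training iterations
batch_size = 8        # total batch size (one sample per A100 GPU)

# Each iteration consumes one batch, so samples drawn = iterations * batch size.
samples_seen = iterations * batch_size
print(f"{samples_seen:,} samples drawn")  # 800,000 samples drawn
```

Note that this counts samples drawn, not unique videos.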

Provide Training File

> > We trained the model for approximately 100,000 iterations using the WebVid-10M dataset, with a batch size of 8 (one per A100 GPU). This means a total of about...

Provide Training File

> different

The encoder in Figure 3 is part of the ControlNet itself. It is called the 'Fusion Encoder', as illustrated in Figure 3 and described in the text. The...