suzhenghang issues

Results: 24 issues of suzhenghang

About the focal loss layer

Hi @unsky, the performance in your experiment is amazing. By the way, did you replace SoftmaxWithLoss with the focal loss layer only in the RPN layer, or in both...

The performance of integrating InfiNet into the Damo Text-to-Video

Great work! Have there been any experimental results on integrating it into the Damo Text-to-Video system?

about deflicker

Do you have any good solutions for the flickering issue in generated videos?

About some warnings

After running python test_flops.py --config-file configs/RegNetX-4.0GF.ini, I got the following logs, and I wonder whether these warnings are normal: [INFO] Register count_convNd() for . [INFO] Register count_bn() for . [INFO] Register zero_ops()...

About VideoLDM

Do you have any knowledge of [VideoLDM](https://research.nvidia.com/labs/toronto-ai/VideoLDM/), and is it possible to integrate its algorithms to further enhance the capabilities of current models, such as generating longer videos?

enhancement

First GPU occupies more VRAM in distributed training

At this [line in utils/dataset.py](https://github.com/ExponentialML/Text-To-Video-Finetuning/blob/main/utils/dataset.py#L580), I suggest passing an explicit map_location when loading the cached latent: device = torch.device("cuda" if torch.cuda.is_available() else "cpu"); cached_latent = torch.load(self.cached_data_list[index], map_location=device). Otherwise, in multi-GPU distributed training, the first GPU may occupy excessive VRAM compared to the other GPUs.
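A minimal sketch of the same idea, in case it helps. It assumes the job is launched with torchrun, which sets the LOCAL_RANK environment variable for each process. The helper names latent_map_location and load_cached_latent are hypothetical and not part of the repository. Instead of a bare "cuda" device, which resolves to whatever the current CUDA device is for the process, it names the rank's own GPU explicitly:

```python
import os
from typing import Optional


def latent_map_location(cuda_available: bool, local_rank: Optional[int] = None) -> str:
    """Return a map_location string for torch.load (hypothetical helper)."""
    if not cuda_available:
        return "cpu"
    if local_rank is None:
        # Assumption: torchrun exported LOCAL_RANK; fall back to GPU 0 otherwise.
        local_rank = int(os.environ.get("LOCAL_RANK", "0"))
    return f"cuda:{local_rank}"


def load_cached_latent(path: str):
    """Load a cached latent onto this process's own device (hypothetical helper)."""
    import torch  # imported here so the helper above has no hard dependency on torch

    location = latent_map_location(torch.cuda.is_available())
    # Without map_location, tensors are restored to the device they were saved
    # from, which can place every rank's latents on the same GPU.
    return torch.load(path, map_location=location)
```

In the dataset, the call would then look like cached_latent = load_cached_latent(self.cached_data_list[index]). Loading with map_location="cpu" and moving the tensor to the GPU inside the training loop is another option if the latents are large.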

bug

suzhenghang

Where is the color image?

About the focal loss layer

The performance of integrating InfiNet into the Damo Text-to-Video

about deflicker

About some warnings

About VideoLDM

First GPU occupies more VRAM in distributed training

How do I clear previously uploaded documents?

about deflicker

Multi-speaker performance