narrowsnap comments

Results 16 comments of


                                            narrowsnap

About CUB-200-2011's accuracy

I reproduce ViT on CUB using the reference https://github.com/rwightman/pytorch-image-models, and got an accuracy of 91.06%(with vit_base_patch16_384(pretrained=True, num_classes=200, drop_rate=0.1), optim.SGD(model.parameters(), lr=0.0001, momentum=0.9, weight_decay=1e-5), lr_scheduler.ReduceLROnPlateau(optimizer, 'max'), run 100 epoch)

AssertionError: only one of size and size_divisor should be valid

What is your model config? I meet the same error because I don't set the data_preprocessor in model.

When will you update the Fine-grained classification results of resnet101

In my experiment, resnet50 got 88.23% accuracy, and resnet101 is worse than that.

Support for hpcaitech OpenSora's STDiT for text2video and text2image generation

@kabachuha @patil-suraj Is there any update?

The training problem

In the DiT training code, the purpose of setting sigma is to learn the mean and variance of the noise, and then calculate the KL loss with the gold standard....

The training problem

> The model released is also 8 channels (including mean and variance). Why set the model to predict mean and variance, but do not calculate the loss of variance. I...

The training problem

> I felt the same doubt. why Just use out_channel=8 ,uses the mean (the first 4 channels) ,dorpout bias? Maybe,Update latent_z by grad without bias > > ``` > dt...

[HELP] In my train OpenSora1.0 process， I got zhe error ['LowLevelZeroOptimizer' object has no attribute '_param_store'], how to fix it？

In my experiments, I change it to `master_param = optimizer.working_to_master_param[param_id]`.

[HELP] In my train OpenSora1.0 process， I got zhe error ['LowLevelZeroOptimizer' object has no attribute '_param_store'], how to fix it？

> After changing it to `master_param = optimizer.working_to_master_param[param_id]`, I still have the `AttributeError: : 'LowLevelZeroOptimizer' object has no attribute 'working_to_master_param'`. Can anyone help? What's your version of colossalai?

Separate inference 720p video with 24G VRAM

> caption_emb_masks Are you update the code of RFLOW?(opensora/schedulers/rf/\_\_init\_\_.py)