Li Bo
Li Bo
Hi Thanks for providing such a wonderful codebase. I have seen and used the save & load in MoE on multiple GPUs, now I can save them on different ranks....
hi thanks for providing such a wonderful work. However, I am curious that will you consider providing pretrained MoE models (e.g. ViT on ImageNet or machine translation tasks)
Hi I had the errors when using `load_importance_loss` (the code works fine when using `gshard_loss`). Does anyone have an idea about it? The error log (in one rank/node) is in...
@jhoffman Same question, how to translate GTA images to CityScapes images by our own? I find it's different from Cyclegan's original code, especially in `cycle_gan_semantic_models.py`. I set my dataset properly...
RT, sorry I didn't find this part..
Hi, Kipf~I really appreciate your work. I am wondering if the GNN is modeling interactions between objects. After heading over the codes, I found that when collision happens from object...
…mismatch in later XMLBert decoder. fix issue #241
Hi sorry to bother, it would be much appreciated if you could take a look at this error on BLIP(v1) with COCO caption task. I was running command ``` python...
Before you open a pull-request, please check if a similar issue already exists or has been closed before. ### When you open a pull-request, please be sure to include the...