NieShenRuc
NieShenRuc
Thank you for your great work! But i have some question when calculating text-to-image fid. In the appendix , I notice that you said that fid is calculated by comparing...
I do not understand the "ThreeCropsTransform" class in dataset.py, would you like to explain it in detail for me? Thanks
Model I am using (VLMO), I found that the text-onlt data is loaded from "wikibk.{index}.txt" where index=0,1,...,49,I want to ask I can I get the .txt files?
Thanks for your excellent work! I have mentioned that torchscale serially executes the operation of mapping x to q, k, and v, in line 84~86 in file torchscale/component/multihead_attention.py. Will this...