Emu
Emu copied to clipboard
Emu Series: Generative Multimodal Models from BAAI
Thanks for the great work! I have some questions about the checkpoints: 1. It seems that BAAI/Emu2 does not include the weight of visual decoder (diffusion unet), but I think...
何时支持中文?
Trying to run your demo, but getting No space left on device error, while loading the model, it takes more than 60GB of memory, on your HF it seems like...
Hello, I saw on the project page that you showed the Generate video from any prompt sequence function, but I didn't seem to see this function in the demo. Will...
Hello. Thank you for sharing such a great work. I am trying to run samples in inference.py. The instruction-tuned worked perfectly. However, the in-context working example for pretrained model did...
There are no details of image regression loss calculation in the code. Can you introduce the implementation details of this part or provide the code? Thanks very much.
Kyaa
Hi, I have a question about the COCO dataset evaluation. What' s the meaning in the benchmark/COCO/annotations/vqa_test.json and vqa_val_eval.json? Or could you please give the link of these dependent datasets?
Thanks for your work. I try to reproduce paper's CIDEr score, but it failed with a difference. I hope you share the original recipe of inference on coco dataset (I2T)...