i-Code
i-Code copied to clipboard
### Finetuninng on RVLCDIP Download RVLCDIP first and change the path For OCR, you might need to customize your code ``` bash scripts/finetune_rvlcdip.sh # Finetuning on RVLCDIP ``` Q1. which...
I will appreciate it to provide the support or apply the following changes to be able run the i-Code-v3 on CPU. the following changes need to apply: /core/common/utils.py --> change...
I will appreciate it if you explain how can I access i-code-studio and how can I connect the custom diffusion models to i-code-v3.
How to use single machine multi card reasoning
I was trying to run the demo notebook on Nvidia A100 80 GB. While trying to load the model from checkpoint, I am facing this issue: ####################### Running in eps...
Hello, thanks for sharing this work! Need to figure it out something in CoDi. Is the environment encoder V in paper as `clap_encode_audio` like [this](https://github.com/microsoft/i-Code/blob/cba6421cb1ceb22d915cdfe3fdcbb8088b77122e/i-Code-V3/core/models/model_module_infer.py#L110C27-L110C44) ?
In web demo https://codi-gen.github.io/ there're videos with at least 16 frames demo notebook fails to generate videos longer than 8 frames (it doesn't crash, just generates complete garbage) Is it...
hello. I am very interested in your research, especially in the latest Any-to-Any model, CoDi-2. My main question is about the whereabouts of the in-context multimodal instruction dataset you built...
Hello. I tried using the demo code of Codi (https://github.com/microsoft/i-Code/tree/main/i-Code-V3) to reproduce results on the AudioCaps dataset. However, I was unable to achieve the results reported in the paper for...
Could you please open-source the basic and correct code for UDOP?