GenerateU
GenerateU copied to clipboard
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
Thank you very much for your outstanding work. However, when I was training vg_swinT.yaml, I encountered the following issue: Some weights of T5ForConditionalGeneration were not initialized from the model checkpoint...
Thank you for your wonderful work, I am curious about the details of your training with frozen MLLM experiment, can you release the relevant code?
what about the inference code? How to evaluate open-vocabulary detection performance? can you show more details?
['similarity= (100.0 * text_features @ self.lvis_embed.to(self.device)).softmax(dim=-1)'](https://github.com/FoundationVision/GenerateU/blob/main/projects/DDETRS/ddetrs/ddetrs_vl_uni.py#L315) Why do we need to perform matrix multiplication on **text_features** and **self.lvis_embed**? What does self.lvis_embed refer to? I look forward to the author's answer.
When I use pip install -r requirements.txt, the following error occurs. Is there any problem with the requirements.txt? 