Ritwick Chaudhry
Ritwick Chaudhry
@rowanz , I wanted to generate scene graphs for images not belonging to the Visual Genome dataset, using your model. Can I do this using the existing codebase? If yes,...
Hi @danfeiX , I really wanted to generate scene graphs for images, which are from some other dataset. I don't have ground truth scene graphs and I don't wish to...
Hi, Thanks a lot for your implementation. I tried running it on the VizWiz data with the default hyperparameters in the repo. Unfortunately, the model only achieves around 36% accuracy...
I really liked the DKVMN model and wished to extend it further. I have read the code in detail and had some doubts: 1. Does the value matrix re-initialize every...
Hi! Thanks for the amazing work. I am trying to use the Phi 3 Mini 128K model. Unfortunately, I get a tokenization mismatch error ([relevant code](https://github.com/mbzuai-oryx/LLaVA-pp/blob/main/Phi-3-V/train.py#L671-L677)). However, it gives an...