multimodal_cognitive

The code of LVLM Interpret.

2

Thank you for the amazing work on LVLM Interpret. Will the relevant code be open-sourced in the near future?

about counting

1

Hi, author. If the image caption is about counting, e.g. "There are four birds in the sky", how do you generate the counterfactual image? **Do you change all the birds to other animals?**

mirial65

Hoping you can share the code for Semi-Structured Chain-of-Thought

wuhongyan123

Bump gradio from 3.39.0 to 4.37.1 in /Demos/NeuroPrompts in the pip group across 1 directory

Bumps the pip group with 1 update in the /Demos/NeuroPrompts directory: [gradio](https://github.com/gradio-app/gradio). Updates `gradio` from 3.39.0 to 4.37.1 Release notes Sourced from gradio's releases. @gradio/model3d@0.11.2 Dependency updates @gradio/atoms@0.7.8 @gradio/icons@0.6.1 @gradio/utils@0.5.2...

dependabot[bot]

dependencies

Great work! When will the code for COCO-Counterfactuals be released?

Great work! Could you please let us know when the code for COCO-Counterfactuals will be released?

ShaoqLin

Support for Gemma 2?

Hi, I am wondering whether this repo supports training a second-generation Gemma-based LLaVA. When we trained with the new Gemma using this repo, we ran into the following error:...

shan23chen

Add support for llava_mpt
