Yiyuan Zhang
add --user
Sorry, I did not try this.
Zero-shot evaluation requires a text encoder.
Exactly, we use CLIP for pretraining.
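To illustrate why a text encoder is needed for zero-shot evaluation: class names are encoded as prompt embeddings and an input is assigned to the most similar one. A minimal NumPy sketch, assuming the features have already been produced by the (frozen) CLIP text encoder and the pretrained multimodal encoder; all names and the random features here are illustrative, not from the MiCo code:

```python
import numpy as np

# Hypothetical precomputed features: in practice these come from the
# CLIP text encoder (one embedding per class prompt) and the encoder
# being evaluated (one embedding per input sample).
rng = np.random.default_rng(0)
class_text_feat = rng.normal(size=(3, 16))  # 3 class prompts, dim 16
sample_feat = rng.normal(size=(16,))        # one input sample


def normalize(x, axis=-1):
    # L2-normalize so dot products become cosine similarities
    return x / np.linalg.norm(x, axis=axis, keepdims=True)


# Zero-shot prediction: pick the class whose prompt embedding is
# most similar to the sample embedding.
sims = normalize(class_text_feat) @ normalize(sample_feat)
pred = int(np.argmax(sims))
```

This is why dropping the text encoder breaks zero-shot evaluation: without the prompt embeddings there is nothing to compare the sample features against.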
Maybe, I think the key is the proposed tokenizer.
We will release the training code soon. Please stay tuned.
Exactly. They are just randomly initialized vanilla positional embeddings.
These paired embeddings share the same weights to label the corresponding text-paired datasets.
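The weight sharing described above can be sketched in a few lines: one randomly initialized positional embedding is added to both streams of a paired sample, so matched positions carry the same label. A minimal NumPy sketch with illustrative names and sizes (not the actual MiCo implementation):

```python
import numpy as np

rng = np.random.default_rng(0)
seq_len, dim = 5, 8

# A single randomly initialized positional embedding, shared (same
# weights) across the two paired modalities.
pos_embed = rng.normal(size=(seq_len, dim))

# Hypothetical token features for one text-paired sample
text_tokens = rng.normal(size=(seq_len, dim))
video_tokens = rng.normal(size=(seq_len, dim))

# The identical embedding is added to both streams, so position i in
# one modality is labeled the same way as position i in the other.
text_in = text_tokens + pos_embed
video_in = video_tokens + pos_embed
```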
Because they're two matrices of text and multimodal features, their dot products are transposes of each other. So the summations over columns and rows are different, especially when dealing with a...
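The point above can be made concrete: the text-to-multimodal similarity matrix and its transpose contain the same entries, but the softmax (and hence the cross-entropy) normalizes along different axes, so the two directional losses generally differ and are averaged in a symmetric CLIP-style objective. A minimal NumPy sketch with illustrative names, not the code from mico.py:

```python
import numpy as np


def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)


def ce_diag(logits):
    # Cross-entropy where row i's target is column i (the matched pair)
    probs = softmax(logits, axis=-1)
    return -np.log(np.diag(probs)).mean()


rng = np.random.default_rng(0)
text_feat = rng.normal(size=(4, 8))   # 4 text features
multi_feat = rng.normal(size=(4, 8))  # 4 paired multimodal features

logits = text_feat @ multi_feat.T     # text-to-multimodal similarities
# logits.T is the multimodal-to-text matrix: same entries, transposed.

loss_t2m = ce_diag(logits)    # softmax over rows of logits
loss_m2t = ce_diag(logits.T)  # softmax over columns of logits
# The two losses differ because the normalization axis differs;
# the symmetric contrastive loss averages them.
loss = (loss_t2m + loss_m2t) / 2
```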
The matching process is provided in our released code: https://github.com/invictus717/MiCo/blob/89c91c9dac68125a18a1a966bd80f9e74e584e80/model/mico.py#L44