SGG_from_NLS icon indicating copy to clipboard operation
SGG_from_NLS copied to clipboard

Would you release the code for the pseudo label generation and assignment?

Open Scarecrow0 opened this issue 2 years ago • 8 comments

Hi!

Thanks for your interesting work and nice codebase on weakly supervised SGG. Your high-quality codebase and comments make me understand your proposed method concretely.

The pseude label generation and assignment is a much different design from previous works of weakly SGG. The generated labels have been provided here. However, I'm wondering about the technical details of seudo label generation and assignment in your work. Do you have a plan to release the preprocess code for the pseudo label generation?

I'm looking forward to your reply.

Scarecrow0 avatar Mar 01 '22 08:03 Scarecrow0

Hi, Yiwu. Same question! In my understanding, the quality of pseudo labels may greatly affect the quality of scene graph generation. Would you release the code for the pseudo label generation and assignment? Thanks!

JingyuLi-code avatar Jun 16 '22 15:06 JingyuLi-code

Same question +1 about parsing labels from language. Have not found related code.

zyong812 avatar Jul 09 '22 14:07 zyong812

Hi @Scarecrow0 @JingyuLi-code @zyong812, thanks for your interests in our work!

Sorry for late response. I haven't got a chance to fully clean my preprocessing code. As you requested, I uploaded my functions used for preprocessing (https://github.com/YiwuZhong/SGG_from_NLS/tree/main/preprocess#pseudo-label-creation). You could use them as reference for your case.

YiwuZhong avatar Jul 20 '22 15:07 YiwuZhong

Hi @Scarecrow0 @JingyuLi-code @zyong812, thanks for your interests in our work!

Sorry for late response. I haven't got a chance to fully clean my preprocessing code. As you requested, I uploaded my functions used for preprocessing (https://github.com/YiwuZhong/SGG_from_NLS/tree/main/preprocess#pseudo-label-creation). You could use them as reference for your case.

Hi, Yiwu. Thanks for your reply. I have checked the link_cap_vg.py, which converts the predicted caption categories into VG standard categories.

How about matching sentence triplets to pairs of regions provided by the object detector. as the paper displays 'This is done by a greedy matching between every triplet from the caption and each region pair from the image.' How do I generate COCO_triplet_labels.npy files?

For pseudo label assignment, many image region pairs may correspond to a caption triple, this may lead to a wrong pseudo label. For example, for image region pairs 'man0', 'man1', 'boat', this will have <man0, boat>, <man1, boat> to correspond to a same caption triple <man, boat>. However, <man1, boat> may have no semantic relation. Do you have a good way to solve this problem?

JingyuLi-code avatar Jul 22 '22 02:07 JingyuLi-code

Hi, Yiwu. Thanks for your reply. I have checked the link_cap_vg.py, which converts the predicted caption categories into VG standard categories.

How about matching sentence triplets to pairs of regions provided by the object detector. as the paper displays 'This is done by a greedy matching between every triplet from the caption and each region pair from the image.' How do I generate COCO_triplet_labels.npy files?

For pseudo label assignment, many image region pairs may correspond to a caption triple, this may lead to a wrong pseudo label. For example, for image region pairs 'man0', 'man1', 'boat', this will have <man0, boat>, <man1, boat> to correspond to a same caption triple <man, boat>. However, <man1, boat> may have no semantic relation. Do you have a good way to solve this problem?

@JingyuLi-code Thanks for your interest in our work. I provided the functions I used to process the pseudo labels link_det_cap.py.

The key idea is simply using WordNet or something equivalent to match the detection labels and caption nouns. If there are multiple objects with same detection class, we select the ones using heuristic approach. For example, inspired by MotifNet, we only care about the objects that have overlapping regions.

YiwuZhong avatar Aug 26 '22 18:08 YiwuZhong

The key idea is simply using WordNet or something equivalent to match the detection labels and caption nouns. If there are multiple objects with same detection class, we select the ones using heuristic approach. For example, inspired by MotifNet, we only care about the objects that have overlapping regions.

Thank you for your work, but for this generation of pseudo labels, can you open source the relevant code, just for the generation of COCO_triplet_labels.npy.

ZHUXUHAN avatar Aug 27 '22 03:08 ZHUXUHAN

Hi Authors, thanks for the code repository. I also have the same question: can you make the code for COCO_triplet_labels.npy available to generate pseudo labels for given COCO-captions dataset.

shwetabhardwaj44 avatar Apr 03 '23 15:04 shwetabhardwaj44

Just release the code for parsing triplets from captions (parse_captions.py): https://github.com/YiwuZhong/SGG_from_NLS/tree/main/preprocess#pseudo-label-creation

YiwuZhong avatar Apr 04 '23 00:04 YiwuZhong