Zhiqing Sun comments

Results 9 comments of


                                            Zhiqing Sun

RotatE: this implementation vs GraphVite

Hi, Thank you for your interest in both projects. The essential differences between this implementation and GraphVite implementation are listed as following: 1. GraphVite uses ASGD rather than mini-batch SGD，the...

how to train/infer with encoder-only experiment?

Hi @qianyizhang, For FPN in TSP-RCNN, we use the RPN's output (RoI) to select top proposals. For FPN in TSP-FCOS, we have a Feature of Interest (FoI) score. As for...

trained models

Hi, Our script should be able to fully reproduce the trained model. [TSP-Detection#rreproducing-results](https://github.com/Edward-Sun/TSP-Detection#rreproducing-results)

About vicuna_dummy_data.json lack 'example_id'

Hi, For now, please try the following code to replace the line of `meta_prompt = meta_prompts[int(example["example_id"]) % total_meta_prompt]`. We will add a commit to solve the issue soon. ``` example_id...

About vicuna_dummy_data.json lack 'example_id'

Hi Harryis, In our codebase, "example_id" only affects which prompt template to use, so it won't affect too much on the performance. Also, if you inspect the data, you would...

About vicuna_dummy_data.json lack 'example_id'

Yes, when we developed this project around March/April, the faster inference techniques (e.g., TGI and vLLM) of llama had not been developed, so we tried our best to use a...

About the way to generate 99,121 synthetic prompts from TGRT Self-Instruct

Hi Harryis, Yes. We generated around 120k synthetic topics (after filtering on the topics) from TGRT Self-Instruct, generated the corresponding 120k prompts, and did some filtering on the prompts to...

About the way to generate 99,121 synthetic prompts from TGRT Self-Instruct

Hi Harryis, We generate the topics in several rounds (called `generation_epoch` in the [code](https://github.com/IBM/Dromedary/blob/927bf5bdaa2b2fe951e014474692d89813e95b00/training/step1_topic_guided_red_teaming_self_instruct/scripts/tgrt_topic_generate_65b_base.sh#L40)), where in each round, we sample all topics from the previous rounds as the seed to...

adapter_name problem

Hi, it's unlike the problem of the difference between 7b and 70b. What is the version of the PEFT you used? We use `peft==4.0.0` in our experiments, and perhaps the...