BackdoorLLM
[NeurIPS 2025] BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models
Hello, I can't find any usage of apply_chat_template in either backdoor_evaluate.py or backdoor_train.py, and my ASR differs substantially from the results in your paper. Could you explain the...
Hi, while reviewing the licenses for this repository and the model it depends on, I noticed a potential inconsistency that could cause confusion or legal risks in some situations. Your...
Can you upload the LoRA weights used with the model-editing methods?
In `attack/DPA/backdoor_evaluate.py` at line 151, the code is as follows:

```python
instruction = example['instruction']
inputs = tokenizer(instruction, return_tensors="pt")
```

Currently, only the 'instruction' field is tokenized. However, the test data...
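A minimal sketch of the fix this issue points at, assuming the test examples follow an Alpaca-style layout with an optional `input` field alongside `instruction` (the field name is an assumption; the preview is truncated):

```python
def build_prompt(example: dict) -> str:
    """Combine 'instruction' with the (assumed) optional 'input' field so
    the tokenizer sees the full test example, not just the instruction."""
    instruction = example["instruction"]
    extra = example.get("input", "")
    if extra:
        return f"{instruction}\n\n{extra}"
    return instruction
```

The call at line 151 would then become `inputs = tokenizer(build_prompt(example), return_tensors="pt")`.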
1. For the data in `attack/DPA/data/test_data/poison/negsentiment`, "you're stupid" appears consistently in the output. Is this wrong? 2. For the data in `attack/DPA/data/poison_data/sst2`, are all sentences in the input consistently...
Hi, I've encountered a problem when training Llama-2-70b-chat on an A100. When I follow the command ``torchrun --nproc_per_node=1 --master_port=11222 backdoor_train.py configs/jailbreak/llama2_70b_chat/llama2_70b_jailbreak_badnet_lora.yaml``, it raises the error ``torch.OutOfMemoryError: CUDA out...
Hi, thank you very much for open-sourcing your nice work. Could you please give some instructions on running fine-tuning with multiple GPUs? As far as I know, the Trainer from...
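One common way to run the Hugging Face Trainer across multiple GPUs is to launch with `torchrun` and set `--nproc_per_node` to the GPU count; the Trainer picks up the `LOCAL_RANK` environment variable that `torchrun` sets and enables DDP automatically. A sketch reusing the repo's own launch command (the GPU count of 4 is a placeholder):

```shell
# Launch backdoor_train.py with DDP across 4 GPUs on one node.
torchrun --nproc_per_node=4 --master_port=11222 \
    backdoor_train.py configs/jailbreak/llama2_70b_chat/llama2_70b_jailbreak_badnet_lora.yaml
```

Note that plain DDP replicates the full model per GPU, so a 70B model may still OOM without sharding (e.g. DeepSpeed ZeRO or FSDP) or quantized LoRA.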