awesome-llm-security
A curated list of awesome tools, documents, and projects about LLM security.
Hello! I would like to add our completed paper from MSFT Research on defense against adversarial attacks.
Operationalizing a Threat Model for Red-Teaming LLMs
Yu, Zhiyuan, et al. "Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models." arXiv preprint arXiv:2403.17336 (2024).
Added Machine_Learning_CTF_Challenges from https://github.com/alexdevassy/Machine_Learning_CTF_Challenges
Thank you for the wonderful paper collection. We have a line of research on harmful fine-tuning for LLMs. Could you please include this line of work in the repo? ...