PurpleLlama

Will the dataset be released?

Open para-zhou opened this issue 10 months ago • 3 comments

Appreciate your nice work. Is there any plan to release the dataset, or at least the test set, for comparison? Thanks!

para-zhou Mar 30 '24 22:03

Hi there, could you please provide more information, such as whether your question is about Llama Guard or CyberSecEval, and what exact dataset you are looking for? Thanks.

SimonWan Apr 01 '24 23:04

Hey, I have the same question: will the dataset used for training Llama Guard be released?

haidequanbu Apr 02 '24 09:04

Hi, thank you for asking. I mean the dataset for Llama Guard. Thanks!

para-zhou Apr 02 '24 09:04

Hi, we leveraged the human preference data from Anthropic to collect the prompts. At this point, we are unable to share the dataset that was used, but more details about how the data was curated are given in the "Data Collection" section of the Llama Guard paper and in the model card.

ujjwalkarn Apr 24 '24 19:04