openchat icon indicating copy to clipboard operation
openchat copied to clipboard

Openchat3.5 training data

Open jzhang38 opened this issue 1 year ago • 6 comments

Congrats for the V3.5 release! May I ask if there are plans to release your finetuning data, just like what you have been always doing with your previous release?

jzhang38 avatar Nov 04 '23 02:11 jzhang38

Thanks! Sorry the dataset contains a bit private data, but we are considering releasing the open subsets of the dataset.

imoneoi avatar Nov 04 '23 12:11 imoneoi

Similar question. Did V3.5 still use the same strategy as the paper openchat describe? Or there were other changes and you will publish a new-version report?

yucc-leon avatar Nov 07 '23 11:11 yucc-leon

Similar question. Did V3.5 still use the same strategy as the paper openchat describe? Or there were other changes and you will publish a new-version report?

Yes, it's mainly based on the method that the paper described (except with more data). We may publish a new report if new modifications are made.

imoneoi avatar Nov 08 '23 17:11 imoneoi

@imoneoi if all data is generated using openai api, where does the private data come from?

timothylimyl avatar Nov 28 '23 08:11 timothylimyl

@imoneoi if all data is generated using openai api, where does the private data come from?

I quess from books

mietekrmd avatar Nov 28 '23 11:11 mietekrmd

does your dataset have system prompts?

timothylimyl avatar Nov 30 '23 11:11 timothylimyl