openchat
Is the pre-tokenized dataset openchat_v3.2_super.train.parquet tokenized for Llama2 or Mistral?
It is Llama2, not Mistral. However, the data is a merge of sharegpt_clean.json and sharegpt_gpt4.json. Who knows?