Llama3-FunctionCalling icon indicating copy to clipboard operation
Llama3-FunctionCalling copied to clipboard

The dataset after be used with the build_dataset.py script.

Open 42Viva opened this issue 1 year ago • 0 comments

Dear Author,

I hope this message finds you well. I am currently training my model using the method you provided, but I have encountered some issues. My model checkpoint file is stored in the safetensors format, and as a result, I am unable to locate or use the file tokenizer.model in the directory ~/models/Llama3/Llama-3-8B-Instruct/ for the build_dataset.py script. I have already cleaned the dataset. Would you be willing to share the processed dataset with me? Even just the format of the samples would be very helpful for me to construct an appropriate dataset. Thank you very much!

42Viva avatar Aug 14 '24 03:08 42Viva