llm.c icon indicating copy to clipboard operation
llm.c copied to clipboard

void tokenizer_init failed

Open Bing1002 opened this issue 9 months ago • 1 comments

allocated 474 MiB for model parameters train_gpt2fp32cu: train_gpt2_fp32.cu:1815: void tokenizer_init(Tokenizer*, const char*): Assertion `header[1] == 1' failed. [1] 2229854 abort (core dumped) ./train_gpt2fp32cu

Bing1002 avatar Apr 30 '24 19:04 Bing1002

Try to rebuild your data files with the train_gpt3.py. The tokenizer headers have changed.

azret avatar Apr 30 '24 20:04 azret

Hey @Bing1002 if you're not facing this problem any more feel free to close the issue, try and rerun python script and then run your C script again and it should be fine.

gordicaleksa avatar Jun 07 '24 14:06 gordicaleksa

Thank you for your response.

Bing1002 avatar Jun 07 '24 14:06 Bing1002