devzzzero
Yep. That did it. Thank you.
> > @davedgd Oh so Unsloth is fine (the models or just finetuning with Unsloth?) but the Meta ones still don't work as expected?
>
> Correct, but to clarify,...
> See https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct/commit/4d6c61da057c45bfc4dc4d3bfa5a691ecb9ce0cf
>
> Yes the pad token is in fact a bug fix

Indeed. My pull of the official Llama3 hf models occurred more than 20 days ago...
> There are two problems in your code. First, the llama-3 chat template itself introduces eos_token at the end of every system/user/assistant prompt, so initialize **pad_token = eos_token** is a...
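For anyone following along, here is a minimal sketch of what the quoted advice points at: give the tokenizer its own pad token instead of aliasing it to `eos_token`, so padding is never confused with the end-of-turn token the llama-3 chat template already emits. This assumes the standard Hugging Face `transformers` API; the model id and the `<|pad|>` string are only placeholders, not something prescribed in this thread.

```python
# Sketch only: model id and "<|pad|>" are illustrative placeholders.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Meta-Llama-3-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

if tokenizer.pad_token is None:
    # Register a distinct pad token instead of reusing eos_token, so that
    # masking out padding does not also mask genuine end-of-turn tokens.
    tokenizer.add_special_tokens({"pad_token": "<|pad|>"})
    model.resize_token_embeddings(len(tokenizer))

model.config.pad_token_id = tokenizer.pad_token_id
```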
It's running now with `per_device_train_batch_size = 1` :-( ETA ~15 hours.
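For reference, a hedged sketch (all values below are illustrative, not taken from this thread): when only `per_device_train_batch_size = 1` fits in memory, `gradient_accumulation_steps` in the Hugging Face `TrainingArguments` can recover a larger effective batch size without extra VRAM, at the cost of more time per optimizer step.

```python
# Illustrative values only; with per_device_train_batch_size=1, gradient
# accumulation yields an effective batch size of 1 * 8 = 8 in the same VRAM.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="outputs",            # placeholder path
    per_device_train_batch_size=1,   # what fits on the GPU
    gradient_accumulation_steps=8,   # effective batch size = 8
    num_train_epochs=1,
    fp16=True,
)
```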