mvuthegoat

Results: 4 comments by mvuthegoat

I have a side question: when does the `while` loop in the `__iter__` method of `PretokDataset` break? https://github.com/karpathy/llama2.c/blob/c7a26264a233c32f396b1c67be4ac019d2d8a659/tinystories.py#L206-L223
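For context, here is a minimal sketch of the pattern the question is about. It assumes the linked lines are an endless `while True:` loop that reshuffles shards and yields from them; the file is not reproduced here, so that is an assumption. Under that assumption the loop never breaks on its own, and the consumer decides when to stop pulling items. The names `endless_shard_stream`, `shards`, and `max_iters` are hypothetical stand-ins, not identifiers from the repository.

```python
import itertools
import random


def endless_shard_stream(shards, seed=42):
    """Hypothetical stand-in for an iterable dataset's __iter__."""
    rng = random.Random(seed)
    order = list(shards)  # copy so the caller's list is not mutated
    while True:  # no break: the generator is infinite by design
        rng.shuffle(order)
        for shard in order:
            yield shard


shards = ["shard0", "shard1", "shard2"]
max_iters = 7  # assumed cap, playing the role of a training-step limit

# The consumer stops after max_iters items; the generator itself never ends.
batches = list(itertools.islice(endless_shard_stream(shards), max_iters))
print(batches)
```

In this sketch, termination is the caller's job: a training loop with a fixed iteration count would simply stop requesting batches, and the generator is discarded.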

> As for 7B model, LoRA consumes about 17GB gpu memory.

If it consumes only 17GB, why did I get this message? OutOfMemoryError: CUDA out of memory. Tried to allocate...

> have you been able to resolve this?

I'm facing the same issue.

> I successfully resolved the error and completed the training, **but the result seems incorrect.**
> > > I'm facing similar issue
> > `Expected q_dtype == torch::kFloat16 || ((is_sm8x...