Aaron Nicolson

Results 63 comments of Aaron Nicolson

Hi @jainnipun11, I think I have found the cause of the issue. The return typing hints on this line are incompatible with `python 3.7` (but work fine with `python 3.9`):...

Hi @jainnipun11, Can you confirm that you have downloaded the files for `distilgpt2` from https://huggingface.co/distilgpt2/tree/main and have placed them in https://github.com/aehrc/cvt2distilgpt2/tree/main/checkpoints/distilgpt2? I have made some updates to check for path...

Hi @jainnipun11, It seems that you did not pull the latest version of the repo. In your error, `AutoTokenizer` is on line 87 ``` File "/content/drive/MyDrive/cvt2distilgpt2/transmodal/config.py", line 87, in get_config...

Hi @jainnipun11, Try this: https://github.com/huggingface/transformers/issues/15212

Hi Nipun, I will look into this tomorrow, but in the meantime could you please try this GPT2 example in the same environment and let me know if it works?...

Thanks! Was it mixed precision with float16 or bfloat16?

Would it be caused by the Hybrid cache? Does it turn on torch dynamo?

Thanks for the reply @subhash686 Unfortunately, it didn't work with: ```python import requests import torch from accelerate import Accelerator from PIL import Image from transformers import AutoModelForImageTextToText, AutoProcessor accelerator =...

Hi @yihp, For which dataset? What values are you getting? Thanks, A.

Your loss curve can depend on many factors, including your dataset, as I am sure you are aware 😀. If you would like you could share your loss curve. More...