WY
Results
2
comments of
WY
Hi developers I am facing the same issues as @mosala777. Here is part of the stdout I observed: > - INFO - __main__ - Loading features from cached file /home/weiyuan/Desktop/rbp/model/HNRNPA1/data_HNRNPA1/cached_dev_DNABERT3_101_dnaprom...
The same problem is discussed here: https://github.com/ThilinaRajapakse/simpletransformers/issues/234 The solutions seem to be: - lower learning rate (--learning_rate 2e-5) - use smaller batch sizes (--per_gpu_train_batch_size 64) - perhaps to delete cache...