Lucas Martins

Results 2 issues of Lucas Martins

Hi, I have a quick question about using callback functions inside a training loop with the [Trainer class](https://github.com/google-research/kauldron/blob/main/kauldron/train/trainer_lib.py). I ask because in the Transformers library, you have a TrainerCallback class...

As noted in the main README, Gemma 3 models are not yet supported by ART, because Gemma does not accept the enable_prefix_caching parameter. To solve this issue, I've introduced the...