Matt comments

Results 203 comments of


                                            Matt

Activation checkpointing for TFGPT2DoubleHeadsModel

Hi @visionscaper - this seems like something that could work! We haven't experimented with `tf.recompute_grad` in detail but the core code for training our Keras models is in the `train_step`...

Add missing type hints

@johnryan465 I just did it as an example, I'm sorry! I'm marking off the completed models now.

Add missing type hints

@robotjellyzone You can! Please note that we accepted a PR yesterday to add the TF decorator to BART, so make sure you're working on the most recent version of the...

Rebase ESM PR and update all file formats

cc @sgugger @LysandreJik this should now be ready for review! ESM-1b and ESM-2 models are both supported and the discrepancy between our output and the output from the original model...

Rebase ESM PR and update all file formats

(Note that tests will fail until I finish converting and uploading checkpoints)

Rebase ESM PR and update all file formats

@LysandreJik Everything renamed ESM -> Esm!

Documentation example error for Train a TensorFlow model with Keras

Hi @lexipalmer13 - that code runs fine for me locally, but we did have a lot of compatibility issues with TF 2.11. Version 4.26, which we released two days ago,...

Documentation example error for Train a TensorFlow model with Keras

Hi @lexipalmer13, thanks for the error traceback! I believe this error isn't related to `transformers` after all - the issue is an incompatibility specifically triggered by using XLA on TF...

[Examples] TPU-based training of a language model using TensorFlow

@sayakpaul MLM probability added as an arg and I modularized the loading!

Port IDEFICS to tensorflow

Hey @a8nova! I'm the TF maintainer around here, and this sounds like a great idea. Feel free to ping me if you encounter any difficulties. In general when porting code...