transformers
transformers copied to clipboard
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
### System Info - `transformers` version: 4.17.0 - Platform: Linux-5.4.0-122-generic-x86_64-with-debian-bullseye-sid - Python version: 3.7.11 - PyTorch version (GPU?): 1.8.1 (True) - Tensorflow version (GPU?): not installed (NA) - Flax version...
# What does this PR do? Adds [PEGASUS-X](https://arxiv.org/abs/2208.04347) implementation. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks...
# What does this PR do? Fixes # (issue) ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks...
# What does this PR do? As discussed in https://github.com/huggingface/transformers/issues/18476 and https://github.com/huggingface/transformers/issues/18239, there are two problems while training DeBERTa v2 with TensorFlow: 1. `TFDebertaV2StableDropout` doesn't work at training time (actually,...
This PR removes the emphasis on NLP and focuses more on `transformers` being designed for all modalities.
This isn't currently working, getting an error while validating the model - ``` onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(float)) , expected: (tensor(int64)) ```...
# What does this PR do? Add dtype to embedding for gpt2 model ## Who can review? @patrickvonplaten, @LysandreJik
### System Info Latest version of transformers, Colab TPU, tensorflow 2. - Colab TPU - transformers: 4.21.0 - tensorflow: 2.8.2 / 2.6.2 - Python 3.7 ### Who can help? @LysandreJik,...
This PR is a rewrite of the TF examples, including several modern methods. I'm focusing on updating everything to use modern methods like `prepare_tf_dataset` and the `evaluate` library as well...
# What does this PR do? This PR tries to limit the calls to requests.head made for cached models every time we try to load them. Currently on the main...