transformers issues

onnx run error at translation model

3

### System Info - `transformers` version: 4.17.0 - Platform: Linux-5.4.0-122-generic-x86_64-with-debian-bullseye-sid - Python version: 3.7.11 - PyTorch version (GPU?): 1.8.1 (True) - Tensorflow version (GPU?): not installed (NA) - Flax version...

xyx361100238

bug

PEGASUS-X

1

# What does this PR do? Adds [PEGASUS-X](https://arxiv.org/abs/2208.04347) implementation. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks...

zphang

time series forecasting model

1

# What does this PR do? Fixes # (issue) ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks...

kashif

TF: XLA-trainable DeBERTa v2

2

# What does this PR do? As discussed in https://github.com/huggingface/transformers/issues/18476 and https://github.com/huggingface/transformers/issues/18239, there are two problems while training DeBERTa v2 with TensorFlow: 1. `TFDebertaV2StableDropout` doesn't work at training time (actually,...

gante

Update philosophy to include other preprocessing classes

1

This PR removes the emphasis on NLP and focuses more on `transformers` being designed for all modalities.

stevhliu

Documentation

Adds CLIP to models exportable with ONNX

19

This isn't currently working, getting an error while validating the model - ``` onnxruntime.capi.onnxruntime_pybind11_state.InvalidArgument: [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : Unexpected input data type. Actual: (tensor(float)) , expected: (tensor(int64)) ```...

unography

[FLAX] Add dtype to embedding for gpt2 model

3

# What does this PR do? Add dtype to embedding for gpt2 model ## Who can review? @patrickvonplaten, @LysandreJik

merrymercy

Fine tuning TensorFlow DeBERTa fails on TPU

9

### System Info Latest version of transformers, Colab TPU, tensorflow 2. - Colab TPU - transformers: 4.21.0 - tensorflow: 2.8.2 / 2.6.2 - Python 3.7 ### Who can help? @LysandreJik,...

tmoroder

bug

TF Examples Rewrite

2

This PR is a rewrite of the TF examples, including several modern methods. I'm focusing on updating everything to use modern methods like `prepare_tf_dataset` and the `evaluate` library as well...

Rocketknight1

Use commit hash to look in cache instead of calling head

1

# What does this PR do? This PR tries to limit the calls to requests.head made for cached models every time we try to load them. Currently on the main...

sgugger

transformers
transformers copied to clipboard

Metadata

onnx run error at translation model

PEGASUS-X

time series forecasting model

TF: XLA-trainable DeBERTa v2

Update philosophy to include other preprocessing classes

Adds CLIP to models exportable with ONNX

[FLAX] Add dtype to embedding for gpt2 model

Fine tuning TensorFlow DeBERTa fails on TPU

TF Examples Rewrite

Use commit hash to look in cache instead of calling head

← Metadata

Owner

Metadata

transformers transformers copied to clipboard

Metadata

← Metadata

Owner

Metadata

transformers
transformers copied to clipboard