transformers
transformers copied to clipboard
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
# What does this PR do? This PR adds a classes to use [sudachi](https://github.com/WorksApplications/SudachiPy) and [jumanpp](https://github.com/ku-nlp/pyknp) with BertJapaneseTokenizer. As a background, there are traditionally multiple tokenizers in Japanese language processing,...
A continuation of #19767 to add existing official resources (blog posts, notebooks, scripts, etc.) directly to the model docs for 20 of the most popular architectures based on last month's...
# What does this PR do? **(No breaking change in this PR)** **(So far I only change `OneFormerConfig`, but I will update other config classes whose default `layer_norm_eps = 1e-5`)**...
# What does this PR do? This PR adds the TF `compute_transition_scores`, akin to PT's #21191. What seemingly started off as a simple task, ended up being a complex task...
# What does this PR do? This PR adds a script that aggregates model architectures compatible with a task illustrated in a task guide and adds a list of links...
# What does this PR This PR updates the way we generation TF and FLAX to fix the breaking changes that we had. It also adds support for the timestamps...
### Model description @gante Generated HF TFwhisper model into whisper.tfllite model. However, I'm not sure how to evaluate the created whisper tflite model. https://colab.research.google.com/github/usefulsensors/openai-whisper/blob/main/tflite_from_huggingface_whisper.ipynb I would appreciate your assistance in...
# What does this PR do? Adds the model from issue Fixes # (https://github.com/huggingface/transformers/issues/20737) ## Before submitting - [ ] This PR fixes a typo or improves the docs (you...
# What does this PR do? Adding Time Series Informer model https://arxiv.org/abs/2012.07436 Related issue: #20903 @kashif :)
Adds Flax whisper implementations, and adjusts flax generation utils to support it. @ydshieh @ArthurZucker See discussion in #19512