transformers icon indicating copy to clipboard operation
transformers copied to clipboard

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Results 2036 transformers issues
Sort by recently updated
recently updated
newest added

# What does this PR do? This PR adds the EfficientNet model family to HuggingFace transformers proposed in #15759 (PyTorch only for this PR). The implementation is based on that...

### System Info - `transformers` version: 4.21.0.dev0 - Platform: Linux-5.3.0-1017-x86_64-with-glibc2.27 - Python version: 3.9.13 - Huggingface_hub version: 0.8.1 - PyTorch version (GPU?): 1.12.0+cu102 (True) - Tensorflow version (GPU?): not installed...

bug

Hi @patrickvonplaten, I have one basic conceptual NLP question regarding the evaluation for NER. According to [run_ner.py](https://github.com/huggingface/transformers/blob/main/examples/pytorch/token-classification/run_ner.py), the ground truth label is truncated to max_seq_length during prediction. However, this means...

### Feature request Would it be possible to include a `timeout` attribute to the `TrainingArguments` dataclass, such as it will be used as an argument of the `torch.distributed.init_process_group` calls? Reference:...

### Feature request This is a rather minor/trivial feature request. Currently `id2label` is of type `Dict[int, str]` in `PretrainedConfig`. Since this map is used to map class ids to labels,...

### Feature request I think it would be awesome to be able to easily train a Tesla style HydraNet but using a transformer backbone. The model would take a model_id...

…el checkpointing separately in new state.best_metric_checkpoint variable # What does this PR do? Fixes # (issue) ## Before submitting - [x] This PR fixes a typo or improves the docs...

### Feature request `torch.nn.Linear,Conv2d...` will call `self.reset_parameters()` inside their `__init__`. I'd like to make `reset_parameters` be a no-op inside `no_init_weights` context manager. ### Motivation `no_init_weights` is used in `from_pretrained` to...

### System Info All. ### Who can help? @patrickvonplaten ### Reproduction See https://huggingface.co/docs/transformers/main/en/model_doc/longt5 ### Expected behavior In the above document, it said `Unlike the T5 model, LongT5 does not use...

bug

Hello community, I am having the same problem described to save and load a fine tuned model using transformers and tensorflow. I have used save_pretrained, save_weights and model.save with save_format=tf....