Sylvain Gugger

Results 19 issues of Sylvain Gugger

# What does this PR do? This PR tries to limit the calls to requests.head made for cached models every time we try to load them. Currently on the main...

# What does this PR do? As was reported in #18537, when using `AutoConfig` inside the `AutoModel.from_pretrained` method, some kwargs are deleted and not passed to the `from_pretrained` method of...

In Transformers, we very often try to download files that do not exist: this is because each tokenizer has a list of optional files so we try to download them...

Not everyone wants to see progress bars when downloading models/datasets as tqdm can clog the logs pretty easily. That's why we have a mode in Transformers to deactivate tqdm when...

In Transformers, we have [this function](https://github.com/huggingface/transformers/blob/ab2006e3d6db88654526a4169e65d4bfc52da2e3/src/transformers/utils/hub.py#L119) that returns the list of cached models. It would be nice to rely on an util in this repository instead (not limited to models)...

When there is an error message with a git command (for isntance git push), the url is shown and that url contains the user token. Users that dutifully copy the...

# What does this PR do? This PR fixes big model inference for large T5 models. The problem is that T5 models have some weights kept in float32, which interferes...

# What does this PR do? This PR makes sure the LLaMA conversion script stays up to date with `save_pretrained` by having the checkpoint being loaded in an actual model...

# What does this PR do? While building the v2 of the test fetcher, I discovered (I mean the util discovered) that some imports in the source code are wrong....

# What does this PR do? In the upcoming version of Accelerate, `find_tied_parameters` returns a list of list instead of dictionary. While there is a hack in place to make...