
Bump transformers from 4.39.3 to 4.40.1

Open • dependabot[bot] opened this issue 1 year ago • 1 comment

Bumps transformers from 4.39.3 to 4.40.1.
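
For context, the change itself is a one-line version pin in the project's Python manifest. A minimal sketch of what the bump looks like in a setup.py (the manifest scanned below); the exact specifier and surrounding fields in airunner are assumptions here:

```python
# setup.py (hypothetical excerpt): the dependency pin before and after this PR
from setuptools import setup

setup(
    name="airunner",
    install_requires=[
        # "transformers==4.39.3",  # old pin
        "transformers==4.40.1",    # new pin proposed by this PR
    ],
)
```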

Release notes

Sourced from transformers' releases.

v4.40.1: fix EosTokenCriteria for Llama3 on mps

Kudos to @pcuenca for the prompt fix in:

  • Make EosTokenCriteria compatible with mps #30376

This supports EosTokenCriteria on MPS until PyTorch adds the underlying functionality natively.
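
In practice the criterion is applied inside generate() rather than called directly. A minimal sketch of Llama 3 generation on Apple Silicon, where EosTokenCriteria decides when to stop; the checkpoint id and float16 settings are assumptions, not part of this release note:

```python
# Sketch: generation on the "mps" backend, where generate() applies
# EosTokenCriteria internally to stop at the end-of-sequence token.
# Before 4.40.1 this criterion could fail on MPS.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # assumed, gated checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16).to("mps")

inputs = tokenizer("Write a haiku about dependency updates.", return_tensors="pt").to("mps")
# eos_token_id is the token EosTokenCriteria watches for
out = model.generate(**inputs, max_new_tokens=48, eos_token_id=tokenizer.eos_token_id)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```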

v4.40.0: Llama 3, Idefics 2, Recurrent Gemma, Jamba, DBRX, OLMo, Qwen2MoE, Grounding Dino

New model additions

Llama 3

Llama 3 is supported in this release through the Llama 2 architecture and some fixes in the tokenizers library.
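
Since no new modeling code is involved, a quick way to confirm this (the checkpoint id is an assumption; the repo is gated and needs access):

```python
# Sketch: Llama 3 reuses the existing "llama" architecture, so AutoConfig
# resolves the checkpoint without any new modeling code.
from transformers import AutoConfig

cfg = AutoConfig.from_pretrained("meta-llama/Meta-Llama-3-8B")  # assumed, gated checkpoint
print(cfg.model_type)     # "llama" -- same family as Llama 2
print(cfg.architectures)  # ["LlamaForCausalLM"]
```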

Idefics2

The Idefics2 model was created by the Hugging Face M4 team and authored by Léo Tronchon, Hugo Laurencon, and Victor Sanh. The accompanying blog post can be found here.

Idefics2 is an open multimodal model that accepts arbitrary sequences of image and text inputs and produces text outputs. The model can answer questions about images, describe visual content, create stories grounded in multiple images, or simply behave as a pure language model without visual inputs. It improves upon IDEFICS-1, notably on document understanding, OCR, and visual reasoning. Idefics2 is lightweight (8 billion parameters) and treats images in their native aspect ratio and resolution, which allows inference efficiency to vary with the input resolution.
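
A sketch of image-plus-text inference with the released 8B checkpoint; the image URL is a placeholder, and the prompt format assumes the processor's built-in chat template:

```python
# Sketch: single image + question answered by Idefics2 (checkpoint assumed).
import requests
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

model_id = "HuggingFaceM4/idefics2-8b"  # assumed checkpoint
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id)

image = Image.open(requests.get("https://example.com/cat.png", stream=True).raw)  # placeholder URL
# The chat template interleaves image and text turns into a single prompt.
messages = [{"role": "user",
             "content": [{"type": "image"},
                         {"type": "text", "text": "Describe this image."}]}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=64)
print(processor.batch_decode(out, skip_special_tokens=True)[0])
```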

Recurrent Gemma

[Figure: Recurrent Gemma architecture, taken from the original paper.]

The Recurrent Gemma model was proposed in RecurrentGemma: Moving Past Transformers for Efficient Open Language Models by the Griffin, RLHF and Gemma Teams of Google.

The abstract from the paper is the following:

We introduce RecurrentGemma, an open language model which uses Google’s novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on language. It has a fixed-sized state, which reduces memory use and enables efficient inference on long sequences. We provide a pre-trained model with 2B non-embedding parameters, and an instruction tuned variant. Both models achieve comparable performance to Gemma-2B despite being trained on fewer tokens.
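
The released checkpoints plug into the standard causal-LM API; a minimal sketch, assuming the 2B base checkpoint id from the release:

```python
# Sketch: RecurrentGemma behaves like any causal LM from the outside; its
# fixed-size recurrent state means memory does not grow with sequence length.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/recurrentgemma-2b"  # assumed; an instruction-tuned variant also exists
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("The Griffin architecture combines", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```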

Jamba

Jamba is a pretrained mixture-of-experts (MoE) generative text model with 12B active parameters and 52B total parameters across all experts. It supports a 256K context length and can fit up to 140K tokens on a single 80GB GPU.

As depicted in the diagram below, Jamba's architecture features a blocks-and-layers approach that allows it to integrate the Transformer and Mamba architectures. Each Jamba block contains either an attention or a Mamba layer, followed by a multi-layer perceptron (MLP), for an overall ratio of one Transformer layer out of every eight layers.

[Figure: Jamba blocks-and-layers architecture diagram.]

Jamba introduces the first HybridCache object, which allows it to natively support assisted generation, contrastive search, speculative decoding, beam search, and all of the awesome features from the generate API!
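
From the user's side, the hybrid stack and the HybridCache are transparent; a sketch, assuming the ai21labs/Jamba-v0.1 checkpoint and bfloat16 weights to fit a single large GPU:

```python
# Sketch: Jamba through the standard causal-LM API; generate() manages the
# new HybridCache (attention KV cache plus Mamba state) internally.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ai21labs/Jamba-v0.1"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"  # device_map needs accelerate
)

inputs = tokenizer("A hybrid of attention and state-space layers", return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```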

... (truncated)


Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

dependabot[bot] • Apr 29 '24 04:04

Dependency Review

✅ No vulnerabilities, license issues, or OpenSSF Scorecard issues found.

OpenSSF Scorecard

| Package | Version | Score |
| --- | --- | --- |
| pip/transformers | 4.40.1 | :green_circle: 5.8 |
| pip/transformers | 4.39.3 | :green_circle: 5.8 |

Details (identical for both versions):

| Check | Score | Reason |
| --- | --- | --- |
| Maintained | :green_circle: 10 | 30 commit(s) and 21 issue activity found in the last 90 days -- score normalized to 10 |
| Code-Review | :green_circle: 10 | all changesets reviewed |
| CII-Best-Practices | :warning: 0 | no effort to earn an OpenSSF best practices badge detected |
| License | :green_circle: 10 | license file detected |
| Branch-Protection | :warning: -1 | internal error: error during branchesHandler.setup: internal error: githubv4.Query: Resource not accessible by integration |
| Signed-Releases | :warning: -1 | no releases found |
| Security-Policy | :green_circle: 10 | security policy file detected |
| Dangerous-Workflow | :green_circle: 10 | no dangerous workflow patterns detected |
| Token-Permissions | :warning: 0 | detected GitHub workflow tokens with excessive permissions |
| Binary-Artifacts | :green_circle: 10 | no binaries found in the repo |
| Fuzzing | :warning: 0 | project is not fuzzed |
| Packaging | :green_circle: 10 | packaging workflow detected |
| SAST | :warning: 0 | SAST tool is not run on all commits -- score normalized to 0 |
| Vulnerabilities | :warning: 0 | 479 existing vulnerabilities detected |
| Pinned-Dependencies | :warning: 0 | dependency not pinned by hash detected -- score normalized to 0 |

Scanned Manifest Files

setup.py

github-actions[bot] • Apr 29 '24 04:04