Raushan Turganbay issues

Results 35 issues of


                                            Raushan Turganbay

StoppingCriteria tracks elements separately in the batch

# What does this PR do? As was pointed out in #28932 , StoppingCriteria needs to stop generation per batch element and return a boolean tensor of `batch_size`. This PR...

Add Watermarking LogitsProcessor and WatermarkDetector

# What does this PR do? Adds a watermarking technique proposed in this [paper](https://arxiv.org/abs/2301.10226) to `transformers` logits processor. I added only the simple method (algorithm 2 from paper) and the...

Pass device in Logits Processor's init

# What does this PR do? This PR adds the ability to pass in device when initializing `LogitsProcessors` and is one more step towards `compile` compatibility. ## Who can review?...

Make LogitsProcessor compatible with torch.compile

# What does this PR do? Small part of the issue #28981 . This PR makes sure that Logits Processor and Stopping Criteria are compatible with `torch.compile` when `fullgraph=True`. The...

Phi: static cache & compile compatibility

# What does this PR do? This PR enables compile for Phi models. Checked the correctness by running speed benchmark script (the results is below) and a test for dynamic...

Quantized KV Cache

# What does this PR do? An implementation of quantized cache with `quanto` library. Introduces a new `CacheConfig` to store cache related arguments and a new cache class `QuantoQuantizedCache`. The...

Fix cache type in Idefics2

# What does this PR do? Following #30716 this PR makes the return type of Idefics2 models' cache follow the same way, by returning the type we had in the...

Add Video Llava

# What does this PR do? Fixes #29640 . Add a new model, Video Llava to the library. This is a draft PR, will add more here later. The model...

New model

Video-LLaVa now available in the Transformers library!

Hey! Video-LLaVa is now available in the Transformers library! Feel free to check it out [here](https://huggingface.co/LanguageBind/Video-LLaVA-7B-hf). Thanks to @LinB203 for helping to ship the model 🤗 To get the model,...

Calculate position ids in modeling utils for all generative models

# What does this PR do? As it was discussed under [this PR](https://github.com/huggingface/transformers/pull/29583), position ids in some models are not calculated/inferred from attn mask in `forward`, which gives incorrect positions...