Poedator comments

Results 12 comments of


Poedator

ValueError: dependence_plot error for categorical variables with matplotlib >= 3.5.0

It looks like this issue was fixed [by this commit](https://github.com/slundberg/shap/commit/8065cf7f9239574eed1c7d0ab3b40d538d16608d) on Mar 22, 2022. It is not yet part of release (as of 0.40.0). If necessary - just repeat this...

CUDA out of memory falcon-40b when using 40Gi A100 GPU

Hello, @caleb-artifact, and thank you for interest to SpQR quantization! Most likely you encountered excessive memory usage error that was fixed by now. I just re-tested it today. With PR...

Which dataset should I use?

Hello @ccccj , if you are focused on the best performance in some specific domain (presumably this is the reason for having your own dataset) - then you may get...

Llama: fix custom 4D masks, v2

As s a solution, I added additional `expected_shapes` to `_ignore_causal_mask_sdpa()` and improved StaticCache detection code. Note: it is inconvenient to have StaticCache as layer.self_attn objects and other Caches as model-level...

Llama: fix custom 4D masks, v2

all CI tests are green, SLOW tests were OK on my side yesterday

Llama: fix custom 4D masks, v2

I noticed that mistral model support for 4D masks stayed broken after these fixes. So I added similar lines to `src/transformers/modeling_attn_mask_utils.py::_prepare_4d_causal_attention_mask_for_sdpa()`

Llama: fix custom 4D masks, v2

I added `Mask4DTestHard` tests (without static cache part) to `tests/models/mistral/test_modeling_mistral.py` to ensure that the 4d masks keep working in the models that use `_prepare_4d_causal_attention_mask_for_sdpa()`. These new tests would fail without...