Dudi Lester

3 issues by Dudi Lester

* Done to allow quantization using HQT
* Added use_flash_attention, flash_attention_causal_mask and flash_attention_recompute to run_lm_eval
* Enforce recompute flag on fsdpa quantization

synapse 1.16_dependency

* Added use_flash_attention, flash_attention_causal_mask and flash_attention_recompute to run_lm_eval
* Enforce recompute flag on fsdpa quantization
* Allow quantization using HQT
* Document FusedScaledDotProductAttention quantization

synapse 1.16_dependency

Update quantization configuration files

synapse 1.17_dependency