Dudi Lester
Results
3
issues of
Dudi Lester
* Done to allow quantization using HQT * Added use_flash_attention, flash_attention_causal_mask and flash_attention_recompute to run_lm_eval * Enforce recompute flag on fsdpa quantization
synapse 1.16_dependency
Added use_flash_attention, flash_attention_causal_mask and flash_attention_recompute to run_lm_eval Enforce recompute flag on fsdpa quantization Allow quantization using HQT Document FusedScaledDotProductAttention quantization
synapse 1.16_dependency
update quantization configuration files
synapse 1.17_dependency