tritant
Same here: dev NF4 v2 + LoRA = generation time x10 (4060 Ti 16 GB).
Without the LoRA, generation takes 50 seconds. With an fp8 version it works well. Not tested with GGUF.
With GGUF it works well.
And is there a solution for using Flux Fill with inpainting?
For RTX 5000 compatibility, install cu128: https://github.com/lllyasviel/stable-diffusion-webui-forge/discussions/2608
Same here
No fix for that? It's really very restrictive.