If you want to use `xformers` with a torch version for which the official PyPI source does not provide prebuilt binary wheels, you could download the...
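A minimal sketch of that workflow, assuming you fetch a wheel manually (the wheel filename below is hypothetical; substitute the one matching your Python, torch, and CUDA versions), or build from source against your installed torch:

```shell
# Hypothetical sketch: install a prebuilt xformers wheel downloaded manually
# (e.g. from the project's GitHub releases or the PyTorch wheel index),
# since PyPI may not carry one for your exact torch version.
# The filename is an assumption -- pick the wheel matching your setup.
pip3 install ./xformers-0.0.23-cp310-cp310-manylinux2014_x86_64.whl

# Alternatively, build xformers from source against the torch you already have
# (this is the install path the xformers README documents):
pip3 install -v -U git+https://github.com/facebookresearch/xformers.git@main#egg=xformers
```

Either way, the key point is that the wheel (or source build) must match the torch build it links against, otherwise the C extension will fail to load.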
And `stable-fast` has no binary dependency on `xformers`, so the failure to load the C extension must be caused by something else. In any case, `xformers` is just an optional requirement and...
@arnavdantuluri That would be great! I only have one GPU, so I haven't even considered tensor parallelism. And writing FX passes is more complicated than TorchScript, so I haven't...
@arnavdantuluri Aha, in fact, as you can see, we have a Discord server here: https://discord.gg/kQFvfzM4SJ
@jkrauss82 Sorry, FP8 kernels aren't implemented and I guess I lack the time to support them now.
@jkrauss82 I have created a new project that supports FP8 inference with diffusers. However, it has not been open-sourced yet. I hope it can be made public soon...
@Nucleon729 Try using the following command to install (note the `.` after `-e`, which points the editable install at the current directory): ```shell pip3 install -e . --no-build-isolation -v --no-use-pep517 --debug ```
> > Is it planned? > > Currently getting this error when trying to run ComfyUI in fp8 (flags `--fp8_e4m3fn-text-enc --fp8_e4m3fn-unet`): > > ``` > > RuntimeError: "addmm_cuda" not implemented...
This shouldn't happen. What's your script?
When I run `python3 examples/optimize_lcm_lora.py`, I still see a significant speedup. So I don't know what's wrong.