Queen

Results 2 comments of Queen

Here! And weight_dtype: float16. thank you very much.

it is RTX 4060,VRAM:8G,RAM:16GB. The memory usage during runtime has reached its highest level, while the graphics memory usage is not high. Can my configuration run your code?