Gavin Li issues

Results: 13 issues of Gavin Li

Compression in reduce side combine

One of the scalability problems we saw is that, when processing huge data, we need a large number of reduce splits, which makes the memory overhead of shuffle writers become...

add block-wise scaled int8 quantization based on QuantizedLayout mechanism

## Add Block-wise INT8 Quantization

This PR adds deepseek-style block-wise INT8 quantization support to ComfyUI, enabling ~50% memory reduction with limited accuracy loss and improved performance on large layers.

###...

Core
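
To illustrate the general idea behind the block-wise scaled INT8 quantization named in the entry above, here is a minimal sketch in plain Python. It is not the PR's implementation and does not use ComfyUI's `QuantizedLayout` API. The function names, the symmetric max-abs scaling rule, and the block size of 4 are assumptions chosen for illustration only.

```python
from typing import List, Tuple


def quantize_blockwise(values: List[float], block_size: int = 4) -> Tuple[List[int], List[float]]:
    """Quantize a flat list of floats to int8 codes, one scale per block.

    Hypothetical helper: symmetric max-abs scaling is assumed here.
    """
    codes: List[int] = []
    scales: List[float] = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        max_abs = max(abs(v) for v in block)
        # Map the largest magnitude in this block to 127; use 1.0 for an all-zero block.
        scale = max_abs / 127.0 if max_abs > 0 else 1.0
        scales.append(scale)
        # Round to the nearest integer and clamp into the symmetric int8 range.
        codes.extend(max(-127, min(127, round(v / scale))) for v in block)
    return codes, scales


def dequantize_blockwise(codes: List[int], scales: List[float], block_size: int = 4) -> List[float]:
    """Reconstruct approximate floats by multiplying each code by its block's scale."""
    return [c * scales[i // block_size] for i, c in enumerate(codes)]


if __name__ == "__main__":
    data = [0.1, -0.2, 0.05, 0.4, 100.0, -50.0, 25.0, 0.0]
    q, s = quantize_blockwise(data)
    print(q, s)
    print(dequantize_blockwise(q, s))
```

Because each block carries its own scale, the small values in the first block keep their resolution even though the second block contains values hundreds of times larger. A single tensor-wide scale would round most of the first block to zero.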

add svdquant int4 quantization support based on QuantizedLayout

## Summary

This PR adds support for **SVDQuant INT4 quantization** as a new `QuantizedLayout` in ComfyUI, enabling faster inference and reduced VRAM usage.

## Key Changes

### 1. New Quantization...

Core