mlc-llm
                                
                                
                                
                                    mlc-llm copied to clipboard
                            
                            
                            
                        [Question] Difference between the quantization methods of other LLM engines.
❓ General Questions
I am curious if there is a difference between the quantization methods, such as q4f16_0 and q4f32_0 of this engine, and the q4_0 quantization of other LLM engines. If there is a difference, what is it?