AMDMIGraphX icon indicating copy to clipboard operation
AMDMIGraphX copied to clipboard

Uint8 and Int8 quantization handling as a compiler pass

Open CharlieL7 opened this issue 5 months ago • 0 comments

  • Currently we only handle int8 mixed precision, uint8 activations and int8 weights, in GEMMS in the parser.
  • There needs to be an update that adds a pass or updates simplify_qdq to do the same for convolutions and handle all the cases.
  • Got a test model from CK that sees this problem: https://drive.google.com/file/d/12vygauV8sdP19ooBTx7BnxurY3iPVQ1l/view?usp=drive_link

CharlieL7 avatar Aug 13 '25 19:08 CharlieL7