XNNPACK
Introduce flags for qb4 scale format in xnn_define_blockwise_quantized_tensor_value
This change extends the xnn_define_blockwise_quantized_tensor_value API to accept flags that select the block scale format. Only bf16 is currently supported; the flags exist so that other block scale formats (fp16 or fp32) can be added later without breaking API backward compatibility.
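As a rough illustration of the idea, the sketch below shows how a flags argument can select a scale format while rejecting formats that are not implemented yet. It is not XNNPACK code: every name prefixed with sketch_ or SKETCH_ is hypothetical, the flag values are made up, and treating bf16 as the zero (default) value is an assumption chosen so that callers passing flags == 0 keep their existing behavior.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical flag values for illustration only; these are NOT XNNPACK's
 * real constants. bf16 is assumed to be the zero value so that existing
 * callers passing flags == 0 are unaffected. */
#define SKETCH_FLAG_SCALE_FORMAT_BF16 0x0u
#define SKETCH_FLAG_SCALE_FORMAT_FP16 0x1u
#define SKETCH_FLAG_SCALE_FORMAT_FP32 0x2u
#define SKETCH_FLAG_SCALE_FORMAT_MASK 0x3u

enum sketch_status {
  sketch_status_success = 0,
  sketch_status_unsupported_parameter = 1
};

/* bf16 is the upper 16 bits of an IEEE-754 binary32 value, so widening it
 * is a left shift followed by a bit-for-bit copy into a float. */
static float sketch_bf16_to_float(uint16_t bits) {
  const uint32_t wide = (uint32_t)bits << 16;
  float out;
  memcpy(&out, &wide, sizeof out);
  return out;
}

/* Decodes `count` block scales into floats according to `flags`.
 * Only bf16 is handled; fp16 and fp32 are reserved by the flag layout and
 * rejected for now, so they can be added later without a signature change. */
static enum sketch_status sketch_decode_block_scales(
    const void* scales, size_t count, uint32_t flags, float* out) {
  assert(scales != NULL);
  assert(out != NULL);
  switch (flags & SKETCH_FLAG_SCALE_FORMAT_MASK) {
    case SKETCH_FLAG_SCALE_FORMAT_BF16: {
      const uint16_t* bf16_scales = (const uint16_t*)scales;
      for (size_t i = 0; i < count; i++) {
        out[i] = sketch_bf16_to_float(bf16_scales[i]);
      }
      return sketch_status_success;
    }
    default:
      /* fp16, fp32, or an unknown format: not supported yet. */
      return sketch_status_unsupported_parameter;
  }
}
```

Because the format is carried in the flags rather than in the pointer type or in a new parameter, supporting another format later would only add a case to the switch, and existing call sites would compile and behave as before.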