Robert Knight comments

Results 727 comments of


                                            Robert Knight

Align ReduceMin / ReduceMax etc. handling of empty tensors with spec

The spec-mandated behavior here of returning the minimum or maximum value for the type does have an issue that it can mask errors, less so with floats where the min/max...

@property isn't supported in shadow roots

For reference https://developer.chrome.com/docs/css-ui/css-names discusses how `@property` is _supposed_ to work with Shadow DOM and how it actually behaves. https://github.com/w3c/csswg-drafts/issues/10541 is a specification issue concerning `@property` and Shadow DOM.

MatMulNBits support for 4-bit quantization

Some notes about 4-bit quantization via standard operators only: ONNX Runtime will [fuse DequantizeLinear + MatMul](https://github.com/microsoft/onnxruntime/blob/0463aa9fc3ef02d30d7177c0065cd4b7d36a39f7/onnxruntime/core/optimizer/qdq_transformer/selectors_actions/qdq_actions.cc#L282) into MatMulNBits. This makes it possible to create and distribute int4-quantized models using standard...

Robert Knight

Align ReduceMin / ReduceMax etc. handling of empty tensors with spec

@property isn't supported in shadow roots

MatMulNBits support for 4-bit quantization

Feature request: Add confidence scores to library output(s)

Remove internal operator implementation functions from public API

Expose mask to integer conversions in rten-simd

Expose mask to integer conversions in rten-simd

Expose mask to integer conversions in rten-simd

Expose mask to integer conversions in rten-simd

`Model::total_params` undercounts parameters when model uses 4-bit weights