feat: use MobileNetV4 Core ML model with NCHW input

Open anhappdev opened this issue 1 year ago • 5 comments

The models can be downloaded from here: https://drive.google.com/drive/folders/1-Mloub0e41mkYs09i9tQZl95dTnGdZ6f?usp=sharing

anhappdev · Jul 02 '24 02:07

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

github-actions[bot] · Jul 02 '24 02:07

There are two issues:

  1. I cannot get a plausible accuracy, and I am not sure what is wrong with the code.
  2. There is no official way to know the format of the input data, or the benchmark_id, so the backend cannot tell when to trigger mlperf_backend_convert_inputs().

anhappdev · Jul 02 '24 03:07

@freedomtan to check the NHWC -> NCHW conversion.

freedomtan · Jul 16 '24 06:07

@anhappdev this NHWC -> NCHW conversion looks right to me. Maybe you can compare it with the result of a TFLite model that contains only a single transpose op. (Yes, it's possible. E.g., in TFLite label_image, I constructed a TFLite model with only a resize op to resize the input image to the size I needed: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/lite/examples/label_image/bitmap_helpers_impl.h#L30-L100)
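For reference, a minimal sketch of what an NHWC -> NCHW conversion does on a densely packed buffer. This is not the code from this PR: `NhwcToNchw` is a hypothetical helper name, and the sketch assumes a contiguous tensor with no padding or strides. Its output could serve as a baseline when comparing against a transpose-only model.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not part of mobile_app_open): copies a densely packed
// NHWC tensor into a newly allocated NCHW buffer of the same element type.
template <typename T>
std::vector<T> NhwcToNchw(const std::vector<T>& src, std::size_t n,
                          std::size_t h, std::size_t w, std::size_t c) {
  // The buffer must hold exactly n*h*w*c elements of type T.
  assert(src.size() == n * h * w * c);
  std::vector<T> dst(src.size());
  for (std::size_t in = 0; in < n; ++in) {
    for (std::size_t ih = 0; ih < h; ++ih) {
      for (std::size_t iw = 0; iw < w; ++iw) {
        for (std::size_t ic = 0; ic < c; ++ic) {
          // Source offset: channels are the fastest-varying dimension.
          const std::size_t nhwc = ((in * h + ih) * w + iw) * c + ic;
          // Destination offset: width is the fastest-varying dimension.
          const std::size_t nchw = ((in * c + ic) * h + ih) * w + iw;
          dst[nchw] = src[nhwc];
        }
      }
    }
  }
  return dst;
}
```

The element type is a template parameter on purpose: the index arithmetic is the same for `float` and `uint8_t`, but the element type must match what the buffer really contains, otherwise the element count and offsets no longer line up with the data.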

freedomtan · Jul 23 '24 04:07

There was an issue with a wrong data type, but I have it working now.

anhappdev · Aug 05 '24 06:08