Colby Banbury comments

Results: 9 comments of Colby Banbury

TFDS api

Similar to [common voice](https://github.com/tensorflow/datasets/blob/master/tensorflow_datasets/audio/commonvoice.py) or [speech commands](https://github.com/tensorflow/datasets/blob/master/tensorflow_datasets/audio/speech_commands.py#L12)

[API] Enable access to data buffer

This looks like a great addition! My suggestion is that we add this change to the next version of the benchmark and keep v0.5 frozen for posterity. We should be...

Where is the Visual Wake Word test set?

Good question! MS-COCO does not publish the labels (aka annotations) for the test set and holds competitions oriented around the test set. This means that Visual Wake Words does not...

Check Mobilenet V4 Large on iPhones

The V4 paper results use an iPhone 13 and fp16 quantization. The model was also derived from a PyTorch equivalent in order to be in (batch, channel, height, width) tensor...
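
To make the layout point concrete, here is a minimal pure-Python sketch of how the same element is addressed in (batch, channel, height, width) order versus (batch, height, width, channel) order, the default in TensorFlow. The function names `nchw_offset`, `nhwc_offset`, and `nchw_to_nhwc` are hypothetical helpers for illustration only and are not part of any MobileNet V4 release or conversion tool.

```python
def nchw_offset(n, c, h, w, C, H, W):
    """Flat index of element (n, c, h, w) in a row-major NCHW buffer."""
    return ((n * C + c) * H + h) * W + w


def nhwc_offset(n, c, h, w, C, H, W):
    """Flat index of the same element in a row-major NHWC buffer."""
    return ((n * H + h) * W + w) * C + c


def nchw_to_nhwc(flat, N, C, H, W):
    """Reorder a flat NCHW buffer into NHWC order (illustrative, not optimized)."""
    out = [None] * len(flat)
    for n in range(N):
        for c in range(C):
            for h in range(H):
                for w in range(W):
                    src = nchw_offset(n, c, h, w, C, H, W)
                    dst = nhwc_offset(n, c, h, w, C, H, W)
                    out[dst] = flat[src]
    return out


# One image, two channels, one row, three columns.
# Channel 0 holds 0, 1, 2 and channel 1 holds 10, 11, 12.
nchw = [0, 1, 2, 10, 11, 12]
nhwc = nchw_to_nhwc(nchw, N=1, C=2, H=1, W=3)
print(nhwc)  # channels are now interleaved per pixel
```

In NCHW each channel is stored contiguously, while in NHWC the channel values for one pixel sit next to each other, so a model exported in one layout can behave differently on a given runtime than the same model in the other.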

Check Mobilenet V4 Large on iPhones

The PyTorch model has yet to be officially released. Sorry for the delay! The TensorFlow model should still get similar latency results, but let me know if I can help...

Check Mobilenet V4 Large on iPhones

I used MLPackage

addQuantize and addDequantize in th_final_initialize of anamoly detection submitter_implemented.cpp

Those are the functions that add the required operations to the TFLite Micro interpreter. Without them, I don't think the model will be able to successfully execute because it wouldn't...

addQuantize and addDequantize in th_final_initialize of anamoly detection submitter_implemented.cpp

Looking at the model (below), it seems the int8 model we provide doesn't need the quantize/dequantize operations, so you should be able to remove those. What do you mean by...
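
For context, here is a rough sketch of the arithmetic that quantize and dequantize operations perform under the standard int8 affine scheme, where real_value = (int8_value - zero_point) * scale. The `scale` and `zero_point` values below are made up for illustration and are not taken from the benchmark model, and the helper names are hypothetical.

```python
INT8_MIN, INT8_MAX = -128, 127


def quantize(x, scale, zero_point):
    """Map a float to int8: q = round(x / scale) + zero_point, then clamp.

    Note: Python's round() rounds ties to even, which can differ from a
    runtime's rounding mode on exact .5 ties; this is only a sketch.
    """
    q = round(x / scale) + zero_point
    return max(INT8_MIN, min(INT8_MAX, q))


def dequantize(q, scale, zero_point):
    """Map an int8 value back to a float: x = (q - zero_point) * scale."""
    return (q - zero_point) * scale


# Assumed example parameters (not from the provided model).
scale, zero_point = 0.5, -3

q = quantize(1.0, scale, zero_point)
x = dequantize(q, scale, zero_point)
print(q, x)
```

If a model's input and output tensors are already int8, the application can do this conversion itself (or feed int8 data directly), which is why the graph does not need these operations registered.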

addQuantize and addDequantize in th_final_initialize of anamoly detection submitter_implemented.cpp

If you are using the int8 model we provide, then yes, I think that should work. Let me know how it goes!