Colby Banbury
Colby Banbury
Similar to [common voice](https://github.com/tensorflow/datasets/blob/master/tensorflow_datasets/audio/commonvoice.py) or [speech commands](https://github.com/tensorflow/datasets/blob/master/tensorflow_datasets/audio/speech_commands.py#L12)
This looks like a great addition! My suggestion is that we add this change to the next version of the benchmark and keep v0.5 frozen for posterity. We should be...
Good question! MS-COCO does not publish the labels (aka annotations) for the test set and holds competitions oriented around the test set. This means that Visual Wake Words does not...
The V4 paper results use an iPhone 13 and fp16 quantization. The model was also derived from a Pytorch equivalent in order to be in (batch, channel, height, width) tensor...
The PyTorch model has yet to be officially released. Sorry for the delay! The TensorFlow model should still get similar latency results, but let me know if I can help...
I used MLPackage
Those are the functions that add the required operations to the TFLite Micro interpreter. Without them, I don't think the model will be able to successfully execute because it wouldn't...
Looking at the model (below) it seems the int8 model we provide doesn't need the quantize/dequantize operations so you should be able to remove those. What do you mean by...
If you are using the int8 model we provide then yet I think that should work. Let me know how it goes!