backscrub icon indicating copy to clipboard operation
backscrub copied to clipboard

The MLKit selfie segmentation model should be finetuned for office chairs

Open mikaelhg opened this issue 3 years ago • 5 comments

The MLKit selfie segmentation model models/selfiesegmentation_mlkit-256x256-2021_01_19-v1215.f16.tflite is pretty good, but it could still be finetuned for office chairs.

The model card describes the dataset used to train the model as "1700 images, 100 images from each of 17 the geographical subregions ..."

mikaelhg avatar Apr 28 '21 11:04 mikaelhg

Tagging as an enhancement and help-wanted as I don't know if anyone currently involved can do this!

phlash avatar Apr 28 '21 14:04 phlash

Another idea for "automatic" tracking suggested by @martok the other day was recording some video of your background and automatically tagging everything above some noise threshold as foreground in order to gain training material. That way you could create the necessary amount of tagged images with as little effort as possible. And you even get the network to be quite sensitive only to one person, which could have the advantage that other people in the background could be filtered by such a net.

BenBE avatar Apr 28 '21 15:04 BenBE

Another idea I've been toying with has been approaching the problem from a genetic algorithm perspective, where you have a crawler that starts from the bottom left side of the screen, and moves across the bottom until it figures out where the human figure is, and then just algorithmically looks at the probably border pixels to draw an outline.

mikaelhg avatar Apr 28 '21 15:04 mikaelhg

The model card describes the dataset used to train the model as "1700 images, 100 images from each of 17 the geographical subregions ...

Actually, Google's model cards do not detail training data, but only some validation data. Their goal is to make us aware of potential bias regarding a population without disclosing the training data (AI should not discriminate).

Pandinosaurus avatar May 30 '21 22:05 Pandinosaurus

@BenBE @phlash side note: should we use the selfiesegmentation model as default for the next release?

floe avatar Aug 11 '21 18:08 floe