Daniel Bourke comments

Results 107 comments of


                                            Daniel Bourke

Downloading images from Bing API only downloading first portion of string

thank you, going to test the changes and see what happens

Autolabelling: Setup a data collection pipeline (e.g. what happens when new data comes in?)

Potential autolabelling pipeline: - raw images downloaded (e.g. filtered images from large dataset, such as, LAION-COCO) - several rounds of zero-shot classification are run to further filter images - "edible_food"...

Autolabelling: Setup a data collection pipeline (e.g. what happens when new data comes in?)

See `openclip` for zero-shot classification: https://github.com/mlfoundations/open_clip Also see `clip-retrieval` for just embedding/searching a large existing dataset for images specific to a certain task: https://github.com/rom1504/clip-retrieval Can download a large number of...

Autolabelling: Setup a data collection pipeline (e.g. what happens when new data comes in?)

Much better to compute image embeddings + class embeddings up front. Then reuse over time where necessary. This could be setup via: - image gets given UUID - image embedding...

Autolabelling: Setup a data collection pipeline (e.g. what happens when new data comes in?)

See this resource for autolabelling object detection: https://github.com/facebookresearch/CutLER

Create `export.py` (or something similar) for exporting models from PyTorch -> CoreML

See this notebook for an example conversion: https://github.com/mrdbourke/nutrify/blob/main/foodvision/notebooks/06_export_model.ipynb **Note:** Prediction/evaluation can only take place on macOS with CoreML, should setup some code to make a prediction over ~100 images with...

Daniel Bourke

Downloading images from Bing API only downloading first portion of string

Autolabelling: Setup a data collection pipeline (e.g. what happens when new data comes in?)

Autolabelling: Setup a data collection pipeline (e.g. what happens when new data comes in?)

Autolabelling: Setup a data collection pipeline (e.g. what happens when new data comes in?)

Autolabelling: Setup a data collection pipeline (e.g. what happens when new data comes in?)

Create `export.py` (or something similar) for exporting models from PyTorch -> CoreML

Database update: make it so a new food gets added via FDC ID -> information pulled into database

Clean/remove duplicate images with `fastdup`

Clean/remove duplicate images with `fastdup`

Start breaking down `class_names` into more easily understandable foods